You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: pkg/component/ai/instillmodel/v0/README.mdx
+26-26Lines changed: 26 additions & 26 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -62,7 +62,7 @@ Input data.
62
62
63
63
| Field | Field ID | Type | Note |
64
64
| :--- | :--- | :--- | :--- |
65
-
|[Embeddings](#embedding-embeddings)|`embeddings`| array | List of input data to be embedded. |
65
+
|[Input Embeddings](#embedding-input-embeddings)|`embeddings`| array | List of input data to be embedded. |
66
66
| Model |`model`| string | The model to be used for generating embeddings. It should be `namespace/model-name/version`. i.e. `admin/dummy-model/latest`. You can see the version from the Versions tab of Model page. |
67
67
</div>
68
68
<h4id="embedding-parameter">Parameter</h4>
@@ -134,10 +134,10 @@ Input parameter.
134
134
135
135
| Field | Field ID | Type | Note |
136
136
| :--- | :--- | :--- | :--- |
137
-
|[Embeddings](#embedding-embeddings)|`embeddings`| array | List of generated embeddings. |
137
+
|[Output Embeddings](#embedding-output-embeddings)|`embeddings`| array | List of generated embeddings. |
@@ -316,15 +316,15 @@ This task generates natural language responses based on a given input prompt. It
316
316
| Input | ID | Type | Description |
317
317
| :--- | :--- | :--- | :--- |
318
318
| Task ID (required) |`task`| string |`TASK_COMPLETION`|
319
-
|[Data](#completion-data) (required) |`data`| object | Contains the model name and the prompt for generating a response. This section defines the input prompt and guides the model's behavior using the specified system message. |
319
+
|[Input Data](#completion-input-data) (required) |`data`| object | Contains the model name and the prompt for generating a response. This section defines the input prompt and guides the model's behavior using the specified system message. |
320
320
|[Parameter](#completion-parameter)|`parameter`| object | Optional settings to control the model’s behavior, including token limit, randomness, and response length. |
321
321
</div>
322
322
323
323
324
324
<details>
325
325
<summary> Input Objects in Completion</summary>
326
326
327
-
<h4id="completion-data">Data</h4>
327
+
<h4id="completion-input-data">Input Data</h4>
328
328
329
329
Contains the model name and the prompt for generating a response. This section defines the input prompt and guides the model's behavior using the specified system message.
330
330
@@ -359,14 +359,14 @@ Optional settings to control the model’s behavior, including token limit, rand
359
359
360
360
| Output | ID | Type | Description |
361
361
| :--- | :--- | :--- | :--- |
362
-
|[Data](#completion-data)|`data`| object | Contains the generated responses and details like why the model stopped and when the response was generated. |
362
+
|[Output Data](#completion-output-data)|`data`| object | Contains the generated responses and details like why the model stopped and when the response was generated. |
363
363
|[Metadata](#completion-metadata) (optional) |`metadata`| object | Provides information on how many tokens were used in both the input prompt and the generated response. |
@@ -457,14 +457,14 @@ This section defines optional parameters that modify the behavior of the image g
457
457
458
458
| Output | ID | Type | Description |
459
459
| :--- | :--- | :--- | :--- |
460
-
|[Data](#text-to-image-data)|`data`| object | The data section contains the core output of the image generation task. It includes all relevant information generated by the model, such as the list of created images and their associated metadata, providing a comprehensive result for each input prompt. |
460
+
|[Output Data](#text-to-image-output-data)|`data`| object | The data section contains the core output of the image generation task. It includes all relevant information generated by the model, such as the list of created images and their associated metadata, providing a comprehensive result for each input prompt. |
461
461
|[Metadata](#text-to-image-metadata) (optional) |`metadata`| object | Provides additional details about the image generation request, such as usage statistics and resource consumption. |
462
462
</div>
463
463
464
464
<details>
465
465
<summary> Output Objects in Text To Image</summary>
@@ -547,14 +547,14 @@ Specifies the classification model to be utilized for the task. This field must
547
547
548
548
| Output | ID | Type | Description |
549
549
| :--- | :--- | :--- | :--- |
550
-
|[Data](#classification-data)|`data`| object | Contains the results of the classification task, including the predicted category and its associated confidence score. This structured output allows users to understand the model's classification decision and assess its reliability based on the score provided. |
550
+
|[Output Data](#classification-output-data)|`data`| object | Contains the results of the classification task, including the predicted category and its associated confidence score. This structured output allows users to understand the model's classification decision and assess its reliability based on the score provided. |
551
551
|[Metadata](#classification-metadata) (optional) |`metadata`| object | Contains additional information about the request, including usage statistics. This data provides context and insights regarding the classification process, helping users evaluate performance and resource consumption during the classification task. |
552
552
</div>
553
553
554
554
<details>
555
555
<summary> Output Objects in Classification</summary>
@@ -583,7 +583,7 @@ This task focuses on identifying and localizing multiple objects within images,
583
583
| Input | ID | Type | Description |
584
584
| :--- | :--- | :--- | :--- |
585
585
| Task ID (required) |`task`| string |`TASK_DETECTION`|
586
-
|[Data](#detection-data) (required) |`data`| object | Contains the data required for the detection process. It can include either an image URL or a base64-encoded image representation for detection. |
586
+
|[Input Data](#detection-input-data) (required) |`data`| object | Contains the data required for the detection process. It can include either an image URL or a base64-encoded image representation for detection. |
587
587
| Parameter |`parameter`| object | An object representing any additional parameters for the detection task. This section is currently empty but can be extended for future needs. |
588
588
</div>
589
589
@@ -628,14 +628,14 @@ Specifies the detection model to be utilized for the task. This field must conta
628
628
629
629
| Output | ID | Type | Description |
630
630
| :--- | :--- | :--- | :--- |
631
-
|[Data](#detection-data)|`data`| object | Contains the results generated by the detection task, including a list of objects detected in the input image along with their details. |
631
+
|[Output Data](#detection-output-data)|`data`| object | Contains the results generated by the detection task, including a list of objects detected in the input image along with their details. |
632
632
|[Metadata](#detection-metadata) (optional) |`metadata`| object | Contains metadata related to the detection task's output, providing insights into the request's processing and any relevant statistics. |
@@ -686,7 +686,7 @@ Detects and localizes multiple keypoints within objects in images, providing pre
686
686
| Input | ID | Type | Description |
687
687
| :--- | :--- | :--- | :--- |
688
688
| Task ID (required) |`task`| string |`TASK_KEYPOINT`|
689
-
|[Data](#keypoint-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for keypoint detection. Each input type requires model selection for processing. |
689
+
|[Input Data](#keypoint-input-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for keypoint detection. Each input type requires model selection for processing. |
690
690
| Parameter |`parameter`| object | Optional parameters for the keypoint detection task, allowing for adjustments in processing. |
691
691
</div>
692
692
@@ -727,14 +727,14 @@ Detects and localizes multiple keypoints within objects in images, providing pre
727
727
728
728
| Output | ID | Type | Description |
729
729
| :--- | :--- | :--- | :--- |
730
-
|[Data](#keypoint-data)|`data`| object | Contains detected objects and associated keypoints for keypoint detection output. Each object includes its bounding box, keypoints, and confidence score. |
730
+
|[Output Data](#keypoint-output-data)|`data`| object | Contains detected objects and associated keypoints for keypoint detection output. Each object includes its bounding box, keypoints, and confidence score. |
731
731
|[Metadata](#keypoint-metadata) (optional) |`metadata`| object | Contains additional metadata for the keypoint detection task output, including request-specific usage information. |
@@ -796,7 +796,7 @@ Optical Character Recognition (OCR) is a process that detects and extracts text
796
796
| Input | ID | Type | Description |
797
797
| :--- | :--- | :--- | :--- |
798
798
| Task ID (required) |`task`| string |`TASK_OCR`|
799
-
|[Data](#ocr-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for OCR. Each input type requires model selection for processing. |
799
+
|[Input Data](#ocr-input-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for OCR. Each input type requires model selection for processing. |
800
800
| Parameter |`parameter`| object | The parameter field is an optional object that allows you to provide specific configurations for the OCR task. Although empty by default, it can be populated with parameters that fine-tune processing or control the model’s behavior. These configurations can help adjust the output according to specific OCR requirements or performance optimizations. |
801
801
</div>
802
802
@@ -837,14 +837,14 @@ Optical Character Recognition (OCR) is a process that detects and extracts text
837
837
838
838
| Output | ID | Type | Description |
839
839
| :--- | :--- | :--- | :--- |
840
-
|[Data](#ocr-data)|`data`| object | Contains the structured output data of the OCR task, presenting a list of detected objects, each represented by a bounding box, recognized text, and associated confidence scores. |
840
+
|[Output Data](#ocr-output-data)|`data`| object | Contains the structured output data of the OCR task, presenting a list of detected objects, each represented by a bounding box, recognized text, and associated confidence scores. |
841
841
|[Metadata](#ocr-metadata) (optional) |`metadata`| object | Contains additional information on the OCR task, including performance metrics and system usage statistics for the request. |
@@ -895,7 +895,7 @@ This task involves classifying each pixel in an image into specific, predefined
895
895
| Input | ID | Type | Description |
896
896
| :--- | :--- | :--- | :--- |
897
897
| Task ID (required) |`task`| string |`TASK_SEMANTIC_SEGMENTATION`|
898
-
|[Data](#semantic-segmentation-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for semantic segmentation. Each input type requires model selection for processing. |
898
+
|[Input Data](#semantic-segmentation-input-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for semantic segmentation. Each input type requires model selection for processing. |
899
899
| Parameter |`parameter`| object | This object contains configuration options that allow customization of the semantic segmentation task, such as adjusting thresholds or model settings. |
900
900
</div>
901
901
@@ -936,14 +936,14 @@ This task involves classifying each pixel in an image into specific, predefined
936
936
937
937
| Output | ID | Type | Description |
938
938
| :--- | :--- | :--- | :--- |
939
-
|[Data](#semantic-segmentation-data)|`data`| object | Contains the output data of the semantic segmentation task, detailing the segmented areas through RLE masks. Each segment corresponds to a specific category, providing essential data for analyzing the results. |
939
+
|[Output Data](#semantic-segmentation-output-data)|`data`| object | Contains the output data of the semantic segmentation task, detailing the segmented areas through RLE masks. Each segment corresponds to a specific category, providing essential data for analyzing the results. |
940
940
|[Metadata](#semantic-segmentation-metadata) (optional) |`metadata`| object | Contains additional information about the output, including statistics regarding the usage of the segmentation request. It helps track the performance and efficiency of the task. |
941
941
</div>
942
942
943
943
<details>
944
944
<summary> Output Objects in Semantic Segmentation</summary>
@@ -981,7 +981,7 @@ This task is a computer vision task that identifies and distinguishes multiple o
981
981
| Input | ID | Type | Description |
982
982
| :--- | :--- | :--- | :--- |
983
983
| Task ID (required) |`task`| string |`TASK_INSTANCE_SEGMENTATION`|
984
-
|[Data](#instance-segmentation-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for instance segmentation. Each input type requires model selection for processing. |
984
+
|[Input Data](#instance-segmentation-input-data) (required) |`data`| object | Contains input data options, allowing for an image URL or a base64-encoded image file for instance segmentation. Each input type requires model selection for processing. |
985
985
| Parameter |`parameter`| object | Placeholder for additional input parameters, providing flexibility for task customization. |
986
986
</div>
987
987
@@ -1022,14 +1022,14 @@ This task is a computer vision task that identifies and distinguishes multiple o
1022
1022
1023
1023
| Output | ID | Type | Description |
1024
1024
| :--- | :--- | :--- | :--- |
1025
-
|[Data](#instance-segmentation-data)|`data`| object | Contains segmented instance data, including each detected object and its corresponding segmentation information. |
1025
+
|[Output Data](#instance-segmentation-output-data)|`data`| object | Contains segmented instance data, including each detected object and its corresponding segmentation information. |
1026
1026
|[Metadata](#instance-segmentation-metadata) (optional) |`metadata`| object | Metadata accompanying the output, providing additional context like usage statistics for the segmentation request. |
1027
1027
</div>
1028
1028
1029
1029
<details>
1030
1030
<summary> Output Objects in Instance Segmentation</summary>
0 commit comments