Создание embeddings

POST

embeddings

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.cometapi.com/v1",
    api_key=os.environ["COMETAPI_KEY"],
)

response = client.embeddings.create(
    model="text-embedding-3-small",
    input="The food was delicious and the waiter was friendly.",
)

print(response.data[0].embedding[:5])  # First 5 dimensions
print(f"Dimensions: {len(response.data[0].embedding)}")

{
  "object": "list",
  "data": [
    {
      "object": "embedding",
      "index": 0,
      "embedding": [
        -0.0021,
        -0.0491,
        0.0209,
        0.0314,
        -0.0453
      ]
    }
  ],
  "model": "text-embedding-3-small",
  "usage": {
    "prompt_tokens": 2,
    "total_tokens": 2
  }
}

CometAPI поддерживает embedding-модели от нескольких провайдеров через единый endpoint. Передайте одну или несколько текстовых строк и получите числовые векторы для семантического поиска, кластеризации, классификации или retrieval-augmented generation (RAG). Список доступных embedding-моделей и цены см. в списке моделей.

Модели text-embedding-3-* поддерживают параметр dimensions, который уменьшает embedding-вектор без заметной потери точности. Это может сократить расходы на хранение, сохранив при этом большую часть семантической информации.

Чтобы создать embeddings для нескольких текстов в одном запросе, передайте массив строк в параметр input. Пакетный ввод значительно эффективнее, чем выполнение отдельных запросов.

Авторизации

Authorization

string

header

обязательно

Bearer token authentication. Use your CometAPI key.

Тело

application/json

model

string

обязательно

The embedding model to use. See the Models page for current embedding model IDs.

Пример:

"text-embedding-3-small"

input

обязательно

The text to embed. Can be a single string, an array of strings, or an array of token arrays. Each input must not exceed the model's maximum token limit (8,191 tokens for text-embedding-3-* models).

encoding_format

enum<string>

по умолчанию:float

The format of the returned embedding vectors. float returns an array of floating-point numbers. base64 returns a base64-encoded string representation, which can reduce response size for large batches.

Доступные опции:

float,

base64

dimensions

integer

The number of dimensions for the output embedding vector. Only supported by text-embedding-3-* models. Reducing dimensions can lower storage costs while maintaining most of the embedding's utility.

Требуемый диапазон: x >= 1

user

string

A unique identifier for your end-user, which can help monitor and detect abuse.

Ответ

200 - application/json

A list of embedding vectors for the input text(s).

object

enum<string>

The object type, always list.

Доступные опции:

list

Пример:

"list"

data

object[]

An array of embedding objects, one per input text. When multiple inputs are provided, results are returned in the same order as the input.

Show child attributes

model

string

The model used to generate the embeddings.

Пример:

"text-embedding-3-small"

usage

object

Token usage statistics for this request.

Show child attributes

Embeddings API

Создать moderation

Модерация контента

API-ключи

Авторизации

Тело

Ответ