Embedding

Embeddings are vector representations of text or other data used in machine learning and language processing applications. An embedding model converts words, sentences, or documents into numerical vectors in a high-dimensional space where inputs with similar meaning lie close together, so that meaning can be compared numerically. Here are some common uses of embeddings:

Search converts each document and the query into vectors and ranks results by the similarity between the query vector and each document vector. The closer the vectors, the more relevant the result.
Clustering groups texts by running a clustering algorithm such as K-Means on their embedding vectors, so texts with similar vectors land in the same group.
Recommendation converts item descriptions into embedding vectors. When a user shows interest in an item, the system recommends other items whose vectors are similar.
Anomaly Detection converts sentences into vectors and flags as anomalies the vectors that lie far from the majority.
Diversity Measurement converts sentences into vectors and examines the distribution of distances between them to measure how diverse the sentences are.
Classification compares a sentence vector with the vectors of existing labels and assigns the sentence to the category whose vector is most similar.
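
Most of the uses above reduce to comparing vectors. The following is a minimal sketch of similarity-based ranking (search) and nearest-label classification, assuming cosine similarity as the measure; the page does not state which metric to use. The function names and the 3-dimensional vectors are hypothetical and made up for illustration. Real embeddings returned by the API are much longer.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen for this sketch: zero vectors match nothing
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, candidate_vecs):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in candidate_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def classify(sentence_vec, label_vecs):
    """Return the label whose vector is most similar to the sentence vector."""
    return rank_by_similarity(sentence_vec, label_vecs)[0][0]


# Toy vectors, invented for illustration only (not real model output).
docs = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.7, 0.7, 0.0],
    "doc_c": [0.0, 0.0, 1.0],
}
query = [0.9, 0.1, 0.0]

ranking = rank_by_similarity(query, docs)
print([name for name, _ in ranking])
```

In practice the vectors in `docs` and `query` would come from the embeddings endpoint rather than being written by hand. Recommendation follows the same pattern, with item descriptions in place of documents.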

The Embeddings endpoint uses the POST method: the request data is sent to the server for processing.

At this time, only the baai/bge-multilingual-gemma2 model can be used for Embeddings. The examples below show the endpoint and the request body:

Example Request (cURL)

curl https://dekallm.cloudeka.ai/v1/embeddings \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $API_KEY" \
  -d '{
    "input": "Your text string goes here",
    "model": "baai/bge-multilingual-gemma2"
  }'

Example Request (Python)

import requests

# URL endpoint
url = "https://dekallm.cloudeka.ai/v1/embeddings"

# Headers with content type and authorization
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer API_KEY"  # replace API_KEY with your API key
}

# JSON data to be sent
data = {
    "input": "Your text string goes here",
    "model": "baai/bge-multilingual-gemma2"
}

# Send the POST request
response = requests.post(url, headers=headers, json=data)

# Check whether the request was successful
if response.status_code == 200:
    print("Response:", response.json())
else:
    print("Failed to get embeddings. Status code:", response.status_code)
    print("Response:", response.text)

The following is an example of the response received.

Response

{
  "object": "list",
  "data": [
    {
      "object": "embedding",
      "embedding": [
        0.0023064255,
        -0.009327292,
        ... (remaining values omitted)
        -0.0028842222
      ],
      "index": 0
    }
  ],
  "model": "baai/bge-multilingual-gemma2",
  "usage": {
    "prompt_tokens": 8,
    "total_tokens": 8
  }
}
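
To use the result, the vector has to be pulled out of the `data` list. The following is a minimal sketch that assumes the response body has the shape shown above; `extract_embeddings` is a hypothetical helper, not part of any SDK, and the sample vector is truncated to three values for readability.

```python
def extract_embeddings(payload):
    """Return the embedding vectors from a response body, ordered by "index"."""
    items = sorted(payload["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


# Shortened sample in the same shape as the response above
# (the real vector contains many more values).
sample = {
    "object": "list",
    "data": [
        {
            "object": "embedding",
            "embedding": [0.0023064255, -0.009327292, -0.0028842222],
            "index": 0,
        }
    ],
    "usage": {"prompt_tokens": 8, "total_tokens": 8},
}

vectors = extract_embeddings(sample)
print(len(vectors), len(vectors[0]))
```

With the Python request example, the same helper could be called as `extract_embeddings(response.json())` after checking that the status code is 200.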

