| .. | ||
| __pycache__ | ||
| __init__.py | ||
| bm25_tokenizer.py | ||
| README.md | ||
| registry.py | ||
| schema_utils.py | ||
Embedding Function Schemas
This directory contains JSON schemas for all embedding functions in Chroma. The purpose of having this schema is to support cross language compatibility, and to validate that changes in one client library do not accidentally diverge from others.
Schema Structure
Each schema follows the JSON Schema Draft-07 specification and includes:
version: The version of the schematitle: The title of the schemadescription: A description of the schemaproperties: The properties that can be configured for the embedding functionrequired: The properties that are required for the embedding functionadditionalProperties: Whether additional properties are allowed (always set tofalseto ensure strict validation)
Usage
The schemas can be used to validate the configuration of embedding functions using the validate_config function:
from chromadb.utils.embedding_functions.schemas import validate_config
# Validate a configuration
config = {
"api_key_env_var": "CHROMA_OPENAI_API_KEY",
"model_name": "text-embedding-ada-002"
}
validate_config(config, "openai")
Adding New Schemas
To add a new schema:
- Create a new JSON file in this directory with the name of the embedding function (e.g.,
new_function.json) - Define the schema following the JSON Schema Draft-07 specification
- Update the embedding function to use the schema for validation
Schema Versioning
Each schema includes a version number to support future changes to embedding function configurations. When making changes to a schema, increment the version number to ensure backward compatibility.