Embedding Models
Embedding models convert raw text into numerical representations that can be used by machine learning models. Spice supports running embedding models locally or using remote services such as OpenAI or la Plateforme.
Embeddings are used for vector-based and similarity search, like document retrieval. For chat-based large language models, refer to Model Providers.
Spice supports various model sources and formats to provide embedding components:
| Name | Description | Status | ML Format(s) | LLM Format(s)* |
|---|---|---|---|---|
file | Local filesystem | Release Candidate | ONNX | GGUF, GGML, SafeTensor |
huggingface | Models hosted on HuggingFace | Release Candidate | ONNX | GGUF, GGML, SafeTensor |
openai | OpenAI (or compatible) LLM endpoint | Release Candidate | - | OpenAI-compatible HTTP endpoint |
azure | Azure OpenAI | Alpha | - | OpenAI-compatible HTTP endpoint |