Embedding Models
A comprehensive guide to selecting and using embedding models in your application
Overview of Embedding Models
Our application supports integration with several popular embedding model providers, offering you flexibility in choosing the model that best suits your needs. Here’s an overview of the available models:
OpenAI Models
text-embedding-3-small
OpenAI’s newer, more efficient embedding model. Good balance of performance and efficiency.
text-embedding-3-large
OpenAI’s large embedding model. Offers the best performance in the OpenAI lineup.
Credentials Setup
To use OpenAI models:
- Sign up for an account at OpenAI’s website.
- Follow their docs to get an API key.
- Use your OpenAI key in Onyx
Cohere Models
embed-english-v3.0
Cohere’s English embedding model. Good performance for English-language tasks.
embed-english-light-v3.0
Cohere’s lightweight English embedding model. Faster and more efficient for simpler tasks.
Credentials Setup
To use Cohere models:
- Create an account on Cohere’s platform.
- Follow their guide to gather an API key.
- Use your Cohere API key in Onyx.
Voyage Models
voyage-large-2-instruct
Voyage’s large embedding model. High performance with instruction fine-tuning.
voyage-light-2-instruct
Voyage’s lightweight embedding model. Good balance of performance and efficiency.
Credentials Setup
To use Voyage models:
- Sign up for an account at Voyage AI.
- Follow their guide to gather an API key.
- Use your Voyage AI API key in Onyx.
Vertex AI (Google) Model
gecko
Google’s Gecko embedding model. Powerful and efficient, but requires more setup.
Credentials Setup
To use the Gecko model from Vertex AI:
- Set up a Google Cloud Platform (GCP) account if you don’t have one.
- Create a new project in the Google Cloud Console.
- Enable the Vertex AI API for your project.
- Create a service account with the necessary permissions for Vertex AI.
- Generate a JSON key for the service account.
- Use the JSON key file in Onyx by uploading your file.
Choosing the Right Model
Consider these factors when selecting an embedding model:
- Task Complexity: More complex tasks may benefit from larger models like
text-embedding-3-large
orvoyage-large-2-instruct
. - Language Specificity: For English-specific tasks, consider Cohere’s models.
- Performance vs. Efficiency: Balance the trade-off between model performance and computational efficiency based on your needs.
- Integration Complexity: Consider the ease of setup, especially for models like Gecko that require more configuration.
Best Practices
- Experiment: Test different models with your typical data to compare performance.
- Monitor Performance: Keep track of model performance for your specific use cases.
- Stay Updated: Regularly check for updates or new model releases from providers.
- Security: Always use the secure interface provided by the application to input your API keys or upload credential files.
Remember to review each provider’s documentation for the most up-to-date information on model capabilities and integration details.