Supported Language Models
Javelin is committed to integrating and supporting the most popular models in the industry. From the groundbreaking innovations of OpenAI and Azure OpenAI to the dynamic platforms like HuggingFace and Anthropic, Javelin ensures seamless interfacing with these models.
Our platform's adaptability allows users to leverage the unique strengths of each model, ensuring optimal results for diverse applications. Javelin Gateway harmoniously brings together the world of popular LLMs, simplifying and amplifying their capabilities for our users.
Supported Model Providers
We are always adding support for new models, supported models include those from:
Models | Base URL | Endpoints (text & embeddings) |
---|---|---|
Azure OpenAI | https://<your-resource-name>.openai.azure.com | - /openai/deployments/{deployment-name}/completions - /openai/deployments/{deployment-name}/chat/completions - /openai/deployments/{deployment-name}/embeddings |
OpenAI | https://api.openai.com/v1 | - /completions - /chat/completions - /embeddings |
Amazon Bedrock | https://bedrock.<region>.amazonaws.com | - /model/{modelId}/invoke - /model/{modelId}/invoke-with-response-stream - /model/{modelId}/converse - /model/{modelId}/converse-stream |
Google Gemini | https://generativelanguage.googleapis.com | /v1beta/openai/chat/completions |
HuggingFace | https://api-inference.huggingface.co | Create an endpoint following the instructions here https://huggingface.co/docs/inference-endpoints/guides/create_endpoint |
NVIDIA | NVIDIA AI Endpoints | |
Llama | ||
Anthropic | https://api.anthropic.com/v1 | |
Mistral | https://api.mistral.ai/v1 | |
Inflection | ||
Perplexity | ||
Cohere | ||
AnyScale | ||
TogetherAI | ||
and more... |
Fallback Behavior for Unknown Models
When encountering unknown model types or custom provider combinations, Javelin implements the following fallback behavior:
-
OpenAI Schema Compatibility: By default, Javelin will attempt to use OpenAI model specifications as a fallback for handling unknown provider/model combinations. This means:
- If your custom provider matches OpenAI's chat endpoint schema, Javelin's guardrails will continue to function
- The system expects request/response formats similar to OpenAI's chat completion endpoints
-
Guardrail Behavior:
- For compatible schemas: All configured guardrails and processors will function normally
- For incompatible schemas: Requests will be proxied directly to the provider without guardrail processing
-
Response Handling:
- Compatible formats: Full processing and guardrail application
- Incompatible formats: Direct proxy of provider responses without modification
This fallback mechanism ensures basic functionality while we work on expanding native support for more provider-specific schemas. For optimal guardrail functionality, we recommend using supported model providers or ensuring your custom implementation follows OpenAI-compatible schemas.
Note: Future updates will expand native support for diverse provider schemas and custom model specifications. We are actively tracking and working on enhanced model specification handling.