Model catalog
All models are accessible at https://api.ecohash.com/v1/... using your API key. Pass the model ID in the model field of your request.
note
The model list evolves. Run GET https://api.ecohash.com/platform/models for a JSON snapshot of what's currently deployed with live pricing.
Text LLMs
meta-llama/Llama-3.1-8B-Instruct
General-purpose chat model, fast and cheap.
- Context: 128K tokens
- Endpoint:
POST /v1/chat/completions - Regions: multi-region
- Autoscaling: yes
- Best for: everyday chat, summarization, extraction, simple reasoning
google/gemma-4-31b-it
Larger, higher-quality chat model. Slower and more expensive than Llama 3.1 8B.
- Context: 128K tokens
- Endpoint:
POST /v1/chat/completions - Regions: multi-region (large-VRAM GPU required)
- Autoscaling: yes
- Best for: reasoning-heavy tasks where you need more headroom than Llama 3.1 8B
Vision LLMs
Qwen/Qwen2.5-VL-7B-Instruct
Vision + language. Pass images as base64 or URLs in the chat messages.
- Context: 32K tokens
- Endpoint:
POST /v1/chat/completions - Regions: mv (bf16), tn (fp8), hk (fp8)
- Autoscaling: yes
- Best for: image captioning, OCR-like tasks, visual QA