Model Overview
Terramind provides access to industry-leading AI models from multiple providers. All models are accessed through the same unified interface.Claude Models
Anthropic’s Claude models, known for excellent coding and reasoning capabilities.claude-sonnet-4-5
Latest and most capable Claude model- Best for: Complex coding tasks, detailed analysis, long conversations
- Context: Up to 200K tokens
- Strengths: Code generation, debugging, technical writing, complex reasoning
- Speed: Balanced (medium)
claude-sonnet-4
Previous generation Sonnet- Best for: General coding and analysis tasks
- Context: Up to 200K tokens
- Strengths: Code understanding, documentation, technical explanations
- Speed: Balanced (medium)
claude-opus-4-1
Most powerful Claude model- Best for: Extremely complex tasks requiring deep reasoning
- Context: Up to 200K tokens
- Strengths: Research, complex problem solving, academic writing
- Speed: Slower but more thorough
- Cost: Higher
claude-haiku-4-5
Fastest Claude model- Best for: Quick responses, simple tasks, high-volume requests
- Context: Up to 200K tokens
- Strengths: Speed, efficiency, cost-effectiveness
- Speed: Very fast
- Cost: Lower
claude-3-5-haiku
Previous generation Haiku- Best for: Fast responses with good quality
- Context: Up to 200K tokens
- Speed: Fast
- Cost: Lower
GPT Models
OpenAI’s GPT models, known for versatility and strong general capabilities.gpt-5
Latest GPT model- Best for: General tasks, creative writing, reasoning
- Context: Up to 128K tokens
- Strengths: Versatility, creativity, general knowledge
- Speed: Fast
gpt-5-codex
Code-specialized GPT- Best for: Code generation and understanding
- Context: Up to 128K tokens
- Strengths: Code completion, bug fixing, test generation
- Speed: Fast
- Specialization: Optimized for programming tasks
Chinese Models
Leading Chinese AI models with excellent multilingual capabilities.glm-4.6
Zhipu AI’s GLM model- Best for: Chinese language tasks, bilingual content
- Strengths: Chinese understanding, translation, cultural context
- Languages: Chinese, English, and more
kimi-k2
Moonshot AI’s Kimi model- Best for: Long context understanding
- Context: Very large context window
- Strengths: Long document analysis, Chinese language
qwen3-coder
Alibaba’s Qwen coding model- Best for: Code generation in Chinese/English
- Strengths: Programming, Chinese code comments
- Specialization: Coding tasks
Specialized Models
grok-code
xAI’s Grok code model- Best for: Code analysis and generation
- Strengths: Programming tasks, technical analysis
- Specialization: Software development
big-pickle
Specialized model for specific tasks- Best for: Domain-specific applications
- Use case: Custom enterprise needs
Model Comparison
This table helps you choose the right model for your use case.
| Model | Best For | Speed | Cost | Context Window |
|---|---|---|---|---|
claude-sonnet-4-5 | Complex coding, analysis | Medium | Medium | 200K |
claude-opus-4-1 | Deep reasoning | Slow | High | 200K |
claude-haiku-4-5 | Quick tasks | Fast | Low | 200K |
gpt-5 | General tasks | Fast | Medium | 128K |
gpt-5-codex | Code generation | Fast | Medium | 128K |
glm-4.6 | Chinese language | Medium | Medium | Large |
kimi-k2 | Long documents | Medium | Medium | Very Large |
qwen3-coder | Chinese coding | Fast | Medium | Large |
grok-code | Code analysis | Medium | Medium | Large |
Choosing the Right Model
For Coding Tasks
Complex refactoring or architecture:- Use
claude-sonnet-4-5orclaude-opus-4-1
- Use
claude-haiku-4-5orgpt-5-codex
- Use
gpt-5-codexorclaude-sonnet-4-5
For Chinese Language
Chinese content:- Use
glm-4.6for general tasks - Use
qwen3-coderfor coding - Use
kimi-k2for long documents
For Cost Optimization
High volume, simple tasks:- Use
claude-haiku-4-5
- Use
claude-sonnet-4-5orgpt-5
- Use
claude-opus-4-1
Model-Specific Features
Long Context
Models with large context windows are better for:- Analyzing entire codebases
- Processing long documents
- Maintaining conversation history
claude-sonnet-4-5(200K)claude-opus-4-1(200K)kimi-k2(very large)
Streaming
All models support streaming for real-time responses:Tool Calling
All models support tool/function calling:Pricing
Pricing varies by model. Contact Terramind for detailed pricing information. Generally:- Fast models (Haiku): Lower cost
- Balanced models (Sonnet, GPT-5): Medium cost
- Powerful models (Opus): Higher cost
