DeepSeek API Integration

Available Models

DeepSeek-V3 (deepseek-chat)

Generalist model with 671B parameters trained on 15 trillion tokens. Uses Mixture-of-Experts architecture with 128K context window.

• API Name: deepseek-chat
• 671B parameters, 37B activated per token
• $0.27 per million input tokens, $1.10 per million output tokens
• 128K context window

DeepSeek-R1 (deepseek-reasoner)

Advanced reasoning model designed for complex math, coding, and logical tasks. Competes directly with OpenAI's o1 model.

• API Name: deepseek-reasoner
• 671B parameters with reasoning capabilities
• Advanced reasoning for complex tasks
• 128K context window

DeepSeek-R1-0528

Updated R1 model with system prompts, JSON output, and function calling support for agentic AI use cases.

• API Name: deepseek-r1-0528
• System prompt support
• JSON output and function calling
• Released May 2025

DeepSeek-V3-0324

Improved version of V3 with enhanced performance. Updated weights for better quality and capabilities.

• API Name: deepseek-v3-0324
• Improved V3 weights
• Enhanced performance metrics
• Updated March 2025

⚠️ Important Notice

Information about models, pricing, and features may be outdated or incorrect. Always consult the official provider documentation for the most current and accurate data.

Key Features

Cutting-Edge Architecture

DeepSeek-V3 uses Mixture-of-Experts (MoE) with Multi-head Latent Attention (MLA) for efficient training and inference.

• 671B total parameters, 37B activated
• MLA and DeepSeekMoE architectures
• Only 2.788M H800 GPU hours for training
• Exceptional performance metrics

Performance Benchmarks

Leading performance across various benchmarks including MMLU (87.1%), BBH (87.5%), and mathematical reasoning tasks.

• MMLU: 87.1% accuracy
• BBH: 87.5% performance
• Outperforms open-source models
• Comparable to leading closed-source models

API Compatibility

OpenAI-compatible API format makes integration straightforward for developers familiar with OpenAI tools.

• OpenAI SDK compatibility
• Easy migration from other providers
• Standard REST API endpoints
• Comprehensive documentation

Open Source & Cost-Effective

Available under open-source license with significantly lower pricing compared to competitors like GPT-4o.

• Open-source model weights
• Cheaper than average pricing
• $0.48 per 1M tokens (blended 3:1)
• No vendor lock-in

Technical Specifications

• Secure API key storage with iOS Keychain
• Native iOS integration
• Support for streaming responses
• Automatic token counting
• Rate limiting and monitoring
• Error handling and retry logic