Skip to main content
Neosantara AI offers a curated selection of state-of-the-art models, ranging from our own Indonesian-optimized fine-tunes to the best open-source models from Meta, Google, Alibaba, and more.
Billing & Pricing: For a detailed explanation of how credits work, pricing mechanics (Pay-As-You-Go), and throughput tiers, please refer to the Token Credits & Pricing documentation.

Text, Reasoning & Coding Models

Below is the list of available text generation, reasoning, and coding models.
Model IDDescriptionContextPricing (Input/Output)
nusantara-baseFlagship. General purpose, vision-capable, balanced.64kRp 300 / 1,500
garda-beta-miniFastest. High efficiency, massive context.131kRp 2,500 / 10,499
archipelago-70bDeep Culture. Llama-3.3 70B fine-tuned for cultural nuance.24kRp 4,710 / 36,543
sea-lion-v4-27b-itRegional. Specialized for Southeast Asian languages.128kRp 350 / 560
deepseek-chat-v3.1Hybrid. 671B MoE model.164kRp 3,333 / 8,333
kimi-k2MoE. 1T params (MoE) optimized for agentic tasks.131kRp 16,642 / 49,929
kimi-k2:latestMoE. The latest standard Kimi K2 model.128kRp 16,642 / 49,929
kimi-k2:searchSearch. Kimi K2 with online search capability.128kRp 16,642 / 49,929
kimi-k2:researchResearch. Optimized for deep research tasks.128kRp 16,642 / 49,929
kimi-k2:mathMath. Optimized for complex mathematical logic.128kRp 16,642 / 49,929
kimi-k2:silentSilent. Direct answers without search process logs.128kRp 16,642 / 49,929
kimi-k1Legacy. The first generation Kimi model.128kRp 16,642 / 49,929
llama-3.3-nemotron-super-49b-v1.5Super. 49B model with exceptional reasoning.132kRp 4,560 / 31,991
glm-4.6Z.ai. SOTA coding and agent benchmarks.128kRp 823 / 2,823
glm-4.6-plusZ.ai. More powerful coding context.128kRp 823 / 2,832
grok-4.1-fast-non-reasoningMassive Context. xAI’s best tool-calling model.2M0.30/0.30 / 0.72 (USD)
llama-3.3-70b-instructStandard. Robust general intelligence.24kRp 4,560 / 31,991
llama-3.3-70b-turboTurbo. Optimized version of Llama 70B.8kRp 300 / 1,200
gemma-3-27b-itNew. Google’s latest open model.32kRp 200 / 1,500
gemma2-9b-itEfficient. High quality for its size.8kRp 200 / 1,000
llama-3.2-11bLight. Efficient vision-capable model.8kRp 100 / 400
granite-3-8b-instructEnterprise. IBM’s enterprise-grade model.128kRp 2,880 / 11,520
gpt-oss-20bLow Latency. Medium-sized open-weight model.131kRp 400 / 1,600
gemini-3-flashSpeed. Google’s most intelligent model built for speed.1M0.50/0.50 / 3.00 (USD)
qwen3-coderCoding. 480B MoE specialist.262kRp 3,333 / 8,333
grok-code-fastCoding. xAI’s latest coding model.256k0.20 / \1.50 (USD)
claude-4.5-sonnetGateway. Access Anthropic via Vercel.200k3.00 / \15.00 (USD)

Image & Vision Models

Models specialized for image generation and optical character recognition (OCR).
Model IDDescriptionPricing
neosantara-gen-2045Image. High-speed, high-quality generation.1024px: Rp 1,500
512px: Rp 800
256px: Rp 400
deepseek-ocrVision. High-accuracy text extraction.Rp 100 / image

Embeddings

Models for generating text embeddings, useful for search, clustering, and RAG applications.
Model IDDescriptionPricing
nusa-embedding-0001Embedding. Text embedding for search and RAG.Rp 100 / 1M
gemini-embedding-001Embedding. Gemini embedding model.Free

Pricing Note: ā€œRpā€ indicates Indonesian Rupiah. ā€œUSDā€ indicates US Dollars. Prices are per 1 Million tokens unless otherwise stated.

Ready to Build?

Start integrating these models into your application today.
Last modified on December 23, 2025