SN

SambaNova

online

RDU-powered inference. Llama 405B for free.

LLMest. 2017 · Palo Alto, CA
82
/ 100 APIVault Score

// At a glance

Free Tier
600 req/min · free · no card
Category
LLM
Credit Card
Not required
Last Verified
2m ago

// Free tier details

Available Models

Llama 3.3 70BLlama 3.1 405BDeepSeek-R1

Rate Limit

600 requests/minute on smaller models

No credit card needed
No phone verification

// Quick start

300">"text-purple-400">from openai 300">"text-purple-400">import OpenAI

client = OpenAI(
    api_key=300">"YOUR_SAMBANOVA_KEY",
    base_url=300">"https://api.sambanova.ai/v1",
)

response = client.chat.completions.create(
    model=300">"Meta-Llama-3.3-70B-Instruct",
    messages=[{300">"role": 300">"user", 300">"content": 300">"Hello."}],
)

print(response.choices[0].message.content)

// Overview

SambaNova's RDU (Reconfigurable Dataflow Unit) runs Llama 3.3 70B and 405B at exceptional speeds. Free Cloud tier with a simple signup.

// Pros

  • Llama 405B free access
  • High rate limits
  • OpenAI-compatible

// Cons

  • Smaller ecosystem
  • Fewer docs than big players

// Score breakdown

Reliability (35%) (from 2m ago health check)100/100
Free Tier Generosity (30%) (computed from quota, no-CC, no-phone fields)60/100
Documentation (20%) (human rating)86/100
Popularity (15%) (GitHub stars (log-normalised), or manual baseline)80/100

Methodology: apivault.dev/methodology

// Best for

Large model inferenceReasoning tasksResearch

// Recent changes

Apr 10, 2026Added DeepSeek-R1 to free tieradded