Flagship-Level Coding in a 27B Dense Model

22nd April 2026 – Link Blog

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model (via) Big claims from Qwen about their latest open weight model:

Qwen3.6-27B delivers flagship-level agentic coding performance, surpassing the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active MoE) across all major coding benchmarks.

On Hugging Face Qwen3.5-397B-A17B is 807GB, this new Qwen3.6-27B is 55.6GB.

I tried it out with the 16.8GB Unsloth Qwen3.6-27B-GGUF:Q4_K_M quantized version and llama-server using this recipe by benob on Hacker News, after first installing llama-server using brew install llama.cpp:

llama-server         
    -hf unsloth/Qwen3.6-27B-GGUF:Q4_K_M 
    --no-mmproj 
    --fit on 
    -np 1 
    -c 65536 
    --cache-ram 4096 -ctxcp 2 
    --jinja 
    --temp 0.6 
    --top-p 0.95 
    --top-k 20 
    --min-p 0.0 
    --presence-penalty 0.0 
    --repeat-penalty 1.0 
    --reasoning on 
    --chat-template-kwargs '{"preserve_thinking": true}'

Here’s the transcript for “Generate an SVG of a pelican riding a bicycle”. This is an outstanding result for a 16.8GB local model:

Performance numbers reported by llama-server:

Reading: 20 tokens, 0.4s, 54.32 tokens/s
Generation: 4,444 tokens, 2min 53s, 25.57 tokens/s

What's Hot

These are the first Nvidia RTX Spark laptops

Escaping the Valley of Choice in BI

Strava declares war on scrapers ahead of IPO

An OpenAI model solved a famous math problem that stumped humans for 80 years

Cognition’s Scott Wu says AI coding agents shouldn’t replace humans

Take Google’s vibe coded I/O 2026 quiz

These are the first Nvidia RTX Spark laptops

Escaping the Valley of Choice in BI

Strava declares war on scrapers ahead of IPO

Quantization from the ground up

David Sacks is done as AI czar — here’s what he’s doing instead

Judge sides with Anthropic to temporarily block the Pentagon’s ban

Most Popular

These are the first Nvidia RTX Spark laptops

Escaping the Valley of Choice in BI

Strava declares war on scrapers ahead of IPO

Our Picks

Quantization from the ground up

David Sacks is done as AI czar — here’s what he’s doing instead

Judge sides with Anthropic to temporarily block the Pentagon’s ban

Subscribe to Updates

What's Hot

Flagship-Level Coding in a 27B Dense Model

Related Posts

Subscribe to Updates