Haskell tag

Benchmarking a Local LLM on Advent of Code 2025 (Ollama)

Posted on February 27, 2026

The previous benchmarks in this series (Haskell, OCaml, Python, ReScript, Ruby, Elixir, Java, Elm) all used cloud API models — big, frontier-class LLMs served by Anthropic, OpenAI, and others. But what about a local model? Can a 14-billion-parameter model running on a single machine solve the same puzzles?

This post answers that question using qwen2.5-coder:14b via Ollama, tested on AoC 2025 Day 1 in both Python and Haskell.

Benchmarking LLMs on Advent of Code 2025 (Haskell)

Posted on February 24, 2026

I benchmarked 11 LLMs on Advent of Code 2025 Days 1–5, with each model solving the puzzles independently in Haskell. The goal: see which models can reliably produce correct, working solutions, and how fast. One additional model (claude-haiku-4-5) was tested retroactively and has been added to the results.

Developers, developers, developers!

Blog about programming, programming and, ah, more programming!
