Devstral 2 Collection A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 3 items • Updated 22 days ago • 37
ScandEval: A Benchmark for Scandinavian Natural Language Processing Paper • 2304.00906 • Published Apr 3, 2023 • 4
Danish Benchmarks Collection Benchmarks for evaluating Danish Models. • 2 items • Updated Jun 9, 2024 • 4
State-of-the-art Danish Models Collection These models constitute state-of-the-art models for Danish within their respective domain (highlighted below the model). • 18 items • Updated Nov 4 • 16