Lang-comp

Description

Lang-comp is a benchmark suite for comparing how effectively AI coding agents work in different programming languages on a set of common programming tasks. The benchmark includes implementations of various algorithms and data structures, as well as a set of test cases to evaluate each implementation.

The main goal of the benchmark is to see how language affordances (e.g. static vs. dynamic typing) and a language's availability in the training data affect token cost.

Source code attribution

All source code used in this benchmark was adapted from the Exercism project.

Main harness

These benchmarks use the GitHub Copilot harness unless explicitly stated otherwise. Fresh context (system prompt, tool definitions, etc.): 15,200 tokens.
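Because the fresh context is a fixed overhead, it can help to subtract it before comparing token cost across languages. The following is a minimal sketch of that arithmetic, not the repository's actual scoring code. The function names, the `(language, total_tokens)` input shape, and the example numbers are hypothetical, and the sketch assumes the overhead is counted once per run.

```python
from statistics import mean

# Fixed harness overhead stated above (system prompt, tool definitions, etc.).
FRESH_CONTEXT_TOKENS = 15_200


def net_tokens(total_tokens: int, baseline: int = FRESH_CONTEXT_TOKENS) -> int:
    """Tokens attributable to the task itself, clamped at zero.

    Assumption: the baseline is counted once per run.
    """
    return max(total_tokens - baseline, 0)


def mean_net_tokens_by_language(runs):
    """Map each language to its mean net token cost.

    `runs` is an iterable of (language, total_tokens) pairs (hypothetical shape).
    """
    grouped = {}
    for language, total in runs:
        grouped.setdefault(language, []).append(net_tokens(total))
    return {language: mean(values) for language, values in grouped.items()}


# Made-up numbers purely for illustration; these are not benchmark results.
example_runs = [("python", 20_200), ("python", 22_200), ("rust", 25_200)]
print(mean_net_tokens_by_language(example_runs))
```

If the harness resends the context on every request, the baseline would instead need to be multiplied by the number of requests in a run.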

