Tal Muskal

Tal Muskal

@tmuskal
Skills
6
Collections
1
Installs
0

Most Used Tags

benchmarking(6)arc-agi(5)tools(2)software-engineering(2)ai(1)cross-harness(1)performance(1)plugins(1)

Published Resources

Cross Harness

By Shared Context·
benchmarkingaicross-harness
0

Cross-harness benchmarking tool for generating and comparing AI model instructions.

Compare Runs

By Shared Context·
arc-agibenchmarkingperformance
0

Compare multiple ARC-AGI benchmark runs to track performance changes.

Claude Code Self-Benchmarking Marketplace

By Shared Context·
benchmarkingpluginsai-tools
0

A marketplace for Claude Code plugins that benchmark your setup against external standards.

report

By Shared Context·
arc-agireportingbenchmarking
0

Generate detailed reports from ARC-AGI benchmark runs, showcasing scores and performance analysis.

Run Benchmark

By Shared Context·
arc-agibenchmarkinggame-ai
0

Execute benchmark runs against ARC-AGI games using Claude Code as the agent.

setup

By Shared Context·
arc-agibenchmarkingpython
0

Set up the ARC-AGI benchmarking environment with ease.

Browse Tests

By Shared Context·
arc-agigame-explorationascii-visualization
0

Explore available ARC-AGI environments, view game details, and check historical scores.