SWECCATHON 2026

Benchmark literally anything

SWECCATHON 2026 is SWECC’s build weekend for AI environments. Over the weekend you’ll design an environment — a game, a simulation, a slice of a real job — and put AI agents to the test inside it on mesocosm, SWECC’s open benchmarking platform.

If you can describe a task, you can benchmark it. You build the world; mesocosm runs the agents, scores them, and hands you the traces. Your goal: make us go “I didn’t know a model could do that.”

Who it’s for

Open to all UW undergrads, CS and non-CS alike. Team up (1–4 people) or come solo and find teammates at the event. No ML background required.

Schedule
  • Fri, May 29 — Kickoff (in person, ~2 hrs): intro to mesocosm, quick-start, two live demos, example benchmarks, and the three tracks. Hacking starts right after.
  • Sat, May 30 – Mon, Jun 1 — Build: tech support on Discord all weekend.
  • Mon, Jun 1 · 4:00 PM — Submissions close.
  • Mon, Jun 1 · 5:00 PM — Judging & closing (in person): top 5 teams present (3–5 min); judges name one winner per track + an overall winner.

Judging

One north star: the wow factor. Then:

Criterion: weight
  • Track execution & wow: ~40%
  • Usefulness & gap: 30%
  • Data & reusability: 20%
  • Presentation clarity: 10%

Credits, prizes & swag
  • Cursor API credits at the event (~1/team; skip one if you’re already on a paid plan).
  • Platform runs covered by mesocosm — use cheaper/local models (DeepSeek, Ollama) first.
  • Sponsored by Google; swag + food & drinks provided.
  • Track prizes announced soon.

Requirements

What to build — A benchmark environment on mesocosm. Pick a track and build a task an agent can attempt; implement mesocosm’s four-endpoint contract (/health, /reset, /step, /close) via three files: env.py, adapter.py, benchanything.json. The CLI scaffolds it in minutes.
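
To make the four-endpoint contract concrete, here is a minimal sketch of what an environment and its adapter might look like. It is not mesocosm's actual scaffold: the class name, the payload fields (seed, guess, observation, reward, done), and the handle() dispatcher are all assumptions made for illustration, and benchanything.json is left out because its schema isn't described here. Use the files the CLI generates as the real starting point.

```python
import random
from typing import Optional


class GuessNumberEnv:
    """Toy env.py-style task (hypothetical): guess a hidden integer from hints."""

    def __init__(self, low: int = 1, high: int = 100, max_steps: int = 10) -> None:
        self.low, self.high, self.max_steps = low, high, max_steps
        self.target = low
        self.steps = 0

    def reset(self, seed: Optional[int] = None) -> dict:
        # A seeded RNG means the same seed always produces the same episode.
        self.target = random.Random(seed).randint(self.low, self.high)
        self.steps = 0
        return {
            "observation": f"Guess a number from {self.low} to {self.high}.",
            "reward": 0.0,
            "done": False,
        }

    def step(self, action: dict) -> dict:
        guess = int(action["guess"])
        self.steps += 1
        if guess == self.target:
            return {"observation": "correct", "reward": 1.0, "done": True}
        hint = "higher" if guess < self.target else "lower"
        # The episode also ends once the step budget is used up.
        return {"observation": hint, "reward": 0.0, "done": self.steps >= self.max_steps}


def handle(env: GuessNumberEnv, path: str, payload: Optional[dict] = None) -> dict:
    """Adapter-style dispatch (assumed shape): map the four endpoints to the env."""
    payload = payload or {}
    if path == "/health":
        return {"status": "ok"}
    if path == "/reset":
        return env.reset(payload.get("seed"))
    if path == "/step":
        return env.step(payload)
    if path == "/close":
        return {"closed": True}
    raise ValueError(f"unknown endpoint: {path}")


def play_binary_search(env: GuessNumberEnv, seed: int = 0) -> dict:
    """Stand-in for an agent: binary search driven only by the hints."""
    handle(env, "/reset", {"seed": seed})
    lo, hi = env.low, env.high
    result = {"observation": "", "reward": 0.0, "done": False}
    while not result["done"]:
        guess = (lo + hi) // 2
        result = handle(env, "/step", {"guess": guess})
        if result["observation"] == "higher":
            lo = guess + 1
        elif result["observation"] == "lower":
            hi = guess - 1
    handle(env, "/close")
    return result


if __name__ == "__main__":
    print(play_binary_search(GuessNumberEnv(), seed=42))
```

In a real submission the dispatcher would sit behind an HTTP server and the agent would be a model rather than a scripted search. The point of the sketch is the split: the environment owns state and scoring, while the adapter only translates endpoint calls.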

Submit on Devpost

  1. A public GitHub repo with your benchmark.
  2. A UI / visualization of your agent traces (or the thing you simulate).
  3. A 1–2 min demo video — voiceover walkthrough, no face needed; unlisted YouTube/Vimeo link.

Teams: 1–4, all UW undergrads. Register your team on Devpost before submitting.

Deadline: Mon Jun 1 · 4 PM (judging 5 PM, in person).

Hackathon Sponsors

Prizes

2 non-cash prizes
  • Google Merchandise (1 winner): Tumblers, Bottles, and Hats...
  • Surprise Prizes for top 3 and overall winner (3 winners): Still ordering these... lol.

Judges

  • Simon Kurgan, Amazon
  • Niti Shah, Google
  • Rogerio Panigassi, Google
  • Raman Singh, Google

Judging Criteria

  • Track Execution & WOW
    How well did you deliver on the ideas laid out in the track description? Does your app have robust data? Does it have a robust UI or scoring system? Did you clearly think through the design of the app?
  • Usefulness & Gap
    Does your app address a real need? Does it provide something nobody else can, or provide it better?
  • Data & Reusability
    How reproducible are your benchmark runs? How useful is the data you create?
  • Presentation Clarity
    How clearly does your UI showcase your idea? How good is your video, if you submitted one?

Questions? Email the hackathon manager
