NoCap.wiki

AI agent polygraph for research-grade code.

What it does

After an agent writes the implementation for a paper, the researchers will activate No Cap within their Slack. Give it the paper (arXiv ID or a PDF) and the implementation of an agent (Python file or PR diff). It will return a verdict: pass or an anomaly, including confidence and per-equation evidence.

  • Catches what humans take hours to find: missing math terms, faulty algorithm implementations, dropped normalization constants
  • Four checks running in parallel: does the math symbolically match? Does it match when you plug in real numbers? Does the algorithm have the right number of steps? Are the hyperparameters the same as the paper's defaults?
  • Accessible from inside the Slack with a single command, so the engineers will not need to adapt to a new dashboard.

# nocap-verifications

No Cap app · research implementation checks

Slack
🧢

Deniz10:42 AM

/nocap verify-impl 1412.6980 https://github.com/LA-Hacks-Deniz/nocap/blob/main/benchmark/implementations/adam_clean.py

🧢

No Capapp

🔍 Verifying paper 1412.6980... (<30s)

🟢 No Cap — Implementation matches paper

Confidence
0.95
Paper
arxiv:1412.6980 §Algorithm 1
Function
step
Trace
16405808-09be-4a3f-9e78-327029d17556
CognitionMLH × GemmaMLH × MongoDB AtlasMLH × GoDaddyFigma Flicker to FlowCloudflare