What it does
After an agent writes the implementation for a paper, the researchers will activate No Cap within their Slack. Give it the paper (arXiv ID or a PDF) and the implementation of an agent (Python file or PR diff). It will return a verdict: pass or an anomaly, including confidence and per-equation evidence.
- Catches what humans take hours to find: missing math terms, faulty algorithm implementations, dropped normalization constants
- Four checks running in parallel: does the math symbolically match? Does it match when you plug in real numbers? Does the algorithm have the right number of steps? Are the hyperparameters the same as the paper's defaults?
- Accessible from inside the Slack with a single command, so the engineers will not need to adapt to a new dashboard.
# nocap-verifications
No Cap app · research implementation checks
🧢
Deniz10:42 AM
/nocap verify-impl 1412.6980 https://github.com/LA-Hacks-Deniz/nocap/blob/main/benchmark/implementations/adam_clean.py
🧢
No Capapp
🔍 Verifying paper 1412.6980... (<30s)
🟢 No Cap — Implementation matches paper
- Confidence
- 0.95
- Paper
- arxiv:1412.6980 §Algorithm 1
- Function
step- Trace
16405808-09be-4a3f-9e78-327029d17556
CognitionMLH × GemmaMLH × MongoDB AtlasMLH × GoDaddyFigma Flicker to FlowCloudflare