Nilesh Sarkar

Research Blogs

Thoughts and project notes on AI, LLMs, and engineering.

01

Highlights

ConferenceDeadline: every research deadline, in one place
Jun 2026 · Erdős AI Lab · Live

I made my internal deadline tracker public. Search 250M papers, track 12K venues and 2K conferences across 53 fields, and ask in plain language. Over 1,000 researchers on board, built and maintained solo.

SAE Experiments: Pythia-410M & toy-model validation
Active · Erdős AI Lab

SAE experiments tied to a KD paper: toy validation of the minimum-width theorem plus three Pythia-410M trials.

AI × Bio: Protein Folding on a single GPU
Jun 2026 · Erdős AI Lab

Four protein experiments on one A100: a 1 µs chignolin fold, titin I27 force-pulling, an 8-protein ESMFold sweep in 12.9 s, and ESMFold confidence calibration. Each validated against the published number.

CausalGrok: Grokking for Under-Sampled Datasets
Ongoing · Independent Research

An ongoing investigation into using grokking for better generalisation on under-sampled image data. A first check of off-recipe behaviour on Camelyon17, before the full domain experiments.

Erdős AI Lab: Founding Principal AI Researcher
Feb 2026 - Present · Lab

Knowledge distillation, mechanistic interpretability, and world models. KD paper under review at COLM 2026.

Interpretability Experiments
Ongoing · Research

Circuit-level studies of transformer internals: attention patterns, probing, and activation patching to understand what models actually compute.

Medical AI: PCOS Detection
200 epochs · 18 architectures

An 18-architecture benchmark on PCOS ultrasound. A hybrid CNN-ViT hit 99.81%. A three-stage dedup pipeline cleaned 70% of the public dataset.

Language Modeling Experimentation
Jan - Feb 2026 · Ongoing

Fine-tuning Qwen 0.5B-7B on Indic languages; deploying Gemma 3 1B on Jetson Nano. Compression and diffusion-LM threads now live.

Agentic RAG chatbot with LangGraph
Jul 2024 · Project Notes

An Agentic RAG workflow with LangGraph and Streamlit. The system prioritizes PDF Q&A and falls back to web search when necessary.

Image generation with Stable Diffusion
Apr 2024 · Experiments

Experiments with Stable Diffusion v1.5 via diffusers: negative prompts, guidance scales, and prompt-engineering basics.