Nilesh Sarkar
/
Blog
Thoughts and project notes on AI, LLMs, and Engineering.
Back to Home
Events
Highlights
SAE Experiments - Pythia-410M & toy-model validation
Active • Erdős AI Lab • COLM 2026
Sparse Autoencoder experiments tied to a KD paper under review at COLM 2026. Toy-model validation of the KD minimum-width theorem plus three real-LM trials sweeping L1 sparsity on Pythia-410M (24 layers × 6 checkpoints each on Hugging Face).
Agentic RAG chatbot with LangGraph - project notes
Jul 2024 • Project Notes
Building an Agentic RAG workflow with LangGraph and Streamlit. The system prioritizes PDF Q&A and falls back to web search when necessary.
Image generation with Stable Diffusion - quick notes
Apr 2024 • Experiments
Running experiments with Stable Diffusion v1.5 via diffusers. Exploring negative prompts, guidance scales, and prompt engineering basics.
Erdős AI Lab - AI Researcher
Feb 2026 - Present • Lab
Knowledge distillation, mechanistic interpretability, and world models. KD paper under review at COLM 2026.
Interpretability Experiments
Ongoing • Research
Circuit-level studies of transformer internals: attention patterns, probing, and activation patching to understand what models actually compute.
Medical AI: PCOS Detection (published journal)
Published • 200 epochs · 18 architectures
Systematic benchmark of 13 CNNs and 5 ViTs for PCOS ultrasound detection. Hybrid CNN-Transformer models (EfficientFormer-L1, MobileViT-Small) hit 99.81% accuracy with AUC up to 1.0. Includes a novel three-stage MD5 + pHash + cross-class deduplication pipeline that cleaned 70.4% of the public PCOS-XAI dataset.
Protein Folding Experiments
Ongoing • Research
Structure-prediction studies on small proteins - pLDDT confidence, folding trajectories, and transformer folding stacks vs. classical MD baselines.
Language Modeling Experimentation
Jan-Feb 2026 • Ongoing Project
Hands-on experimentation with LLM fine-tuning, compression, and deployment across diverse hardware. Testing Qwen models (0.5B → 7B) on Indic languages and deploying Gemma 3 1B on Jetson Nano edge devices.
Events & Talks
Founding Erdős AI Lab
Mar 2026 • Venture
Met Dawar and Dhruv at the India AI Impact Summit - and together we started Erdős AI Lab, a collective devoted to advancing frontier AI through knowledge distillation, world models, and rigorous research.
India AI Impact Summit 2026
Feb 2026 • Conference
Representing DSU at India's premier AI conference in New Delhi, showcasing university innovation and presenting research projects including LLM Architecture, Medical AI, and Autonomous Drones.
Bangalore Drone Community Meetup
Oct 2025 • Event
Insights from the Bangalore Drone Community Meetup: Startups, regulations, and the future of UAVs.
Build with AI: Google’s Agent Development Kit & MCP
May 2025 • Workshop
My experience at the Build with AI workshop organized by Google and Deep Tech Stars. Exploring ADK and Model Context Protocol.
NASA Space Apps: Landsat SR on the fly
Oct 2024 • Hackathon
A solution for comparing ground observations with Landsat Surface Reflectance data and alerting users of satellite overpasses.
Notes & Experiments
Sutskever's List: Foundational AI Research
Reading List • Ilya Sutskever
A curated collection of 30 research papers that Ilya Sutskever (OpenAI) claims contain "90% of what matters today" in AI.
Hacker's Guide to Neural Networks
Reproduction • Andrej Karpathy
A reproduction of Andrej Karpathy's legendary "Hacker's Guide to Neural Networks". A code-first approach to understanding backpropagation and real-valued circuits.
Research & Engineering
Moog Controls - AI Research Intern
Jun 2025 - Apr 2026 • Industry
Building agentic systems (LangChain, LangGraph, RAG) on top of internal engineering data at Moog India Technology Centre, Bengaluru.
LLM Curriculum
Resource • Learning Path
A practitioner-oriented learning path through large language models - what to read, what to build, what to skip.
Medical AI: Deep Learning for PCOS Detection
Dec 2025 • Research Project
Exploring deep learning for medical imaging, focusing on automated PCOS detection and using generative models for better data augmentation.
LectureToSlides: Video to PDF Converter
Tool • Computer Vision & AI
A browser-based tool I built that uses computer vision to find slide transitions in videos and Gemini AI to generate notes and quizzes.
Project Humanoid Robot
Sep 2025 • Research Project
Research project focused on developing a teleoperated robotic arm system for humanoid robotics applications, with an emphasis on intuitive human-robot interaction.
Autonomous Intelligence & Robotics (AIR): Autonomous Surveillance Drone
Present • Research Group
We are in the concept of building surveillance drones. More stuff coming soon.