About Me

I work on enabling robots to navigate and act in unstructured environments using foundation models.

Currently, I’m a student at UIUC UIUC pursuing a MS in Computer Science with a focus on Robotics. I’m advised by Girish Chowdhary at Distributed Autonomous Systems Lab. My research is focused on Outdoor Navigation with Vision Language Action models and embodiment aware grounding for Vision Large Language Models.

Previously, I worked at Earthsense, on hardware design, autonomy, and optimization for a 750kg payload AGV. My contributions included developing the lower-level control systems and implementing 4-wheel independent torque vectoring for the TerraMax Robot’s dual Ackermann steering.

I completed my B.Tech in Mechanical Engineering with a Minor in Computer Engineering at College of Engineering, Pune COEP.

I enjoy running and cycling, currently preparing for a 5k in 25 minutes.

robot_video_terramax

Publications

CATNAV: Cached Vision-Language Traversability for Efficient Zero-Shot Robot Navigation

Paper: Link

Zero-shot embodiment-aware traversability framework using multimodal LLMs for costmap generation; visuosemantic caching reduces online VLM queries by 85.7%, achieving 10% higher goal-reaching rate and 33% fewer behavioral constraint violations versus state-of-the-art VLA baselines on a quadruped robot.

Visual-Language-Guided Task Planning for Horticultural Robots

Paper: Link

Modular VLM-guided framework for precision agriculture interleaving natural language queries with action primitives for autonomous crop monitoring, benchmarked long-horizon planning using MLLMs, finding human-comparable performance on short-horizon tasks but degradation in long-horizon scenarios with noisy semantic maps.

more coming soon …

Projects

Low-Rank Adaptation for Video Generation with semantic relative pose prompts (CS 598 3D Vision, HACKER Project)

Code: Link

Fine-tuned Wan 2.2 (1.3B) with LoRA to generate robot-POV navigation videos from text and motion plans for scalable outdoor data synthesis. Evaluated motion fidelity and failure modes, proposing hierarchical motion-primitive curriculum training to improve alignment.

GhibliDream – Studio-Ghibli inspired Stylization of Stable Diffusion (CS 444)

Report: Link

Fine-tuned StableDiffusion-2.0 with DreamBooth on curated Ghibli images using LLM-assisted auto-captioning, achieving 0.90+ CLIP-I cosine similarity on foreground characters while retaining background quality.

generated sample images

Salto Simulator for development

Code: Link

Gazebo plugin simulating Salto-1P jumping motion as a point object, supporting parabolic trajectory jumping and simulated odometry/pose estimation to accelerate autonomy development.

salto-animation-1

VLM assisted Octomap generation for navigation

Implemented open-vocabulary segmentation with CLIPSeg and stereo depth-based point cloud generation via Open3D to build octomaps for 3D robot navigation, tested on the NVIDIA r2b_2023 dataset.

Design and Analysis of Tendon actuated Robotic arm using Bowden cables and Mechanical Multiplexing (Thesis @ COEP)

Report: Link

Designed a modular 4-DOF Bowden cable-driven tendon-actuated robot arm with a mechanical multiplexer enabling full control with only 2 stepper motors, prototyped with FDM printing and including FK/IK solvers. Awarded Best Working Project (Mechanical) at COEP; provisional patent filed (IN202321006687).

multiplexer-arm-2
multiplexer-arm-1

Semantic-aware segmentation and navigation using CLIPSeg (CS440)

Slides: Link Report: Link

Combined CLIPSeg with Depth-Anything V2 for obstacle-aware gridmap generation (94% accuracy); modified A* planner with goal-object validation achieves 81% success rate for line-of-sight pathfinding.

smartnav image output

ROS1 ROS2 bridge for faster debugging and code refactoring

  • Created a Docker image generation repo to accelerate image creation and facilitate easier refactoring. Link

Competitive Robotics

fll logo wro logo

I am a FIRST and WRO alumni and I have participated in Robotics competitions since 2013. I have represented India Internationally in Robotics competitions thrice and also mentored a team which won Runner up Best Project Research in FLL Europe Opens in Estonia in 2018. I continue to mentor teams at Robominds for FLL, FTC and Vex Robotics.

Achievements


Made using minimal-mistakes