About
Highly accomplished ML Research Engineer with 4 years of experience in developing and deploying cutting-edge AI solutions across the full lifecycle, from ideation to production. Proven expertise in Computer Vision, Autonomous Driving, LLMs, and Generative AI, with a strong track record of pioneering scalable systems, optimizing complex models, and leading cross-functional teams to deliver high-impact technological advancements.
Work
→
Summary
Led the development and deployment of advanced AI/ML solutions, including 3D Digital Twin pipelines, neural rendering models, and object detection systems, driving innovation in autonomous systems and model optimization.
Highlights
Pioneered the industry's first globally scalable 3D Digital Twin pipeline, integrating multi-sensory data and leveraging Blender/Open3D in Python to enable automated material assignment.
Designed and developed an ML pipeline for the "Segment Anything Mesh Model" to zero-shot label 3D meshes, optimizing material assignment and establishing a scalable framework for automated city-scale 3D model labeling.
Led a cross-functional team of 7 engineers in the end-to-end development of an autonomous tricycle cart, integrating Stereo Cameras, SLAM, and ROS for outdoor navigation and obstacle avoidance.
Engineered advanced neural rendering models (NeRF, Gaussian splatting) to enhance reconstruction quality of complex outdoor environments, integrating ICP for precise Point Cloud alignment within the Digital Twin.
Advised over 10 interns and capstone students on cutting-edge research projects focused on LLMs, Generative AI, and Model Optimization, fostering next-generation talent.
→
Summary
Developed and optimized advanced ML models for 5G channel compression and Snapdragon devices, spearheading a low-latency ML driver framework and building ETL pipelines for large-scale data analysis.
Highlights
Developed CNN and Transformer-based ML models for 5G Channel Compression, leveraging PyTorch and Ray for distributed training, featured in Qualcomm's and Nokia's MWC Barcelona 2024 demo.
Optimized large ML models for Snapdragon phones, reducing memory computation from FP16 to UINT4 while maintaining accuracy through Post Training Quantization.
Spearheaded the development of a C-based ML driver framework, enabling model inference on Hexagon Tensor Processor with less than 10us latency overhead, showcased at MWC ‘23 and ‘24.
Constructed an ETL data pipeline using Dask on AWS, facilitating Exploratory Data Analysis on terabytes of 5G modem data from real-world deployments.
→
Summary
Conducted research on gait analysis and Parkinson's Disease detection using IMUs, developing a novel parameter and a C++ library to enhance diagnostic accuracy.
Highlights
Devised "FluMo," a novel parameter using Modulated Spectrograms to quantify motion fluidity, improving SOTA ML algorithm accuracy by 0.2% on the PAMAP2 dataset.
Developed "LibIMU," a C++ library for synchronous data acquisition from multiple IMU devices at up to 100 Hz.
→
Summary
Developed a computer vision pipeline for 3D rebar detection from drone imagery, significantly improving detection precision and efficiency through advanced ML models and GUI development.
Highlights
Developed a perception pipeline utilizing decision-level fusion, integrating classical CV and Deep Learning models (Faster RCNN, UNet, PSPNet) with PyTorch and OpenCV for rebar extraction from aerial images.
Utilized Point Cloud Registration to fuse Photogrammetry and LIDAR 3D data, achieving 0.92 precision and 0.98 recall with a 3 cm offset for 3D rebar detection.
Programmed a Qt-based GUI labeling framework in Python, reducing labeling time and effort by over 50%.
Education
Awards
Second Prize Winner, ThinkOnward Generative AI Challenge
Awarded By
ThinkOnward
Awarded second prize in an international Generative AI competition, recognizing innovative solutions and securing a $25,000 prize.
Publications
Published by
Qualcomm
Summary
Authored 8 patents (provisional approved) and 2 pending patents with Qualcomm in the fields of Machine Learning, Computer Vision, Robotics and Wireless Communication, demonstrating significant contributions to intellectual property and innovation.
Languages
English
Skills
Languages
Python, C++, C, MATLAB, SQL, Flutter.
Packages and Tools
Pytorch, Tensorflow, Ray, mmDetection, MeshLab, Open3D, OpenCV, Scikit-Learn, PySpark, Dask, Pandas, AWS, Robot Operating System.
Expertise
Computer Vision, Autonomous Driving, LLMs, Generative AI, ML model optimization.