Radha Saraf

I earned a master's in Robotics at Worcester Polytechnic Institute (WPI) in December 2023. While at it, I worked at Skydio and Humane as a computer vision intern.

Previously, I was a part of Skylark Drones, solving computer vision problems associated with drone surveys. I was also the primary backend developer for Drone Mission Ops, the proprietary software for managing commercial drone operations.

I obtained my bachelors in Electrical Engineering from VNIT(India) in 2018, where I actively engaged in projects as part of the robotics club, Ivlabs.

I am interested in visual perception, deep learning & system software and am currently open to new opportunities.

Email / Resume / LinkedIn / Github

Work Experience

	SLAM Engineer Collaborate with the hardware team to address questions that influence SLAM, such as sensor specifications & placements for the VR headset. Develop algorithms for low-latency pose estimation, sensor fusion, calibration, optimization. Work with the embedded team to make the early-stage prototypes of the VR headset functional.
	Computer Vision Engineering Intern Integrated hand gesture recognition feature using MediaPipe model into an android application. Interfaced this feature with the Speech service of the application to speak when the gesture is recognized. Animated a rigged 3D hand model using joint transformations to optimize the rendering of laser effects onto the palm region, resulting in improved visual realism & interactivity. Building a deep learning pipeline for temporal gesture recognition.
	Autonomy Engineer Intern: Computer Vision Worked on enhancing localization accuracy of Skydio’s Visual Positioning System (VPS), enabling more precise drone positioning. Used a sequence of images with relative pose constraints to refine the optimization problem for reducing VIO drift. Utilized SymForce, a symbolic computation library for code generation and nonlinear optimization. Analyzed the performance on flight logs with sensible, intuitive metrics in the absence of ground truth data.
	Perception for drone surveys: Developed an object detection algorithm for Ground Control Points (GCPs) in aerial drone images, combining traditional computer vision tools with a CNN inspired by the LeNet model which resulted in 94.6% accuracy(F-score) Achieved 86% accuracy in estimating crop count for a farm using machine learning techniques(SVM, CNNs) Drone mission planning & operations: Primary back-end developer & maintainer for the proprietary drone operations management application, Drone Mission Ops. Created multiple RESTful API endpoints with the best software development practises & secured them with unit tests. Optimized several API routes to achieve 40-80% reduction in latency using MongoDB. Integrated Celery for background processing of time-consuming tasks like drone mission creation, elevation profile generation for areas of interest, etc. Added support for registration & authentication workflows, conditional requests, push notifications, and pagination among others.

Research & Projects

	Standardization of the Shape of Ground Control Point (GCP) and the Methodology for Its Detection in Images for UAV-Based Mapping Applications Aman Jain, Milind Mahajan, Radha Saraf CVC(SAI), 2019 Springer To establish an accurate earth to orthomap correspondence for aerial images from UAVs, we proposed the design of a simple marker along with the pipeline for its detection
	3D Reconstruction of a scene using SfM and NeRF project page Reconstructed a 3D scene from a set of images with different viewpoints using CV & DL methods, sfM and NeRF resp.
	Zhang's camera calibration project page Implemented camera calibration method as presented by Zhengyou Zhang in his 1998 paper from Microsoft Used eigen decomposition for solving homogenous system of linear equations & MLE for optimization of calibration parameters The mean re-projection error post optimization was close to 0.5 pixels
	Swap faces in a video using CV and deep learning approaches project page Used Delaunay triangulation and Thin plate splines based warping to swap two faces in a video. Blended the faces using Poisson blending
	Panorama stitching for two or more images project page Estimated homography between image pairs using feature correspondences. Used Adaptive non-maximal suppression for uniform features and RANSAC for removing the outliers among the feature matches. Also, trained HomographyNet, a CNN based supervised learning architecture for homography estimation using synthetically generated data.
	Pose estimation of a mobile robot project video Carried out calibration for camera intrinsics of an autonomous mobile robot. Estimated pose of the robot using differential-RGB color-space, image processing tools, and the perspective-n-point algorithm from the OpenCV library. Accuracy was within +/-5 cm.
	Probability based edge detection project page Developed a simplified version of pb, which finds boundaries by examining texture and color discontinuities in addition to intensity discontinuities across multiple scales. Outperforms Canny and Sobel by suppressing the false positives produced by classical methods in textured regions.
	Graph search algorithms: BFS, DFS, Dijkstra's, A* project page Implemented planning algorithms- BFS, DFS, Dijkstra’s, & A* on a grid world setup of configurable obstacle density.
	Deep q-learning to play breakout project page Implemented a Deep QLearning Network(DQN) to play Breakout for an averaging reward over 40 points in 100 episodes.
	Comparison of reinforcement learning techniques on Super Mario Bros. poster Implemented Dueling DQN(DDQN), Asynchronous Advantage Actor Critic(A3C), Proximal Policy Optimization(PPO) on Super Mario Bros. environment
	PID Control of robot manipulator in ROS Used ROS’ client-service, publisher-subscriber frameworks for PID control of robot manipulator end-effector pose.