ANSHUMAAN DASH

Deep Learning | Computer Vision | Problem Solving | Leadership

ABOUT


Lead AI Product Engineer with 7+ years of industry experience, across big-techs and startups, in building scalable, high performant, and power-efficient AI systems. I specialize in designing and optimizing deep neural networks for edge/compute-constrained devices.

I have done Post-Graduation in AI and Leadership from Plaksha TLF(in collaboration with UC Berkeley), Founding batch of 2020.

My journey has been about understanding complex CNN-based architectures and building a knowledge base to translate business problems into simple technical tasks that can be solved using AI.

Since I remember and think through images, I believe working in deep learning-based computer vision will help me figure out how to make machines realize human intelligence and perceive the world the way we do.

Relevant Projects


Exam Proctoring Application

Application checks if examinee is looking at screen or not with Accuracy = 92%. Used RetinaNet with ResNet50 backbone on WiderFace Dataset. mAP (IoU=0.5): 94.62% (Easy); 81.94% (hard)

Font Similarity

A RESTful API using Flask that accepts a font family and a number between -1.0 to 1.0 that tells whether to fetch similar, contrasting or pleasingly contrast fonts. Based on a Resnet-50 feature extractor and a custom similarity scrore calculator.

Pose based Gait Extraction

Recognize a person from their Gait, based on the temporal flow of their pose keypoints. Recognition Accuracy - 86%

Observer's Attention

Attention Heatmap generation for visual designs, based on the observers' attention.

Object detection on Kitti dataset

Pytorch based Object detection on Kitti Dataset. Used Faster-RCNN with Resnet-50 backbone having feature pyramid network. [ mAP (0.50) = 0.944 ]

Two Factor Authentication

Pytorch based Facial Expression recognition system using the CK+ dataset on Mobilenet v2. Six classes {Anger, Disgust, Fear, Happiness, Sad, Surprise, Neutral} [Accuracy = 94.64]

Yolo-v3 with OHEM

Keras based Yolo-v3, trained on Pascal VOC-2012 dataset. Improved performance and accuracy optimization using Online Hard Example Mining (OHEM). [mAP = 0.75]

Customized Neural Style Transfer

Pytorch based Neural style transfer that utilizes the ImageNet pretrained VGG-16 model to stylize an image by extracting content from one image and style from the other.

Speech to text for numbers

LSTM based network to classify input audio as a numerical value. It mainly utilized the MFCC vector as an input to a multi-layered LSTM network.

N-tier CBIR

N-tier Content Based Image Retrieval System. F1 Score - 0.87

Activities


Startup Venture

Grammarly for visual design - Design Validation using AI

Conduct Hackathon

Deep learning hackathon on image classification and object detection

Organize AI Meetups

Active speaker at Cellstrat AI Labs, Bangalore

Webinars

Webinars on Computer Vision based Deep Learning

Design Thinking

Case Study on Healthy Employee Initiatives in corporates

Ethics in AI

Case Study on Ethics in Surveillance Based Social Credit Score

Contact