Hi, I'm Nikhil Soni

AI/ML Engineer | Data Scientist | Software Developer

I design intelligent systems and data-driven solutions, combining machine learning, backend development, and clean code to solve real-world problems.

1 Year

Internship Experience

About Me

Who am I?

I'm a Computer Science graduate from NYU, passionate about building scalable AI/ML pipelines and robust backend systems. I've interned at Emerson, Junglee Games, and HPE, and worked on projects such as CrisisCast and dementia prediction models.

My approach combines technical expertise with creative problem-solving to deliver high-quality solutions. I'm dedicated to writing clean, efficient code and creating intuitive user experiences.

Problem Solving
Responsive Design
Clean Code
Team Collaboration

My Projects

Here are some of my recent projects. Each one was carefully crafted to solve specific problems and deliver value.

CrisisCast: Real-Time Crisis Detection

Developed an end-to-end real-time pipeline that ingests social signals from Reddit and Google, classifies posts using a fine-tuned LLM, and visualizes crises on a dynamic dashboard. Leveraged Kafka, PySpark, FastAPI, MongoDB, and Qdrant for scalable processing and semantic search.

Kafka PySpark FastAPI MongoDB Qdrant LLM Docker

GenVision: Personalized Image Generator

Built a full-stack app that fine-tunes Stable Diffusion XL with DreamBooth and LoRA to generate user-personalized AI images. Integrated a Flask backend and Gradio frontend for real-time image generation with a feedback slider.

Stable Diffusion XL DreamBooth LoRA Flask Gradio

Rent Raja: NYC Rental Price Prediction

Built an ML-powered tool to estimate NYC rental prices using 300K+ property listings scraped via APIs. Developed a hybrid classification-regression model with 81% accuracy on price bins and an RMSE under $300. Deployed via Flask and Dash with LLM-powered report generation for broker insights.

Scikit-learn Flask Dash LLM APIs

My Skills

Here's a snapshot of my technical toolkit, ranging from machine learning and big data to full-stack development and deployment.

AI/ML

PyTorch / TensorFlow 90%
scikit-learn 95%
LLM (LoRA, RAG) 85%
MLOps 75%

Backend / Big Data

PySpark / Kafka 90%
FastAPI / Flask 85%
Docker / REST APIs 80%
Hugging Face 75%

Data Storage

MongoDB / PostgreSQL 85%
Qdrant / Redis 75%
Snowflake 70%
AWS S3 / Athena 70%

Tools & DevOps

Git / GitHub 90%
Docker / Kubernetes 75%
Streamlit / Dash 80%
Linux / VS Code 85%

My Journey

My professional path and educational background that shaped my expertise.

Experience

Graduate Teaching Assistant - Deep Learning

Jan - May 2025

New York University

Mentored 400+ students through office hours, clarifying concepts in diffusion models, reinforcement learning, and advanced deep learning.

AI/ML Intern

Jun - Aug 2024

Emerson

Built an LLM-based validation tool using T5 and One-Class SVMs to automate DeltaV code checks, saving manual QA hours.

Data Science Intern

Jan - Jul 2023

Junglee Games

Extracted and modeled user gameplay data, deployed ML pipelines with AWS Lambda, S3, and API Gateway, and tracked metrics.

Software Developer Intern

Jun - Jul 2022

Hewlett Packard Enterprise

Built a document management platform using Django, HTML, CSS, and JavaScript, and integrated AWS services for storage and authentication.

Education

MSc in Computer Science

2023 - 2025

New York University (Tandon School of Engineering)

Focus: AI/ML, Big Data, and Deep Learning. TA for Deep Learning. Projects include CrisisCast, LLM fine-tuning, and RAG pipelines.

B.Tech in Computer Science

2019 - 2023

Manipal University Jaipur

Graduated with Honors. Worked on NLP projects including hybrid language detection and dementia prediction (published).

Get In Touch

Have a project in mind or want to discuss potential opportunities? Feel free to reach out!

Contact Information

Email

nikhilsoni700@gmail.com

Phone

+1 (934) 233-1695

Location

Brooklyn, NY
