AI / ML Engineer

Himanshu
Yadav

Building practical machine learning systems

UTC +05:30
Based in India
Open to ML/AI internships

I'm a pre-final year computer science student focused on machine learning systems and applied AI engineering.

Most of my work involves building systems around models - inference pipelines, monitoring, orchestration, observability, and reliability.

I enjoy understanding how systems behave under real-world constraints and exploring how machine learning works beyond isolated experimentation.

View Résumé ↗
Himanshu Yadav

Education

B.Tech in Computer Science & Engineering

Galgotias University

2023 – 2027

01

RealSignal

Real-time ML system for adaptive fraud monitoring and drift-aware inference

RealSignal Architecture
Problem

Fraud patterns evolve continuously, making static detection systems unreliable in production.

Approach

Streaming-first anomaly detection with monitoring and adaptive retraining workflows.

Execution

Kafka streaming → online feature engineering → Isolation Forest inference → PSI drift monitoring.

Outcome

Real-time anomaly detection with observability, drift awareness, and operational ML lifecycle management.

FastAPI · Kafka · Isolation Forest · MLflow · Docker · AWS

02

PathwayIQ

Reliability-aware clinical decision support intelligence system

PathwayIQ Architecture
Problem

Hospital readmissions increase operational costs and patient risk when high-risk cases are missed.

Approach

Predict calibrated 30-day readmission risk with explainable machine learning.

Execution

Clinical preprocessing → calibrated XGBoost inference → SHAP explanations → drift monitoring and retraining.

Outcome

Explainable risk predictions with monitoring, lifecycle tracking, and clinician-support workflows.

FastAPI · XGBoost · SHAP · Docker · GitHub Actions · AWS

03

AegisAI

Self-healing orchestration layer for reliable LLM systems

AegisAI Architecture
Objective

Control-layer architecture with security filtering, RAG, adversarial multi-agent validation, and self-healing recovery loops.

System

Multi-stage pipeline with input security, RAG, multi-agent validation, and recovery orchestration with full observability.

Results
  • Reduced hallucination pass-through to ~5%
  • Achieved 100% security block rate
Core Insight

LLMs cannot be trusted as standalone systems — reliability requires external control, validation, and failure-aware orchestration.

Python · Local LLMs (Ollama) · Agent-based validation · RAG pipeline · Observability tooling

04

MeetSync

A system that removes friction from meetings

MeetSync Architecture
Problem

Meetings create information, but decisions and ownership get lost.

Approach

Transform audio into structured decisions and actionable tasks.

Execution

Speech → Text → schema-validated task extraction.

Outcome

Clear decisions, assigned tasks, instant results, reduced manual effort.

Flask · PostgreSQL · Deepgram · Gemini

05

Transformer From Scratch

Mechanistic analysis of attention in transformers

Transformer Architecture
Objective

Analyze whether attention heads develop functional roles and quantify their importance using causal ablation.

Method

Built a transformer from scratch and analyzed attention using entropy, positional patterns, and causal ablation.

Key Findings
  • Specialization is task-dependent, emerging primarily in early layers
  • Head importance is asymmetric; identity heads act as critical bottlenecks
Core Insight

Transformer computation is not uniform — a small subset of heads carries most of the critical behavior.

PyTorch · Custom attention modules · Causal analysis pipeline

Get in Touch

I'm currently looking for opportunities in Machine Learning, Applied AI Engineering, and Infrastructure-oriented roles.

Email: radhekrishna8267@gmail.com