Hi, I'm Pawan

I operationalize LLMs with a focus on enterprise scalability, responsible AI, and ISO 42001 compliance

Senior Machine Learning Engineer | NLP & GenAI | Cloud Native AI

Profile Picture
01.

About Me

With over 6+ years of professional experience, including more than 4 years specializing in NLP, LLMs, and Generative AI, I focus on building scalable and ethical AI systems ranging from RAG-powered chatbots to multi-modal agents, using tools like LangChain, Hugging Face, Vertex AI, SageMaker, crewAI and Terraform. My work bridges applied machine learning, infrastructure design, and business-aligned AI delivery.

I’ve led AI initiatives from architecture to production, enabling distributed deployments for 250+ users, reducing operational overhead by up to 40%, and increasing user engagement by 25%. From writing robust ML pipelines and implementing CI/CD workflows to setting up monitoring for continuous training, I take ownership of solutions end-to-end.

My technical focus includes building multi-agent orchestration systems, implementing guardrails for responsible AI use, and generating synthetic data to enhance quality and reduce annotation costs. I’ve also worked on advanced RAG architectures with hybrid retrievers and built evaluation frameworks to measure hallucination rates and factual consistency.

Outside of project delivery, I regularly contribute to the ML community on GitHub and Medium, and I mentor through workshops and peer sessions. I believe in creating practical, high-impact AI solutions while helping others grow along the way.

+6 Years of ML Experience
Conversational AI, Agentic Workflows, RAG, MLOPS Specialization
+40% Infra Cost Reduction via MLOps Optimization
02.

Experience

Senior Machine Learning Engineer

March 2022 – Present
  • Led technical direction for cross-functional GenAI initiatives, delivering 5+ production-grade agentic workflows.
  • Standardized MLOps practices across 5+ business units by designing and implementing CI/CD pipelines and governance protocols.
  • Architected and operationalized cloud-native ML infrastructure on GCP, achieving 30% cost savings.
  • Established AI governance frameworks across AI use cases, integrating responsible AI practices.

Machine Learning Research Assistant

Apr 2021 – Feb 2022
  • Designed RL-based scheduling algorithm for Kubernetes clusters under EU-funded Braine Project, improving resource utilization by 40%.
  • Researched advanced RL algorithms, developing and maintaining distributed ML experiments on Kubernetes clusters for real-time applications.

Software Engineer

Nov 2015 – Aug 2018
  • Managed deployment and runtime operations for enterprise Java applications on WebSphere and WebLogic, ensuring 99.9% system availability.
  • Collaborated with cross-functional teams to troubleshoot deployment pipelines and support production rollouts.
07.

References & Endorsements

Dr. Javad Chamanara

Project Lead, Forschungszentrum L3S – Leibniz University Hannover

“Pawan Kumar demonstrated exceptional technical depth and initiative during his time at L3S. His contributions to machine learning-driven healthcare simulation and edge orchestration were delivered with a high degree of autonomy and rigor. His professionalism and dedication left a lasting impact on our research efforts.”

Ramon Marrero

Senior Head of Data/ML Operations, DISH Digital Solutions GmbH

“Pawan is a rare talent who combines deep technical expertise with business acumen and cross-functional leadership. His GenAI solutions, cloud migration strategies, and CI/CD automation have tangibly improved our operational KPIs. He is a force multiplier for enterprise-scale AI initiatives and a cornerstone of our ML engineering team.”

03.

Skills

My technical toolkit is built around cloud-native AI development, with a focus on implementing scalable and responsible ML systems.

Generative AI & LLMs

  • Google Agent Development Kit
  • MCP Servers
  • Human-in-the-loop
  • GPT-4o
  • Claude 3.7 Sonnet
  • Gemini 2.5 Pro
  • Llama 3
  • Gemini 2.0 Flash
  • Qwen 2.5
  • Few Shot Contextualization
  • Prompt Engineering
  • Synthetic Data Generation
  • Visual QNA
  • PII Data Masking
  • Multimodal AI systems

RAG & Vector DB

  • LlamaIndex
  • LangChain
  • CrewAI
  • Vector Databases (Qdrant, Chroma)
  • Embeddings (OpenAI, Cohere)
  • BM25 & Hybrid Retrievers
  • Multi-Query Retrieval
  • Parent-Child Document Patterns
  • Context Compression
  • Semantic Caching
  • Multi-Agent Chatbots
  • AI Guardrails
  • FAISS
  • BigQuery Vector Search
  • Re-ranking Algorithms
  • Self Reflection

MLOps & ML Engineering

  • Cloud ML platforms (Vertex AI, SageMaker)
  • ML Monitoring
  • ML Pipelines (MLflow, Kubeflow)
  • Feature Stores
  • Batch Predictions
  • LLM fine-tuning
  • Model distillation
  • Predictive Scaling
  • LLMOps best practices

NLP Frameworks

  • TensorFlow
  • PyTorch
  • NLTK
  • spaCy
  • Hugging Face Transformers
  • Scikit-learn

Cloud & Infrastructure

  • GCP
  • AWS
  • GPU optimization
  • Vertex AI
  • SageMaker
  • Cloud Run
  • Cloud Functions
  • Pub Sub
  • Cloud Load Balancer
  • Cloud DNS
  • Databricks

CI/CD & DevOps

  • Cloud Build
  • Jenkins
  • PyTest
  • Artifact/Docker Registry
  • Rolling & Canary Deployments
  • Cloud Load Balancer
  • Cloud Run
  • Firebase
  • Terraform
  • Docker
  • Kubernetes

APIs & Web Development

  • Flask
  • FastAPI
  • Streamlit
  • RESTful Services

Data Engineering

  • SQL/NoSQL
  • BigQuery
  • Pandas
  • Spark
  • Airflow
  • ETL Pipelines

Responsible & Compliant AI (ISO 42001 Aligned)

  • Confident AI
  • Google's What-If tool
  • Bias detection and mitigation
  • Explainable AI techniques
  • Privacy-preserving ML
  • RLHF
  • Ethical AI governance
  • AI security (prompt injection defense)
04.

Projects

A selection of projects that showcase my expertise in Generative AI, RAG, Multi-Agent, and cloud-native AI development. Click on a card for more details.

05.

Education

M.Sc. Digital Engineering

Otto von Guericke University - Magdeburg, Germany (2022)

Specialization in Machine Learning, Deep Learning & Reinforcement Learning

B.Sc. Computer Science

Guru Gobind Singh Indraprastha University - Delhi, India (2015)

Specialization in Object Oriented Programming, Web Development & DBMS

06.

Certifications

Google ML Engineer Certification

Professional Machine Learning Engineer

Google Certified (May 2024 - Present)

Certified Professional Machine Learning Engineer with hands-on experience in building, deploying, and fine-tuning machine learning models on Google Cloud. Skilled at working with complex datasets, writing reusable code, and committed to incorporating responsible AI practices to ensure fairness.

Verify Certification
HashiCorp Terraform Certification

Terraform Associate (003)

HashiCorp Certified (Mar 2025 - Present)

Certified Terraform Associate with hands-on experience in designing, provisioning, and managing cloud infrastructure using Terraform. Skilled in Infrastructure as Code (IaC), automation, and state management, ensuring scalable and secure deployments that support AI and MLOps.

Verify Certification
Databricks AWS Platform Architect Badge

AWS Platform Architect

Databricks Certified (May 2025 - Present)

Certified Databricks AWS Platform Architect with expertise in configuring and optimizing Databricks on AWS. Proficient in platform administration, account API usage, integrating external storage, and implementing secure cloud services. Skilled in managing customer-managed VPCs and encryption keys for enterprise-grade analytics and ML solutions.

Verify Certification
07.

Contact

Let's build something amazing together

I'm always open to discussing new projects, creative ideas, or opportunities to be part of your vision.