🟢 Open to Work

Jay
Krishna

AI/ML Engineer  — 

8+ years designing and deploying production-grade AI/ML systems, Generative AI pipelines, and cloud data platforms for Wells Fargo, Walmart, and United Health Group.

8+
Years Exp.
4
Employers
40+
Technologies
F500
Clients
Jay Krishna
Jay Krishna
AI/ML Engineer
jaykrishna0527@gmail.com
📞
+1 (404) 851-1688
📍
Newark, NJ, USA
Open to Work
Scroll
Profile

About Me

I am an AI/ML Engineer with 8+ years of experience designing and deploying scalable, production-grade AI/ML and data solutions across cloud and enterprise environments. I have full ownership of the ML lifecycle — from data ingestion and feature engineering through model training, deployment, monitoring, and retraining.

My expertise spans Generative AI (RAG, LLM fine-tuning, LangChain, LangGraph, Azure OpenAI), deep learning (CNNs, LSTMs, Transformers, BERT, GPT-4), and classical ML (XGBoost, LightGBM, Random Forest, Isolation Forest). I build and deploy AI microservices with FastAPI and Docker on Kubernetes (AKS/EKS).

I have delivered end-to-end AI/ML workloads on AWS SageMaker and Azure ML, integrated real-time inference with Apache Kafka and Spark Streaming, and implemented full MLOps pipelines using MLflow, DVC, GitHub Actions, and Terraform.

Strong background in Python, PySpark, Scala, and the Hadoop ecosystem, with hands-on experience in vector databases (FAISS, Pinecone) for semantic search and recommendation systems.

Currently at Wells Fargo (Oct 2023 – Present) — building an AI-powered real-time fraud detection system using XGBoost, Isolation Forest, LSTMs, and a RAG-based investigation assistant powered by LangChain and FAISS on Azure.
📞
Phone
+1 (404) 851-1688
📍
Location
USA
📋
Methodology
Agile / Scrum • SDLC
💾
Cloud Platforms
Microsoft Azure • AWS
Availability
Open to New Opportunities
Expertise

Technical Skills

A production-grade AI/ML stack built over 8+ years across finance, retail, and healthcare.

💻
Languages & Scripting
PythonSQLScalaPySparkBashShell Scripting
🧠
ML / DL Frameworks
Scikit-learnXGBoostLightGBMTensorFlowPyTorchKerasSpark MLlibHuggingFace Transformers
🤖
Generative AI & LLMs
GPT-4BERTT5RAGPrompt EngineeringLLM Fine-tuningLangChainLangGraphFAISSPinecone
💬
NLP & Text Analytics
spaCyNLTKSentenceTransformersText ClassificationNamed Entity RecognitionSentiment Analysis
🚀
Model Deployment
FlaskFastAPIDockerKubernetes (AKS/EKS)ONNXTensorRTREST APIs
MLOps & CI/CD
MLflowDVCGitHub ActionsJenkinsTerraformAzure DevOpsDrift DetectionModel Versioning
Cloud — Azure
Azure MLDatabricksData FactorySynapseADLS Gen2AKSAzure OpenAI ServiceAzure AI Search
🔸
Cloud — AWS
SageMakerS3EC2LambdaEMRRedshiftGlueDynamoDB
Big Data & Streaming
Apache SparkPySparkHadoop (HDFS/Hive/HBase)KafkaNiFiAirflow
📈
Databases
OracleMySQLPostgreSQLMongoDBCassandraSnowflakeDynamoDB
📉
Monitoring & Visualization
PrometheusGrafanaPower BITableauPlotly
Career

Work Experience

8+ years across financial services, retail, healthcare, and IT consulting.

8+
Years
4
Employers
3
Certifications
2
Cloud Platforms
40+
Technologies
AI/ML Engineer
🏢 Wells Fargo
Oct 2023 — Present
  • Designed end-to-end ML lifecycle covering data ingestion, feature engineering, model training, real-time inference, and monitoring using PySpark pipelines in Azure Databricks, orchestrated through Azure Data Factory.
  • Built fraud classification models using XGBoost and Random Forest on transaction amount, merchant category, geolocation, and device fingerprint features; implemented real-time anomaly detection using Isolation Forest and Autoencoder on streaming data.
  • Developed time-series models using LSTM and ARIMA to detect seasonal fraud trends and predict high-risk transaction windows.
  • Built a RAG-based fraud investigation assistant using FAISS and HuggingFace SentenceTransformers to help analysts retrieve historical cases and investigation playbooks; orchestrated multi-step reasoning with LangGraph.
  • Containerized and deployed ML scoring services via FastAPI, Docker, and Azure Kubernetes Service (AKS) with autoscaling for sub-second latency fraud scoring.
  • Implemented model explainability using SHAP and LIME for regulatory audit compliance under BSA/AML guidelines.
  • Managed experiment tracking and model versioning using Azure ML integrated with MLflow; developed Prometheus and Grafana dashboards to monitor fraud scoring latency and data drift.
PythonPySparkAzure DatabricksAzure MLAzure OpenAILangChainLangGraphRAGFAISSXGBoostLSTMFastAPIDockerAKSMLflowKafka
Machine Learning Engineer
🏪 Walmart
Apr 2022 — May 2023
  • Designed end-to-end ML pipelines for data ingestion, feature engineering, model training, batch scoring, and monitoring using PySpark on Azure Databricks, orchestrated through Apache Airflow.
  • Built demand forecasting models using XGBoost, LightGBM, and ARIMA to predict product demand patterns, reducing inventory costs and improving stock availability across regions.
  • Implemented customer segmentation using K-Means clustering on large-scale POS transaction data to support targeted marketing strategies.
  • Engineered features from retail transaction data including rolling aggregates and seasonal indicators using Spark window functions; applied SHAP for interpretable feature importance insights.
  • Deployed containerized ML scoring services with Docker; orchestrated CI/CD pipelines using Jenkins and Git for reproducible deployments.
  • Implemented Kafka and Spark Streaming for live POS data analytics; developed Power BI dashboards for KPI tracking and executive reporting.
  • Migrated pipeline orchestration from Oozie to Airflow; recreated application logic in Azure Data Lake and Azure Data Factory.
PythonPySparkScalaAzure DatabricksXGBoostLightGBMARIMAK-MeansSHAPMLflowDockerJenkinsKafkaAirflowPower BI
Big Data Engineer
🏥 United Health Group
Mar 2020 — Apr 2022
  • Used Azure Data Factory extensively for ingesting healthcare data from disparate source systems, automating jobs using Event, Scheduled, and Tumbling Window triggers.
  • Created numerous ADF v2 pipelines with Copy, Filter, ForEach, and Databricks notebook activities for end-to-end data movement and transformation.
  • Provisioned Databricks clusters for batch and streaming workloads; developed PySpark jobs for complex table-to-table operations and data transformations.
  • Ingested data in mini-batches and performed RDD transformations using Spark Streaming for streaming analytics in Databricks.
  • Integrated Azure Active Directory authentication to Cosmos DB requests; created Build and Release definitions for CI/CD using Azure DevOps.
  • Authored high-level technical design and application design documents; worked with complex SQL, Stored Procedures, and Triggers across large databases.
Azure Data FactoryAzure DatabricksPySparkCosmos DBSpark StreamingAzure DevOpsSnowflakeOracleMySQL
Data Engineer
🖥 EPAM • Hyderabad, India
Jan 2018 — Feb 2020
📍 Hyderabad, India
  • Administered and maintained Cloudera Hadoop clusters on Linux environments; managed cluster coordination via Zookeeper.
  • Wrote multiple MapReduce programs for data extraction, transformation, and aggregation from 20+ sources across XML, JSON, and CSV formats.
  • Created Hive external tables, loaded data, and performed HQL queries for ad-hoc analytics; created Oozie workflows orchestrating Sqoop imports and Hive scripts.
  • Worked in AWS (EC2, S3) for deployment of custom Hadoop applications; transferred data from S3 to Redshift using Informatica.
  • Utilized Python Matplotlib and Scikit-Learn for prototype visualizations and early ML experiments; generated business reports using SSRS.
  • Performed data validation using MapReduce by building custom models to filter invalid records and cleanse datasets.
HadoopHiveHBaseSqoopMapReduceAWS S3RedshiftScikit-LearnTableauSSISOozie
Portfolio

Key Projects

Highlights from enterprise AI/ML and data engineering work across multiple industries.

🔐
AI-Powered Real-Time Fraud Detection
Wells Fargo • 2023–Present

Built an end-to-end fraud detection system analyzing millions of transactions in real time using XGBoost, Isolation Forest, and LSTM models on Azure Databricks with Kafka streaming. Deployed as FastAPI microservices on AKS with sub-second latency.

XGBoostIsolation ForestLSTMAzure DatabricksKafkaFastAPIAKS
🤖
RAG-Based Fraud Investigation Assistant
Wells Fargo • 2023–Present

Built a Generative AI investigation assistant using RAG (FAISS + HuggingFace SentenceTransformers) and LangGraph multi-step reasoning to help fraud analysts retrieve historical cases and regulatory guidelines. Integrated GPT-4 via Azure OpenAI Service.

RAGLangChainLangGraphFAISSGPT-4Azure OpenAI
📊
Retail Demand Forecasting & Segmentation
Walmart • 2022–2023

Built demand forecasting models using XGBoost, LightGBM, and ARIMA to predict product demand across regions. Implemented K-Means customer segmentation on POS data to support targeted marketing. Managed full MLflow experiment tracking on Databricks.

XGBoostLightGBMARIMAK-MeansSHAPMLflowPower BI
ML Pipeline Orchestration & CI/CD
Walmart • 2022–2023

Designed end-to-end ML pipelines on Azure Databricks orchestrated via Apache Airflow. Deployed containerized scoring services with Docker and Jenkins CI/CD. Implemented Kafka + Spark Streaming for live POS analytics and Power BI dashboards for leadership.

AirflowDatabricksDockerJenkinsKafkaSpark Streaming
🏥
Healthcare Data Pipeline on Azure
United Health Group • 2020–2022

Built large-scale ETL workflows on Azure to process healthcare data from multiple source systems. Used ADF v2 with Databricks notebooks, Spark Streaming for mini-batch ingestion, and Azure DevOps CI/CD. Integrated Cosmos DB with Azure AD authentication.

Azure Data FactoryDatabricksPySparkCosmos DBAzure DevOps
🌀
Multi-Source Hadoop ETL Platform
EPAM • 2018–2020

Administered Cloudera Hadoop clusters on Linux and wrote MapReduce jobs to ETL data from 20+ heterogeneous sources. Used Scikit-Learn for early ML prototypes; transferred data between HDFS, AWS S3, and Redshift via Sqoop and Informatica.

HadoopMapReduceHiveAWS S3Scikit-LearnRedshift
Education

Education & Certifications

Academic foundation backed by years of hands-on AI/ML enterprise practice.

🎓
Master of Science — Data Science
Saint Peter's University — New Jersey, USA
Advanced graduate study in machine learning, statistical modeling, deep learning, big data systems, and applied AI — directly underpinning production ML systems built across finance, retail, and healthcare domains.
🎓
Bachelor of Technology — Computer Science & Engineering
JNTUH — Hyderabad, India
Strong foundation in software engineering, algorithms, data structures, databases, and distributed systems — forming the basis for 8+ years of enterprise AI/ML and data engineering practice.
AWS
AWS Certified Machine Learning Specialty
Amazon Web Services — Professional Certification
Validates expertise in building, training, tuning, and deploying ML models on AWS — covering SageMaker, feature engineering, model evaluation, and MLOps on the AWS platform.
SageMakerAWS S3LambdaEMREC2DynamoDB
Microsoft Certified: Azure Data Engineer Associate
Microsoft — Professional Certification
Validates expertise in designing and implementing data storage, processing, and security solutions using the full Azure data stack — Azure Data Factory, Databricks, Synapse Analytics, ADLS Gen2, and Azure ML.
Azure MLDatabricksData FactorySynapseADLS Gen2
🔳
Databricks Certified Generative AI Engineer Associate
Databricks — Professional Certification
Validates ability to design and build production Generative AI applications on the Databricks Lakehouse platform — including RAG pipelines, LLM fine-tuning, vector search, and deploying LLM-integrated apps with MLflow.
RAGLLM Fine-tuningVector SearchMLflowDatabricks
Contact

Get In Touch

Available for new AI/ML engineering roles, consulting, and collaborations.

Let's Work Together

Available for full-time roles, contract work, or consulting engagements in AI/ML engineering and cloud architecture.

📞
Phone
+1 (404) 851-1688
📍
Location
Newark, NJ, USA
Status
Open to Work

Send a Message

Fill out the form below and I'll get back to you promptly.