Hi, I'm

Hamida Jafarova

Building intelligent systems with machine learning, data analytics, and Generative AI.

About Me

I'm a Data Scientist based in Baku, Azerbaijan, currently working at NovaLingua where I build Python data pipelines and Tableau dashboards to track learner engagement across 30K+ user records. My work sits at the intersection of data analytics and machine learning — turning messy data into clear business insights.

Previously at Lavanda MMC, I built classification and regression models with Scikit-Learn and contributed to a RAG-based Q&A tool using LangChain that helped reduce consultant research time. My path into data science started during a UNDP-backed internship at the Khazar Women Resource Center, where I discovered the power of data-driven decision making.

I'm passionate about applying AI to real-world problems — from detecting corrosion in industrial pipelines to predicting faults in power transmission lines. My research on agricultural AI and cold-start problems has been accepted at SRC2026 for presentation at IEEE.

0+
Years Experience
0
Projects
0
Publication

Projects

CaspianShield

AI-Powered Corrosion Detection for Industrial Pipelines

Dec 2025 – Present
  • Built a CNN-based pipeline to detect and classify industrial pipeline corrosion from image data
  • Implemented severity-based scoring to help prioritise maintenance and reduce safety hazards
  • Applied data augmentation techniques to improve model performance across varying conditions
PythonTensorFlowCNNOpenCV

ECAI

AI-Powered Fault Prediction System for Power Transmission Lines

Mar 2026 – Present
  • Built an IoT-to-AI pipeline using Arduino sensors to capture real-time voltage, current, and temperature data
  • Developed LSTM and XGBoost models to classify normal vs fault conditions and predict failures
  • Designed a real-time monitoring dashboard that alerts operators to anomalies before blackouts occur
  • Engineered the full data flow from sensor collection through model inference to dashboard visualisation
PythonLSTMXGBoostArduinoFlask

RAG-based Q&A Tool

Intelligent Research Assistant — Lavanda MMC

  • Contributed to a RAG-based Q&A tool using LangChain for document retrieval and question answering
  • Helped reduce consultant research time by surfacing relevant information from large document collections
PythonLangChainRAGLLM

Research

Accepted — Poster/Demo Presentation

Addressing the Cold-Start Problem in Agricultural AI: A Synthetic Data–Driven Decision Support System for Smallholder Farming

Hamida Jafarova, Fatima Alakbarli

17th Student Research Conference on Applied Computing (SRC2026) Submitted to IEEE Xplore Digital Library

Experience

Data Analyst

@ NovaLingua
Oct 2025 – Present Baku, Azerbaijan
  • Built Python pipelines to clean, transform, and analyse 30K+ user learning records
  • Created Tableau dashboards tracking learner retention, engagement, and conversion metrics
  • Performed analysis on subscription data, helping identify patterns that improved retention by 20%
  • Wrote SQL queries and reporting workflows for content performance across 4 product modules

Data Scientist

@ Lavanda MMC
Oct 2024 – Oct 2025 Azerbaijan
  • Conducted exploratory data analysis across multiple business datasets to surface actionable trends
  • Built classification and regression models using Scikit-Learn, iterating on feature engineering and tuning
  • Contributed to a RAG-based Q&A tool using LangChain, helping reduce consultant research time
  • Prepared data-driven reports and visualisations to support stakeholder decision-making

Business Data Analyst Intern

@ Khazar Women Resource Center (UNDP)
Jun 2023 – Sep 2024 Baku, Azerbaijan
  • Managed digital documentation workflows, ensuring data integrity across UNDP-backed programs
  • Built project tracking spreadsheets to monitor timelines, budgets, and deliverables
  • Prepared weekly status reports, improving cross-team visibility on project progress
  • Analysed project execution phases to identify bottlenecks and improve planning efficiency

Skills

Programming

Python R SQL

ML / AI

TensorFlow Scikit-Learn RAG LangChain LLM Integration Prompt Engineering

Data & Analytics

Pandas NumPy EDA Statistical Modelling Data Cleaning

Visualization

Tableau Power BI Excel

Engineering

Flask FastAPI Docker Git

Cloud & Big Data

AWS Azure Hadoop Spark

Education

Bachelor of Science in Computer Science

ADA University

Baku, Azerbaijan

Get In Touch

Have a question or want to work together? Feel free to reach out.

Location

Baku, Azerbaijan