Hello, I'm

Anna FP

Data AI Engineer — Building intelligent, data-driven systems.

Competencies

Data Engineering & Backend

Architecting robust Python pipelines and scalable async APIs.Designing multi- layer data flows across Vector databases, Elasticsearch, and MongoDB, with strict schema validation via Pydantic and orchestration through Airflow and Docker.

PythonApache AirflowETL PipelinesPydanticVector DatabasesQdrantChromaDBMongoDBElasticsearchSQLDockerDocker ComposeFastAPIAsync PythonREST APIsRedisGitHub ActionsCI/CD

AI & LLM Engineering

Orchestrating intelligent agents with LangGraph and LangChain. Building sophisticated RAG systems, agentic workflows with structured outputs, and multi-turn reasoning pipelines — integrated with external tools and MCP-compatible data sources.

LLM OrchestrationLangGraphLangChainRAGAgentic SystemsPrompt EngineeringStructured OutputsFine-tuningHuman-in-the-LoopTool Using & MCPs

Evaluation & MLOps

Implementing rigorous LLM evaluation, tracing, and drift detection with LangSmith and RAGAS. Managing the full MLOps lifecycle from experiment tracking and model registry to automated CI/CD eval suites and production model serving with MLflow.

LLM EvaluationLangSmithRAGASCircuit BreakersDrift DetectionLLM TracingMLflowScikit-learnXGBoostSupervised LearningImbalanced DatasetsClusteringPCAMultimodal ModelsExperiment TrackingModel Registry & ServingCI/CD for ML

Projects

ArtGuide

ArtGuide

AI Art Detector and Audio Guide

ArtGuide is an AI-driven solution that recognizes artworks instantly and produces smooth, natural audio descriptions to enhance your experience in museums and exhibitions.

PythonDockerQdrantDBLangchainCLIPPiperTTSLanggraphFastAPIOpenAIStreamlit
ML Resilience Lab

ML Resilience Lab

Resilient Real-Time Data Ingestion & Fraud Detection Pipeline

An experimental playground showcasing resilience patterns, fault tolerance, and chaos engineering principles within production-grade machine learning pipelines.

PythonMongoDBMLflowScikitLearnPydanticRandom ForestXGBoostDockerMedallion ArchuitectureResilience PrinciplesData DriftsChaos EngineeringCircuit BreakersModel Registry & ServiceKill SwitchImbalanced Datasets
The News Hub

The News Hub

Real-Time News Engine and Insights

The News Hub is a comprehensive news aggregation and analysis platform driven by AI. It serves as a centralized system to collect, process, and interactively explore global news content efficiently. By combining advanced data engineering with natural language processing, it empowers users to stay informed through structured insights rather than just raw headlines.

PythonAirflowSklearnDockerChromaDBMongoDBLangchainFastAPIOpenAINext.js

Experience

Data Engineer

@ BMAT Music Innovators
Feb 2024 - Present
  • Engineered robust Python scripts to synchronize and maintain critical backend data flows, guaranteeing continuous data consistency across Dev, Staging, and Prod environments.
  • Developed high-concurrency RESTful APIs using FastAPI and asynchronous programming, providing stable support for multiple downstream services and ensuring strict data validation with Pydantic.
  • Optimized search performance and metadata discovery by implementing and managing Elasticsearch clusters for high-volume music catalog indexing, ensuring millisecond-latency retrieval.
  • Designed and deployed interactive production dashboards, centralizing scattered data to facilitate real-time data visualization and decision-making for key stakeholders.
PythonAirflowDockerMongoDBElasticsearchFastAPIDashETLPydantic