Anna FP
Data AI Engineer — Building intelligent, data-driven systems.
Competencies
Data Engineering & Backend
Architecting robust Python pipelines and scalable async APIs. Designing multi-layer data flows across vector databases, Elasticsearch, and MongoDB, with strict schema validation via Pydantic and orchestration through Airflow and Docker.
AI & LLM Engineering
Orchestrating intelligent agents with LangGraph and LangChain. Building sophisticated RAG systems, agentic workflows with structured outputs, and multi-turn reasoning pipelines — integrated with external tools and MCP-compatible data sources.
Evaluation & MLOps
Implementing rigorous LLM evaluation, tracing, and drift detection with LangSmith and RAGAS. Managing the full MLOps lifecycle from experiment tracking and model registry to automated CI/CD eval suites and production model serving with MLflow.
Projects
ArtGuide
AI Art Detector and Audio Guide
ArtGuide is an AI-driven solution that recognizes artworks instantly and produces smooth, natural audio descriptions to enhance your experience in museums and exhibitions.
ML Resilience Lab
Resilient Real-Time Data Ingestion & Fraud Detection Pipeline
An experimental playground showcasing resilience patterns, fault tolerance, and chaos engineering principles within production-grade machine learning pipelines.
The News Hub
Real-Time News Engine and Insights
The News Hub is a comprehensive news aggregation and analysis platform driven by AI. It serves as a centralized system to collect, process, and interactively explore global news content efficiently. By combining advanced data engineering with natural language processing, it empowers users to stay informed through structured insights rather than just raw headlines.
Experience
Data Engineer
@ BMAT Music Innovators
- Engineered robust Python scripts to synchronize and maintain critical backend data flows, guaranteeing continuous data consistency across Dev, Staging, and Prod environments.
- Developed high-concurrency RESTful APIs using FastAPI and asynchronous programming, providing stable support for multiple downstream services and ensuring strict data validation with Pydantic.
- Optimized search performance and metadata discovery by implementing and managing Elasticsearch clusters for high-volume music catalog indexing, ensuring millisecond-latency retrieval.
- Designed and deployed interactive production dashboards, centralizing scattered data to facilitate real-time data visualization and decision-making for key stakeholders.