Architecting Intelligent Systems.
AI Engineer & Team Lead specializing in RAG pipelines, agentic workflows, and scalable LLM architectures. Currently leading core AI at AskTuring.ai.
Winner 2025
Google Nano Banana Hackathon
Users Scaled
10K+
Years in AI
3+
Winner 2025
Google Nano Banana Hackathon
10K+
3+
Core Stack
The Philosophy
I bridge the gap between bleeding-edge research and production-ready applications. My focus is on Reliable AI—systems that aren't just impressive in demos, but robust enough for enterprise scale.
From winning the Google Nano Banana Hackathon 2025 to scaling systems from 100 to 10,000+ users, I specialize in minimizing hallucinations, optimizing retrieval latencies, and designing intuitive agentic interfaces.
Latest Insights
Deep dives into the mechanics of modern AI.
Selected Work
Featured experiments and production platforms that push the boundaries of LLM capabilities.
More experiments on GitHub.
View Open SourceCareer
Professional Path
Building AI systems that scale — from early research to production architecture.
AI Consultant
CurrentPart-timeFocused on making high-quality AI accessible to Bangladeshi students, enabling more effective, engaging, and enjoyable learning experiences.
- —Architecting AI-powered learning systems tailored to the Bangladeshi education context
- —Advising on LLM integration, curriculum design, and responsible AI adoption
Applied AI Engineer (L-2)
Full-timeLed the Core RAG & AI Team, architecting and shipping a production-ready RAG platform handling real-world workloads without vendor lock-in.
- —Led Core RAG & AI Team — owned end-to-end architecture from retrieval design to agent orchestration
- —Built production-ready RAG system without vendor lock-in, handling real-world enterprise workloads at scale
- —Designed multi-agent workflows with explicit state management, improving system throughput by 30%
- —Developed advanced agentic RAG with multiple memory layers (short-term, long-term, semantic), increasing contextual accuracy by 45%
- —Implemented time-aware RAG with temporal context understanding for recency-based retrieval and dynamic data
- —Developed web search integration with deep answer feature combining multiple sources, fact-checking, and synthesis
- —Implemented full-fledged citation system with proper referencing across files, web sources, and memory layers
- —Reduced hallucinations by 98% using hybrid search, reranking models, and strict source-grounding strategies
- —Built internal evaluation benchmark, cutting evaluation time by 99% and enabling rapid model iteration
- —Scaled system from 100 to 10,000+ concurrent users with latency optimizations achieving ChatGPT-level response speeds
- —Developed image generation and editing pipelines with pixel-level control, integrated into agent workflows
Machine Learning Engineer
Full-timeProgressed from Trainee Engineer to ML Engineer over 1 year 10 months, building production-grade LLM integrations and private AI deployments for enterprise clients.
- —Designed Port and Adapter (Hexagonal) architecture for seamless multi-provider LLM integration (OpenAI, Anthropic, local LLMs)
- —Deployed LLM evaluation pipeline with source-based fact-checking, improving response reliability by 35% and user trust by 25%
- —Reduced AI safety and jailbreaking risks by 45% through multi-layer guardrails and advanced prompt engineering
- —Delivered private on-premise AI solution for enterprise client — air-gapped, reducing cloud dependency and latency by ~30%
- —Utilized Docker and self-hosted GPU infrastructure, cutting model startup time by 20–25%
- —Built full RAG application (3M+ sample database) using open-source LLMs, Next.js, LangChain, and FastAPI as Trainee
ML Researcher & Engineer
Part-timeJoined as intern, progressed to Researcher & Engineer. Built conversational AI systems and contributed to Bengali language NLP research.
- —Built Rasa-based chatbots for financial services and mobile operator industries (PoC and pilot deployments)
- —Developed multilingual restaurant chatbot supporting English, Banglish, and Bangla natural language inputs
- —Collaborated on Bengali Automatic Speech Recognition (ASR) tool — high-accuracy speech-to-text conversion
- —Researched Voice Activity Detection (VAD) technologies to improve system efficiency and responsiveness
- —Served as Data Annotation Team Lead, managing NLP dataset collection, labeling, and quality assurance
Data Science Apprentice
InternshipEarly-career data science role working on OCR, text summarization, data visualization, and analytical reporting across client projects.
- —Evaluated and compared OCR libraries, contributing to a real-world document processing tool
- —Researched and applied modern text summarization algorithms to real-world client data
- —Built custom Google Data Studio dashboards for client data communication and stakeholder reporting
- —Pre-processed and cleaned complex datasets, designed interactive dashboards, and performed web scraping
Volunteer & Community
Robotic Society of RUET (RSR)
Volunteer5 years 3 months of progressive leadership in RUET's robotics and engineering society. Grew from Executive Member to Technical Secretary, organizing technical events and managing digital infrastructure.
- —Technical Secretary (Nov 2023 – May 2024): Led technical initiatives and workshop programs for the society
- —IT Manager (Dec 2022 – Nov 2023): Managed society's digital infrastructure, website, and technical operations
- —Executive Member (Mar 2019 – Dec 2022): Contributed to robotics competitions, events, and member activities
Stack
Technical Arsenal
Tools and disciplines I rely on daily to ship production AI systems.
AI & Intelligence
Retrieval, reasoning, and memory systems.
Backend Engineering
APIs, databases, and scalable architecture.
Cloud & MLOps
Deployment, observability, and scale.
Core Competencies
Craft, communication, and leadership.
Research
Publications
Research spanning AI-generated content detection, healthcare systems, and NLP.
Unraveling the Enigmatic Frontier: Deciphering the Distinction Between AI-Generated and Real Images
Abu Bakar Siddik et al.
Investigates the boundary between AI-generated and authentic images using deep learning classification techniques, with implications for digital forensics and media authenticity.
Real-time Patient Monitoring System to Reduce Medical Error with the help of Database System
Abu Bakar Siddik et al.
Proposes a real-time patient monitoring architecture leveraging database-backed alert systems to reduce clinical errors and improve healthcare outcomes.
Competitions
Achievements
Hackathons, Kaggle, and ML competitions across healthcare, NLP, and computer vision.
Let's build the future of AI.
Open for strategic AI/ML consulting, technical collaborations, or deep technical discussions.