Architecting Intelligent Systems.
Co-founder & Lead AI Engineer specializing in RAG pipelines, agentic workflows, and scalable LLM architectures. Currently building agentic AI software for lawyers at an early-stage legal tech venture (stealth).
Winner 2025
Google Nano Banana Hackathon
Users Scaled
10K+
Years in AI
3+
Winner 2025
Google Nano Banana Hackathon
10K+
3+
Core Stack
The Philosophy
I bridge the gap between bleeding-edge research and production-ready applications. My focus is on Reliable AI—systems that aren't just impressive in demos, but robust enough for enterprise scale.
From winning the Google Nano Banana Hackathon 2025 to scaling systems from 100 to 10,000+ users, I specialize in minimizing hallucinations, optimizing retrieval latencies, and designing intuitive agentic interfaces.
Latest Insights
Deep dives into the mechanics of modern AI.
Selected Work
Featured experiments and production platforms that push the boundaries of LLM capabilities.
More experiments on GitHub.
View Open SourceCareer
Professional Path
Building AI systems that scale — from early research to production architecture.
Co-founder & Lead AI Engineer
CurrentFull-timeCo-founding and leading AI engineering at an early-stage legal tech venture. Building agentic AI software for lawyers — from retrieval over case law and contracts to multi-step legal reasoning workflows.
- —Designing agentic workflows that automate legal research, drafting, and review for practicing lawyers
- —Owning AI architecture end-to-end — retrieval, agent orchestration, evaluation, and production deployment
- —Shaping the technical direction of the product and the engineering culture of the team
AI Consultant
CurrentPart-timeFocused on making high-quality AI accessible to Bangladeshi students, enabling more effective, engaging, and enjoyable learning experiences.
- —Architecting AI-powered learning systems tailored to the Bangladeshi education context
- —Advising on LLM integration, curriculum design, and responsible AI adoption
Applied AI Engineer (L-2)
Full-timeLed the Core RAG & AI Team, owning end-to-end architecture of a production RAG platform — retrieval, agent orchestration, evaluation, and the model/provider layer — without vendor lock-in.
- —Led Core RAG & AI Team — owned architecture from retrieval and agent orchestration to evaluation and the LLM provider layer
- —Scaled the platform from 100 to 10,000+ concurrent users at ChatGPT-level latency, without vendor lock-in
- —Reduced hallucinations by 98% via hybrid search, reranking, citation extraction, and strict source-grounding
- —Designed multi-agent workflows with explicit state management, improving system throughput by 30%
- —Built agentic RAG with multi-layer memory (short-term, long-term, semantic) and time-aware retrieval — lifted contextual accuracy by 45%
- —Implemented a citation system with cross-source referencing across documents, web sources, and memory layers
- —Optimized chat hot paths via prepare-then-query pre-computation, parallelized resolution, and reduced database round-trips — cutting end-to-end latency
- —Built persistent user memory end-to-end: extraction service, schemas, CRUD APIs, and chat-flow integration
- —Refactored the LLM provider/model layer into modular configs and rolled out new model tiers behind feature flags
- —Built an internal evaluation benchmark and RAG suite management tooling — cut evaluation time by 99% and enabled rapid model iteration
- —Built image generation and editing pipelines with pixel-level control, integrated into agent workflows as first-class tools
- —Migrated the backend to async SQLAlchemy and restructured conversation, message-source, and repository layers for scale
Machine Learning Engineer
Full-timeProgressed from Trainee Engineer to ML Engineer over 1 year 10 months, building production-grade LLM integrations and private AI deployments for enterprise clients.
- —Designed Port and Adapter (Hexagonal) architecture for seamless multi-provider LLM integration (OpenAI, Anthropic, local LLMs)
- —Deployed LLM evaluation pipeline with source-based fact-checking, improving response reliability by 35% and user trust by 25%
- —Reduced AI safety and jailbreaking risks by 45% through multi-layer guardrails and advanced prompt engineering
- —Delivered private on-premise AI solution for enterprise client — air-gapped, reducing cloud dependency and latency by ~30%
- —Utilized Docker and self-hosted GPU infrastructure, cutting model startup time by 20–25%
- —Built full RAG application (3M+ sample database) using open-source LLMs, Next.js, LangChain, and FastAPI as Trainee
ML Researcher & Engineer
Part-timeJoined as intern, progressed to Researcher & Engineer. Built conversational AI systems and contributed to Bengali language NLP research.
- —Built Rasa-based chatbots for financial services and mobile operator industries (PoC and pilot deployments)
- —Developed multilingual restaurant chatbot supporting English, Banglish, and Bangla natural language inputs
- —Collaborated on Bengali Automatic Speech Recognition (ASR) tool — high-accuracy speech-to-text conversion
- —Researched Voice Activity Detection (VAD) technologies to improve system efficiency and responsiveness
- —Served as Data Annotation Team Lead, managing NLP dataset collection, labeling, and quality assurance
Data Science Apprentice
InternshipEarly-career data science role working on OCR, text summarization, data visualization, and analytical reporting across client projects.
- —Evaluated and compared OCR libraries, contributing to a real-world document processing tool
- —Researched and applied modern text summarization algorithms to real-world client data
- —Built custom Google Data Studio dashboards for client data communication and stakeholder reporting
- —Pre-processed and cleaned complex datasets, designed interactive dashboards, and performed web scraping
Volunteer & Community
Robotic Society of RUET (RSR)
Volunteer5 years 3 months of progressive leadership in RUET's robotics and engineering society. Grew from Executive Member to Technical Secretary, organizing technical events and managing digital infrastructure.
- —Technical Secretary (Nov 2023 – May 2024): Led technical initiatives and workshop programs for the society
- —IT Manager (Dec 2022 – Nov 2023): Managed society's digital infrastructure, website, and technical operations
- —Executive Member (Mar 2019 – Dec 2022): Contributed to robotics competitions, events, and member activities
Stack
Technical Arsenal
Tools and disciplines I rely on daily to ship production AI systems.
AI & Intelligence
Retrieval, reasoning, and memory systems.
Backend Engineering
APIs, databases, and scalable architecture.
Cloud & MLOps
Deployment, observability, and scale.
Core Competencies
Craft, communication, and leadership.
Research
Publications
Research spanning AI-generated content detection, healthcare systems, and NLP.
Unraveling the Enigmatic Frontier: Deciphering the Distinction Between AI-Generated and Real Images
Abu Bakar Siddik et al.
Investigates the boundary between AI-generated and authentic images using deep learning classification techniques, with implications for digital forensics and media authenticity.
Real-time Patient Monitoring System to Reduce Medical Error with the help of Database System
Abu Bakar Siddik et al.
Proposes a real-time patient monitoring architecture leveraging database-backed alert systems to reduce clinical errors and improve healthcare outcomes.
Competitions
Achievements
Hackathons, Kaggle, and ML competitions across healthcare, NLP, and computer vision.
Let's build the future of AI.
Open for strategic AI/ML consulting, technical collaborations, or deep technical discussions.