Available for consulting

Architecting Intelligent Systems.

Co-founder & Lead AI Engineer specializing in RAG pipelines, agentic workflows, and scalable LLM architectures. Currently building agentic AI for lawyers (stealth).

Explore Projects Read Writing

Winner 2025

Google Nano Banana Hackathon

Users Scaled

10K+

Years in AI

Core Stack

Pydantic AIClaude SDKLangGraphFastAPIRAGPostgreSQLDockerPython

Philosophy

Reliable AI, at Scale.

Bridging bleeding-edge research and production-ready applications.

Production First

Systems that aren't just impressive in demos, but robust enough for enterprise scale — 10,000+ concurrent users, zero-downtime.

Anti-Hallucination

98% reduction through hybrid search, reranking, and strict source-grounding. Reliable AI is the only kind worth deploying.

End-to-End Ownership

From retrieval architecture to evaluation frameworks and agent orchestration — I own the full stack and ship with confidence.

Writing

Latest Insights

Deep dives into the mechanics of modern AI.

6 min read·Jul 14, 2026

Where Should the AI Actually Go?

A model can be capable of doing a task and still be placed in the wrong part of the system. I learned this while building AI for a legal workflow.

aiarchitecturemachine-learninglegal-tech

read →

10 min read·May 7, 2026

The Ship You Can't Dock: Architectural Debt in the AI Era

In the fast-moving AI space, architectural debt isn't just about cutting corners—it's about reasonable decisions being invalidated by a shifting environment.

architectureaiengineeringtechnical-debt

read →

18 min read·Apr 29, 2026

Scaling to 1,500 Concurrent Users: PgBouncer and Null Pooling

A deep dive into why application-level pooling fails for long-running AI workflows and how to implement PgBouncer with statement-level pooling to handle 30x the load with 10x fewer resources.

postgrespgbouncerscalabilitybackendai

read →

View All Posts

Work

Selected Projects

Featured experiments and production platforms that push LLM capabilities.

Founder, Builder & Product Engineer

CareerKor

End-to-end AI career platform for candidates, recruiters, and platform operations.

AI WritingATS AnalysisInterview Coaching

Open Source AI Tooling

Axiom Wiki

Open-source knowledge compiler that turns documents and codebases into a self-maintaining markdown wiki.

MCPKnowledge CompilationSemantic Search

Award Winner

MagicSpin 360°

Generates interactive 360° rotations from single 2D images. Winner of Google Nano Banana Hackathon 2025.

Gemini Pro VisionStability AISegment Anything

Computer Vision Experiment

AI Virtual Try-On

Computer-vision experiment for realistic garment transfer and virtual clothing previews.

Stable DiffusionControlNetHuman Parsing

View Open Source

Career

Professional Path

Building AI systems that scale — from early research to production architecture.

Co-founder & Lead AI Engineer

CurrentFull-time

Stealth · Legal AI·Colorado, USA — Remote

Apr 2026 – Present

Co-founding and leading AI engineering at an early-stage legal tech venture. Building agentic AI software for lawyers — from retrieval over case law and contracts to multi-step legal reasoning workflows.

—Designing agentic workflows that automate legal research, drafting, and review for practicing lawyers
—Owning AI architecture end-to-end — retrieval, agent orchestration, evaluation, and production deployment
—Shaping the technical direction of the product and the engineering culture of the team

Legal AIAgentic SystemsRAGLLM ArchitectureLeadership

AI Consultant

CurrentPart-time

Chorcha·Rajshahi, Bangladesh

Apr 2026 – Present

Focused on making high-quality AI accessible to Bangladeshi students, enabling more effective, engaging, and enjoyable learning experiences.

—Architecting AI-powered learning systems tailored to the Bangladeshi education context
—Advising on LLM integration, curriculum design, and responsible AI adoption

LLMEdTechAI Strategy

Applied AI Engineer (L-2)

Full-time

AskTuring.ai·La Jolla, CA, USA — Remote

Jul 2025 – Apr 2026

Led the Core RAG & AI Team, owning end-to-end architecture of a production RAG platform — retrieval, agent orchestration, evaluation, and the model/provider layer — without vendor lock-in.

—Led Core RAG & AI Team — owned architecture from retrieval and agent orchestration to evaluation and the LLM provider layer
—Scaled the platform from 100 to 10,000+ concurrent users at ChatGPT-level latency, without vendor lock-in
—Reduced hallucinations by 98% via hybrid search, reranking, citation extraction, and strict source-grounding
—Designed multi-agent workflows with explicit state management, improving system throughput by 30%
—Built agentic RAG with multi-layer memory (short-term, long-term, semantic) and time-aware retrieval — lifted contextual accuracy by 45%
—Implemented a citation system with cross-source referencing across documents, web sources, and memory layers
—Optimized chat hot paths via prepare-then-query pre-computation, parallelized resolution, and reduced database round-trips — cutting end-to-end latency
—Built persistent user memory end-to-end: extraction service, schemas, CRUD APIs, and chat-flow integration
—Refactored the LLM provider/model layer into modular configs and rolled out new model tiers behind feature flags
—Built an internal evaluation benchmark and RAG suite management tooling — cut evaluation time by 99% and enabled rapid model iteration
—Built image generation and editing pipelines with pixel-level control, integrated into agent workflows as first-class tools
—Migrated the backend to async SQLAlchemy and restructured conversation, message-source, and repository layers for scale

LangGraphRAGMulti-AgentAgentic MemoryFastAPIPostgreSQLRedisAsync SQLAlchemyPython

Machine Learning Engineer

Full-time

Sazim Tech Ltd·Dhaka, Bangladesh — Remote

Oct 2023 – Jul 2025

Joined as Trainee Engineer and promoted to ML Engineer (L-1) after 4 months. Built production-grade LLM integrations and private AI deployments for enterprise clients.

—Designed Port and Adapter (Hexagonal) architecture for seamless multi-provider LLM integration (OpenAI, Anthropic, local LLMs)
—Deployed LLM evaluation pipeline with source-based fact-checking, improving response reliability by 35% and user trust by 25%
—Reduced AI safety and jailbreaking risks by 45% through multi-layer guardrails and advanced prompt engineering
—Delivered private on-premise AI solution for enterprise client — air-gapped, reducing cloud dependency and latency by ~30%
—Utilized Docker and self-hosted GPU infrastructure, cutting model startup time by 20–25%
—Built full RAG application (3M+ sample database) using open-source LLMs, Next.js, LangChain, and FastAPI as Trainee

NestJSOpenAIDockerTypeScriptLLM EvalSafety

ML Researcher & Engineer

Part-time

Intelsense AI·Dhaka, Bangladesh — Remote

Sep 2022 – Sep 2023

Joined as intern, progressed to Researcher & Engineer. Built conversational AI systems and contributed to Bengali language NLP research.

—Built Rasa-based chatbots for financial services and mobile operator industries (PoC and pilot deployments)
—Developed multilingual restaurant chatbot supporting English, Banglish, and Bangla natural language inputs
—Collaborated on Bengali Automatic Speech Recognition (ASR) tool — high-accuracy speech-to-text conversion
—Researched Voice Activity Detection (VAD) technologies to improve system efficiency and responsiveness
—Served as Data Annotation Team Lead, managing NLP dataset collection, labeling, and quality assurance

RasaNLPASRPythonConversational AI

Data Science Apprentice

Internship

Cramstack·Dhaka, Bangladesh

Nov 2021 – Apr 2022

Early-career data science role working on OCR, text summarization, data visualization, and analytical reporting across client projects.

—Evaluated and compared OCR libraries, contributing to a real-world document processing tool
—Researched and applied modern text summarization algorithms to real-world client data
—Built custom Google Data Studio dashboards for client data communication and stakeholder reporting
—Pre-processed and cleaned complex datasets, designed interactive dashboards, and performed web scraping

PythonOCRData AnalysisNLPData Studio

Volunteer & Community

Robotic Society of RUET (RSR)

Volunteer

Technical Secretary → IT Manager → Executive Member·Rajshahi, Bangladesh

5 years 3 months of progressive leadership in RUET's robotics and engineering society. Grew from Executive Member to Technical Secretary, organizing technical events and managing digital infrastructure.

—Technical Secretary (Nov 2023 – May 2024): Led technical initiatives and workshop programs for the society
—IT Manager (Dec 2022 – Nov 2023): Managed society's digital infrastructure, website, and technical operations
—Executive Member (Mar 2019 – Dec 2022): Contributed to robotics competitions, events, and member activities

LeadershipRoboticsCommunityEvent Management

Mar 2019 – May 2024

Education

Academic foundation.

Engineering and computer science at RUET.

BSc in Mechatronics Engineering

Rajshahi University of Engineering & Technology (RUET)

Jan 2019 – Jan 2024

Thesis: Deep learning in medical field

Relevant coursework: Artificial Intelligence, Machine Learning Algorithms, Software Engineering, Robotics, Numerical Analysis & Statistics, Digital Signal Processing & Machine Vision, Automation

MSc in Computer Science & Engineering

Rajshahi University of Engineering & Technology (RUET)

Ongoing

Nov 2025 – Present

Research: LLM alignment

Arsenal

Technical Stack

Tools and disciplines I rely on daily to ship production AI systems.

AI & Intelligence

Retrieval, reasoning, and memory systems.

RAG PipelinesAgentic WorkflowsLLM Fine-TuningVector DatabasesLangGraphLlamaIndexOpenAI Agent SDKPydantic AIEvaluation FrameworksPrompt EngineeringMulti-Agent SystemsEmbeddings

Backend Engineering

APIs, databases, and scalable architecture.

FastAPIPythonNestJSTypeScriptPostgreSQLRedisREST & GraphQLHexagonal ArchitectureEvent-Driven DesignTDD

Cloud & MLOps

Deployment, observability, and scale.

DockerKubernetesAWS EC2 / S3GCP Vertex AIAWS SageMakerGitHub ActionsWeights & BiasesPrivate AI DeploymentsAir-Gapped Infra

Core Competencies

Craft, communication, and leadership.

System DesignAPI Design PatternsCode ReviewTechnical WritingTeam LeadershipMentorshipPublic Speaking

Research

Publications

Research spanning AI-generated content detection, healthcare systems, and NLP.

Research Publication2023

Unraveling the Enigmatic Frontier: Deciphering the Distinction Between AI-Generated and Real Images

Abu Bakar Siddik et al.

Investigates the boundary between AI-generated and authentic images using deep learning classification techniques, with implications for digital forensics and media authenticity.

Deep LearningComputer VisionAI DetectionImage Forensics

Research Publication2023

Real-time Patient Monitoring System to Reduce Medical Error with the help of Database System

Abu Bakar Siddik et al.

Proposes a real-time patient monitoring architecture leveraging database-backed alert systems to reduce clinical errors and improve healthcare outcomes.

Healthcare AIReal-time SystemsDatabase DesignPatient Safety

Competitions