Available for consulting

Architecting Intelligent Systems.

Co-founder & Lead AI Engineer specializing in RAG pipelines, agentic workflows, and scalable LLM architectures. Currently building agentic AI for lawyers (stealth).

Winner 2025: Google Nano Banana Hackathon
Users Scaled: 10K+
Years in AI: 3+
Core Stack: Pydantic AI · Claude SDK · LangGraph · FastAPI · RAG · PostgreSQL · Docker · Python
Philosophy

Reliable AI, at Scale.

Bridging bleeding-edge research and production-ready applications.

01

Production First

Systems that aren't just impressive in demos, but robust enough for enterprise scale: 10,000+ concurrent users with zero downtime.

02

Anti-Hallucination

A 98% reduction in hallucinations through hybrid search, reranking, and strict source-grounding. Reliable AI is the only kind worth deploying.

03

End-to-End Ownership

From retrieval architecture to evaluation frameworks and agent orchestration — I own the full stack and ship with confidence.

Career

Professional Path

Building AI systems that scale — from early research to production architecture.

Co-founder & Lead AI Engineer

Current · Full-time
Stealth · Legal AI · Colorado, USA (Remote)
Apr 2026 – Present

Co-founding and leading AI engineering at an early-stage legal tech venture. Building agentic AI software for lawyers — from retrieval over case law and contracts to multi-step legal reasoning workflows.

  • Designing agentic workflows that automate legal research, drafting, and review for practicing lawyers
  • Owning AI architecture end-to-end — retrieval, agent orchestration, evaluation, and production deployment
  • Shaping the technical direction of the product and the engineering culture of the team
Legal AI · Agentic Systems · RAG · LLM Architecture · Leadership

AI Consultant

Current · Part-time
Chorcha · Rajshahi, Bangladesh
Apr 2026 – Present

Focused on making high-quality AI accessible to Bangladeshi students, enabling more effective, engaging, and enjoyable learning experiences.

  • Architecting AI-powered learning systems tailored to the Bangladeshi education context
  • Advising on LLM integration, curriculum design, and responsible AI adoption
LLM · EdTech · AI Strategy

Applied AI Engineer (L-2)

Full-time
AskTuring.ai · La Jolla, CA, USA (Remote)
Jul 2025 – Apr 2026

Led the Core RAG & AI Team, owning the end-to-end architecture of a production RAG platform (retrieval, agent orchestration, evaluation, and the model/provider layer) without vendor lock-in.

  • Led Core RAG & AI Team — owned architecture from retrieval and agent orchestration to evaluation and the LLM provider layer
  • Scaled the platform from 100 to 10,000+ concurrent users at ChatGPT-level latency, without vendor lock-in
  • Reduced hallucinations by 98% via hybrid search, reranking, citation extraction, and strict source-grounding
  • Designed multi-agent workflows with explicit state management, improving system throughput by 30%
  • Built agentic RAG with multi-layer memory (short-term, long-term, semantic) and time-aware retrieval — lifted contextual accuracy by 45%
  • Implemented a citation system with cross-source referencing across documents, web sources, and memory layers
  • Optimized chat hot paths via prepare-then-query pre-computation, parallelized resolution, and reduced database round-trips — cutting end-to-end latency
  • Built persistent user memory end-to-end: extraction service, schemas, CRUD APIs, and chat-flow integration
  • Refactored the LLM provider/model layer into modular configs and rolled out new model tiers behind feature flags
  • Built an internal evaluation benchmark and RAG suite management tooling — cut evaluation time by 99% and enabled rapid model iteration
  • Built image generation and editing pipelines with pixel-level control, integrated into agent workflows as first-class tools
  • Migrated the backend to async SQLAlchemy and restructured conversation, message-source, and repository layers for scale
LangGraph · RAG · Multi-Agent · Agentic Memory · FastAPI · PostgreSQL · Redis · Async SQLAlchemy · Python

Machine Learning Engineer

Full-time
Sazim Tech Ltd · Dhaka, Bangladesh (Remote)
Oct 2023 – Jul 2025

Joined as a Trainee Engineer and was promoted to ML Engineer (L-1) after 4 months. Built production-grade LLM integrations and private AI deployments for enterprise clients.

  • Designed a Ports and Adapters (Hexagonal) architecture for seamless multi-provider LLM integration (OpenAI, Anthropic, local LLMs)
  • Deployed an LLM evaluation pipeline with source-based fact-checking, improving response reliability by 35% and user trust by 25%
  • Reduced AI safety and jailbreaking risks by 45% through multi-layer guardrails and advanced prompt engineering
  • Delivered a private, air-gapped, on-premises AI solution for an enterprise client, reducing cloud dependency and latency by ~30%
  • Used Docker and self-hosted GPU infrastructure to cut model startup time by 20–25%
  • As a Trainee, built a full RAG application (3M+ sample database) using open-source LLMs, Next.js, LangChain, and FastAPI
NestJS · OpenAI · Docker · TypeScript · LLM Eval · Safety

ML Researcher & Engineer

Part-time
Intelsense AI · Dhaka, Bangladesh (Remote)
Sep 2022 – Sep 2023

Joined as an intern and progressed to Researcher & Engineer. Built conversational AI systems and contributed to Bengali language NLP research.

  • Built Rasa-based chatbots for the financial services and mobile operator industries (PoC and pilot deployments)
  • Developed a multilingual restaurant chatbot supporting English, Banglish, and Bangla natural language inputs
  • Collaborated on a Bengali Automatic Speech Recognition (ASR) tool for high-accuracy speech-to-text conversion
  • Researched Voice Activity Detection (VAD) technologies to improve system efficiency and responsiveness
  • Served as Data Annotation Team Lead, managing NLP dataset collection, labeling, and quality assurance
Rasa · NLP · ASR · Python · Conversational AI

Data Science Apprentice

Internship
Cramstack · Dhaka, Bangladesh
Nov 2021 – Apr 2022

Early-career data science role working on OCR, text summarization, data visualization, and analytical reporting across client projects.

  • Evaluated and compared OCR libraries, contributing to a real-world document processing tool
  • Researched and applied modern text summarization algorithms to real-world client data
  • Built custom Google Data Studio dashboards for client data communication and stakeholder reporting
  • Pre-processed and cleaned complex datasets, designed interactive dashboards, and performed web scraping
Python · OCR · Data Analysis · NLP · Data Studio

Volunteer & Community

Robotic Society of RUET (RSR)

Volunteer
Technical Secretary → IT Manager → Executive Member · Rajshahi, Bangladesh

5 years 3 months of progressive leadership in RUET's robotics and engineering society. Grew from Executive Member to Technical Secretary, organizing technical events and managing digital infrastructure.

  • Technical Secretary (Nov 2023 – May 2024): Led technical initiatives and workshop programs for the society
  • IT Manager (Dec 2022 – Nov 2023): Managed society's digital infrastructure, website, and technical operations
  • Executive Member (Mar 2019 – Dec 2022): Contributed to robotics competitions, events, and member activities
Leadership · Robotics · Community · Event Management
Mar 2019 – May 2024
Arsenal

Technical Stack

Tools and disciplines I rely on daily to ship production AI systems.

01
AI & Intelligence
Retrieval, reasoning, and memory systems.
RAG Pipelines · Agentic Workflows · LLM Fine-Tuning · Vector Databases · LangGraph · LlamaIndex · OpenAI Agents SDK · Pydantic AI · Evaluation Frameworks · Prompt Engineering · Multi-Agent Systems · Embeddings
02
Backend Engineering
APIs, databases, and scalable architecture.
FastAPI · Python · NestJS · TypeScript · PostgreSQL · Redis · REST & GraphQL · Hexagonal Architecture · Event-Driven Design · TDD
03
Cloud & MLOps
Deployment, observability, and scale.
Docker · Kubernetes · AWS EC2 / S3 · GCP Vertex AI · AWS SageMaker · GitHub Actions · Weights & Biases · Private AI Deployments · Air-Gapped Infra
04
Core Competencies
Craft, communication, and leadership.
System Design · API Design Patterns · Code Review · Technical Writing · Team Leadership · Mentorship · Public Speaking
Research

Publications

Research spanning AI-generated content detection, healthcare systems, and NLP.

01
Research Publication · 2023

Unraveling the Enigmatic Frontier: Deciphering the Distinction Between AI-Generated and Real Images

Abu Bakar Siddik et al.

Investigates the boundary between AI-generated and authentic images using deep learning classification techniques, with implications for digital forensics and media authenticity.

Deep Learning · Computer Vision · AI Detection · Image Forensics
02
Research Publication · 2023

Real-time Patient Monitoring System to Reduce Medical Error with the help of Database System

Abu Bakar Siddik et al.

Proposes a real-time patient monitoring architecture leveraging database-backed alert systems to reduce clinical errors and improve healthcare outcomes.

Healthcare AI · Real-time Systems · Database Design · Patient Safety
Contact

Let's build the future of AI.

Open for strategic AI/ML consulting, technical collaborations, or deep technical discussions.

abubakar1808031@gmail.com · Rajshahi, Bangladesh