Architecting Intelligent Systems.

Co-founder & Lead AI Engineer specializing in RAG pipelines, agentic workflows, and scalable LLM architectures. Currently building agentic AI software for lawyers at an early-stage legal tech venture (stealth).

Winner 2025

Google Nano Banana Hackathon

Users Scaled

10K+

Years in AI

3+

The Philosophy

I bridge the gap between bleeding-edge research and production-ready applications. My focus is on Reliable AI: systems that are not just impressive in demos but robust enough for enterprise scale.

From winning the Google Nano Banana Hackathon 2025 to scaling systems from 100 to 10,000+ users, I specialize in minimizing hallucinations, optimizing retrieval latency, and designing intuitive agentic interfaces.

98%
Hallucination Reduction
45%
Accuracy Increase
30%
Throughput Boost

Career

Professional Path

Building AI systems that scale — from early research to production architecture.

Co-founder & Lead AI Engineer

Current · Full-time
Stealth · Legal AI · Colorado, USA (Remote)
Apr 2026 – Present

Co-founding and leading AI engineering at an early-stage legal tech venture. Building agentic AI software for lawyers — from retrieval over case law and contracts to multi-step legal reasoning workflows.

  • Designing agentic workflows that automate legal research, drafting, and review for practicing lawyers
  • Owning AI architecture end-to-end — retrieval, agent orchestration, evaluation, and production deployment
  • Shaping the technical direction of the product and the engineering culture of the team
Legal AI · Agentic Systems · RAG · LLM Architecture · Leadership

AI Consultant

Current · Part-time
Chorcha · Rajshahi, Bangladesh
Apr 2026 – Present

Focused on making high-quality AI accessible to Bangladeshi students, enabling more effective, engaging, and enjoyable learning experiences.

  • Architecting AI-powered learning systems tailored to the Bangladeshi education context
  • Advising on LLM integration, curriculum design, and responsible AI adoption
LLM · EdTech · AI Strategy

Applied AI Engineer (L-2)

Full-time
AskTuring.ai · La Jolla, CA, USA (Remote)
Jul 2025 – Apr 2026

Led the Core RAG & AI Team, owning the end-to-end architecture of a production RAG platform: retrieval, agent orchestration, evaluation, and a provider-agnostic model layer.

  • Scaled the platform from 100 to 10,000+ concurrent users at ChatGPT-level latency, without vendor lock-in
  • Reduced hallucinations by 98% via hybrid search, reranking, citation extraction, and strict source-grounding
  • Designed multi-agent workflows with explicit state management, improving system throughput by 30%
  • Built agentic RAG with multi-layer memory (short-term, long-term, semantic) and time-aware retrieval — lifted contextual accuracy by 45%
  • Implemented a citation system with cross-source referencing across documents, web sources, and memory layers
  • Optimized chat hot paths via prepare-then-query pre-computation, parallelized resolution, and reduced database round-trips — cutting end-to-end latency
  • Built persistent user memory end-to-end: extraction service, schemas, CRUD APIs, and chat-flow integration
  • Refactored the LLM provider/model layer into modular configs and rolled out new model tiers behind feature flags
  • Built an internal evaluation benchmark and RAG suite management tooling — cut evaluation time by 99% and enabled rapid model iteration
  • Built image generation and editing pipelines with pixel-level control, integrated into agent workflows as first-class tools
  • Migrated the backend to async SQLAlchemy and restructured conversation, message-source, and repository layers for scale
LangGraph · RAG · Multi-Agent · Agentic Memory · FastAPI · PostgreSQL · Redis · Async SQLAlchemy · Python

Machine Learning Engineer

Full-time
Sazim Tech Ltd · Dhaka, Bangladesh (Remote)
Oct 2023 – Jul 2025

Progressed from Trainee Engineer to ML Engineer over 1 year 10 months, building production-grade LLM integrations and private AI deployments for enterprise clients.

  • Designed a Ports and Adapters (Hexagonal) architecture for seamless multi-provider LLM integration (OpenAI, Anthropic, local LLMs)
  • Deployed an LLM evaluation pipeline with source-based fact-checking, improving response reliability by 35% and user trust by 25%
  • Reduced jailbreaking and other AI safety risks by 45% through multi-layer guardrails and advanced prompt engineering
  • Delivered a private, air-gapped on-premise AI solution for an enterprise client, reducing cloud dependency and latency by ~30%
  • Used Docker and self-hosted GPU infrastructure, cutting model startup time by 20–25%
  • As a trainee, built a full RAG application (3M+ sample database) using open-source LLMs, Next.js, LangChain, and FastAPI
NestJS · OpenAI · Docker · TypeScript · LLM Eval · Safety

ML Researcher & Engineer

Part-time
Intelsense AI · Dhaka, Bangladesh (Remote)
Sep 2022 – Sep 2023

Joined as an intern and progressed to Researcher & Engineer. Built conversational AI systems and contributed to Bengali-language NLP research.

  • Built Rasa-based chatbots for the financial services and mobile operator industries (PoC and pilot deployments)
  • Developed a multilingual restaurant chatbot supporting English, Banglish, and Bangla natural language inputs
  • Collaborated on a Bengali Automatic Speech Recognition (ASR) tool for high-accuracy speech-to-text conversion
  • Researched Voice Activity Detection (VAD) technologies to improve system efficiency and responsiveness
  • Served as Data Annotation Team Lead, managing NLP dataset collection, labeling, and quality assurance
Rasa · NLP · ASR · Python · Conversational AI

Data Science Apprentice

Internship
Cramstack · Dhaka, Bangladesh
Nov 2021 – Apr 2022

Early-career data science role working on OCR, text summarization, data visualization, and analytical reporting across client projects.

  • Evaluated and compared OCR libraries, contributing to a real-world document processing tool
  • Researched and applied modern text summarization algorithms to real-world client data
  • Built custom Google Data Studio dashboards for client data communication and stakeholder reporting
  • Pre-processed and cleaned complex datasets, designed interactive dashboards, and performed web scraping
Python · OCR · Data Analysis · NLP · Data Studio

Volunteer & Community

Robotic Society of RUET (RSR)

Volunteer
Technical Secretary → IT Manager → Executive Member · Rajshahi, Bangladesh

5 years 3 months of progressive leadership in RUET's robotics and engineering society. Grew from Executive Member to Technical Secretary, organizing technical events and managing digital infrastructure.

  • Technical Secretary (Nov 2023 – May 2024): Led technical initiatives and workshop programs for the society
  • IT Manager (Dec 2022 – Nov 2023): Managed the society's digital infrastructure, website, and technical operations
  • Executive Member (Mar 2019 – Dec 2022): Contributed to robotics competitions, events, and member activities
Leadership · Robotics · Community · Event Management
Mar 2019 – May 2024

Stack

Technical Arsenal

Tools and disciplines I rely on daily to ship production AI systems.

AI & Intelligence

Retrieval, reasoning, and memory systems.

01
RAG Pipelines · Agentic Workflows · LLM Fine-Tuning · Vector Databases · LangGraph · LlamaIndex · OpenAI Agent SDK · Pydantic AI · Evaluation Frameworks · Prompt Engineering · Multi-Agent Systems · Embeddings

Backend Engineering

APIs, databases, and scalable architecture.

02
FastAPI · Python · NestJS · TypeScript · PostgreSQL · Redis · REST & GraphQL · Hexagonal Architecture · Event-Driven Design · TDD

Cloud & MLOps

Deployment, observability, and scale.

03
Docker · Kubernetes · AWS EC2 / S3 · GCP Vertex AI · AWS SageMaker · GitHub Actions · Weights & Biases · Private AI Deployments · Air-Gapped Infra

Core Competencies

Craft, communication, and leadership.

04
System Design · API Design Patterns · Code Review · Technical Writing · Data Annotation QA · Team Leadership · Mentorship · Public Speaking

Research

Publications

Research spanning AI-generated content detection, healthcare systems, and NLP.

01
Research Publication · 2023

Unraveling the Enigmatic Frontier: Deciphering the Distinction Between AI-Generated and Real Images

Abu Bakar Siddik et al.

Investigates the boundary between AI-generated and authentic images using deep learning classification techniques, with implications for digital forensics and media authenticity.

Deep Learning · Computer Vision · AI Detection · Image Forensics
02
Research Publication · 2023

Real-time Patient Monitoring System to Reduce Medical Error with the help of Database System

Abu Bakar Siddik et al.

Proposes a real-time patient monitoring architecture leveraging database-backed alert systems to reduce clinical errors and improve healthcare outcomes.

Healthcare AI · Real-time Systems · Database Design · Patient Safety

Let's build the future of AI.

Open for strategic AI/ML consulting, technical collaborations, or deep technical discussions.

abubakar1808031@gmail.com
Rajshahi, Bangladesh