Clinical RAG System

Production RAG for Multiple Myeloma (medRxiv, 36 citations)

Overview

A production Retrieval-Augmented Generation (RAG) system for clinical decision support in multiple myeloma.

The Problem

Clinicians needed rapid access to the latest multiple myeloma research across thousands of documents while ensuring accuracy and traceability for clinical decision-making.

Technical Architecture

RAG Pipeline

Built using LangGraph for sophisticated orchestration:

  • Embeddings: BAAI/bge-large-en-v1.5 (state-of-the-art dense retrieval)
  • Generation: Mistral-7B (optimized for clinical text)
  • Vector Store: Amazon OpenSearch with hybrid search
  • Reranking: Cohere reranker for precision

Document Processing

  • Processed 5,000+ clinical documents
  • Multi-format support (PDFs, clinical notes, research papers)
  • Chunk optimization for clinical context preservation

Results

Metric Value
Clinical Effectiveness 88%
Documents Processed 5,000+
Active Clinical Users 2-3 oncologists
Citations 36 (medRxiv)

Key Features

  • Source Attribution: Every response linked to source documents
  • Clinical Guardrails: Input/output guardrails prevent hallucination
  • Query Understanding: Handles complex clinical terminology
  • Hybrid Search: Combines semantic and keyword matching

This project resulted in the following publications:

Impact

First prototype of an AI-powered clinical decision support system deployed in active clinical use at a major cancer center, demonstrating the feasibility of production RAG systems in healthcare.

Technical Stack

Orchestration:  LangGraph
Embeddings:     BAAI/bge-large-en-v1.5
Generation:     Mistral-7B
Vector Store:   Amazon OpenSearch
Reranking:      Cohere
Deployment:     AWS (Bedrock, EC2)
Monitoring:     MLflow

References

2024

  1. medRxiv
    rag.png
    A RAG Chatbot for Precision Medicine of Multiple Myeloma
    Mujahid Ali Quidwai and Alessandro Lagana
    medRxiv, 2024
    36 citations
  2. IMS
    IMS.png
    2P-145: Innovative AI-Driven Decision Support Tool for Multiple Myeloma Using Retrieval Augmented Generation
    Mujahid Ali Quidwai, Santiago Thibaud, Joshua Richter, and 3 more authors
    Clinical Lymphoma Myeloma and Leukemia, 2024
    IMS 2024 Conference Abstract, 1 citation