- Accepted to npj Digital Medicine
Using "specialty voice-to-text and RAG," LAOS creates a full-cycle, closed loop from doctor-patient dialogue to evidence-based medical records. It also introduces a groundbreaking "clinical-semantic" dual-evaluation model, which allows the AI to understand doctors' jargon while ensuring the medical logic is perfectly sound.
The study is the first to systematically address the issue of documentation overload for ophthalmologists. It proves the superiority of AI-generated records in a clinical environment, achieving a 62% boost in documentation speed, cutting doctors' daily overtime by an hour, and showing a significantly lower rate of critical medical errors compared to manual entry.

- Data-Driven, Highly Customized Architecture
3) A Pioneering "NLP + Clinical" Dual-Evaluation Framework
- Rapid Response: Average speech-to-text latency is just 0.3 seconds. The system supports 30 minutes of continuous processing, with generation times far shorter than manual documentation.
- Leading Scores: It achieved a comprehensive clinical evaluation score of 84.1, significantly higher than the available baseline.
Discharge summaries showed the best performance due to their standardized structure. Although surgical records are more challenging (with variable procedures and frequent unexpected intraoperative events), the system still delivered statistically significant improvements in key sections like "Intraoperative Findings."

Laos_System
โโโ README.md # Project documentation
โโโ requirements.txt # Python dependencies
โโโ prompt.py # LLM prompts
โโโ utils.py # Utility functions
โโโ main.py # Launch script
โโโ configs/ # Configuration files
โ โโโ laos_default.yaml # Detail parameters
โโโ data/ # Data files
โ โโโ inputs/patient_001.json # Patient information
โ โโโ refs/patient_001_admission_ref.json # Admission Struction
โโโ outputs/ generated after running # Generated results
-
Install dependencies
- Python 3.10+ recommended
- For GPU: install a PyTorch build that matches your environment (see โDependency installation tipsโ)
- Other dependencies:
pip install -r requirements.txt
-
Run a sample (for Qwen/LLaMA, ensure you can pull from Hugging Face or have local weights)
- Edit paths in
configs/laos_default.yaml(e.g., patient_json / audio_path) - Run:
python main.py --config configs/laos_default.yaml - Outputs will be written to
outputs/, for example:outputs/patient_001_admission_pred.json
- Edit paths in
- task: admission | surgery | discharge
- io:
- patient_json: input patient info / composed JSON including transcribed speech
- out_dir: directory for outputs
- asr: optional, enable FunASR Paraformer for speech-to-text and append to input
- search: optional, unified retrieval interface (leave base_url empty if you donโt have a service)
- doctor_feedback: optional, clinician review notes (text or file)
- llm: choose inference backend and model (suggest Qwen2-*-Instruct or Meta-Llama-3/3.1-Instruct)
- logging: log level