Preserving Privacy, Increasing Accessibility, and Reducing Cost: An On-Device Artificial Intelligence Model for Medical Transcription and Note Generation

Johnson Thomas, Ayush Mudgal, Wendao Liu, Nisten Tahiraj, Zeeshaan Mohammed, Dhruv Diddi

2025-07-08

Preserving Privacy, Increasing Accessibility, and Reducing Cost: An
On-Device Artificial Intelligence Model for Medical Transcription and Note
Generation

Summary

This paper talks about a new AI model based on Llama 3.2 1B that is fine-tuned to help doctors by automatically transcribing speech and generating medical notes directly on devices like phones or computers. It does this while keeping patient data private by working entirely on the device without sending data to the cloud.

What's the problem?

The problem is that doctors spend a lot of time writing medical notes, which is exhausting and takes away from patient care. Current AI tools often need cloud servers, which raise privacy concerns and may be too expensive or slow for smaller clinics.

What's the solution?

The researchers fine-tuned a smaller Llama 3.2 1B model using an efficient training method that adapts the model to medical transcription tasks. They trained it on synthetic medical conversations and structured notes so it can work well on real medical data. This model runs fully on the device, preserving privacy and reducing costs while improving transcription accuracy and note quality.

Why it matters?

This matters because it helps doctors save time and stress by automating medical documentation securely and affordably. It makes advanced AI tools accessible to smaller healthcare providers and protects sensitive patient information.

Abstract

A fine-tuned Llama 3.2 1B model using PEFT with LoRA improves medical transcription quality and enables on-device, privacy-preserving deployment in healthcare.

View Paper