Abstract
BACKGROUND: Speech recognition is becoming increasingly common in the national healthcare system. One of the first specialties to implement this technology on a large scale was radiology. However, the efficiency of voice input and its effect on the length of time required to complete medical records remain unresolved.
 AIM: To assess the efficiency of speech recognition in generating radiological protocols of different modalities and types.
 METHODS: The retrospective study was conducted at the Moscow Reference Center of the Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Department of Health. A total of 12,912 radiological reports on fluorography, mammography, chest computed tomography (CT), contrast-enhanced magnetic resonance imaging (MRI) of the brain, and contrast-enhanced CT of the abdomen and pelvis were included in the study by simple random sampling. The size of all samples exceeded 766 reports, calculated with regard to the size of the general population of over 100,000 reports. The Voice2Med software was used to fill in the radiological protocols. Intergroup comparison was performed using the MannWhitney U-test with a statistical significance level of 0.05.
 RESULTS: The average duration of generating fluorographic protocols in the keyboard and voice input groups was 189.9 s (0:03:09) and 236.2 s (0:03:56), respectively (p 0.0001). For mammographic reports, the duration was 387.1 s (0:06:27) and 444.8 s (0:07:24), respectively (p 0.0001). For radiographic reports, it amounted to 247.8 s (0:04:07) and 189.0 s (0: 03:09), respectively (p 0.0001), and for chest CT, it was 379.7 s (0:06:19) and 382.7 s (0:06:22), respectively (p=0.12). For MRI of the brain, the protocols were generated for 709.9 s (0:11:49) and 559.9 s (0: 09:19), respectively (p 0.0001), and for contrast-enhanced chest, abdominal, and pelvic CT scans, it took 2714.6 s (0:45:15) and 1778.4 s (0:29:38), respectively. Voice input slowed down the preparation time of mammographic and fluorographic protocols. This is due to the use of a structured electronic medical document in medical facilities to describe the results of the examinations. Speech recognition showed the greatest efficiency in generating MRI and CT protocols. Such reports contain a large number of pathological changes, both target and incidental findings, which requires a detailed description by the radiologist in the examination protocol.
 CONCLUSIONS: Speech recognition in generating radiological protocols showed different efficiency depending on the modality and type of the radiological protocol filled in using the voice input system. This approach is optimal for describing CT and MRI scans.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have