Application of large language models in medicine
Liu F., Zhou H., Gu B., Zou X., Huang J., Wu J., Li Y., Chen SS., Hua Y., Zhou P., Liu J., Mao C., You C., Wu X., Zheng Y., Clifton L., Li Z., Luo J., Clifton DA.
Large language models (LLMs), such as ChatGPT, have received great attention owing to their capabilities for understanding and generating human language. Despite a trend in researching the application of LLMs in supporting different medical tasks (such as enhancing clinical diagnostics and providing medical education), a comprehensive assessment of their development, practical applications and outcomes in the medical space is still missing. Therefore, this Review aims to provide an overview of the development and deployment of LLMs in medicine, including the challenges and opportunities they face. In terms of development, we discuss the principles of existing medical LLMs, including their basic model structures, number of parameters, and sources and scales of data used for model development. In terms of deployment, we compare different LLMs across various medical tasks and with state-of-the-art lightweight models.