Abstract
Generative artificial intelligence (GenAI) in general, and large language models (LLMs) in particular, are highly fashionable. As they have the ability to generate coherent output based on prompts in natural language, they are promoted as tools to free knowledge workers from tedious tasks such as content writing, customer support and routine computer code generation. Unsurprisingly, their application is also attractive to professionals in the research domain, where mundane and laborious tasks, such as literature screening, are commonplace. We evaluate Vertex AI ‘text-bison’, a foundational LLM model, in a real-world academic scenario by replicating parts of a popular systematic review in the information management domain. By comparing the results of a zero-shot LLM-based approach with those of the original study, we gather evidence on the suitability of state-of-the-art general-purpose LLMs for the analysis of scientific content. We show that the LLM-based approach delivers good scientific content analysis performance for a general classification problem (ACC = 0.9), acceptable performance for a domain-specific classification problem (ACC = 0.8) and borderline performance for a text comprehension problem (ACC ≈ 0.69). We conclude that some content analysis tasks with moderate accuracy requirements may be supported by current LLMs. As the technology will evolve rapidly in the foreseeable future, studies on large corpora, where some inaccuracies are tolerable, or workflows that prepare large data sets for human processing, may increasingly benefit from the capabilities of GenAI.
Original language | English |
---|---|
Title of host publication | 17th IEEE International Conference on Application of Information and Communication Technologies, AICT 2023 - Proceedings |
Publisher | IEEE |
ISBN (Electronic) | 9798350303568 |
DOIs | |
Publication status | Published - 13 Nov 2023 |
Event | 17th International Conference on Application of Information and Communication Technologies - ADA University, Baku, Azerbaijan Duration: 18 Oct 2023 → 20 Oct 2023 Conference number: 17 https://www.aict.info/?csc=2023 |
Publication series
Name | 17th IEEE International Conference on Application of Information and Communication Technologies, AICT 2023 - Proceedings |
---|
Conference
Conference | 17th International Conference on Application of Information and Communication Technologies |
---|---|
Abbreviated title | AICT |
Country/Territory | Azerbaijan |
City | Baku |
Period | 18/10/2023 → 20/10/2023 |
Internet address |
Keywords
- AI-Assisted Research
- Literature Screening
- Content Analysis
- Prompt Engineering
- Classification Performance