Abstract
The number of computer science papers being published is large and growing, which makes it challenging to keep up with the latest research. This paper describes the development of a large language model (LLM) augmentation chatbot and user interface that responds to research queries in the domain of computer science. Around 200,000 computer science research papers from arXiv were embedded, resulting in ~11 million vectors (based on ‘chunks’ from the papers), each comprising 384 dimensions. Technologies used include Langchain, a vector database, and semantic search with document and query embeddings. The chatbot was tested using 30 sample questions that computer science students might ask, spanning several topics and education levels (BSc, MSc and PhD). Its responses were compared with those from GPT-4, and responses with and without prompting were also compared. Readability metrics (Flesch-Kincaid and Coleman-Liau) were used to compare the chatbot's responses with those from GPT-4. Retrieval Augmented Generation Assessment (RAGAS), a novel LLM self-evaluation method, was also used to evaluate the system. We observed that the developed system provides responses that are more suitable for the user, given the readability level at which the questions were asked.
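
As an illustration of the readability comparison mentioned in the abstract, the following is a minimal sketch of the Coleman-Liau index, computed from its standard formula 0.0588·L − 0.296·S − 15.8, where L is the average number of letters per 100 words and S the average number of sentences per 100 words. The function name `coleman_liau_index` and the regex-based word and sentence counting are assumptions made for this example; the abstract does not say which tooling the authors used.

```python
import re


def coleman_liau_index(text: str) -> float:
    """Approximate US grade level of `text` using the Coleman-Liau formula.

    Word and sentence detection are simple heuristics (assumptions of this
    sketch), so scores may differ slightly from those of other tools.
    """
    # A word is a run of letters, optionally joined by apostrophes or hyphens.
    words = re.findall(r"[A-Za-z]+(?:['-][A-Za-z]+)*", text)
    if not words:
        raise ValueError("text contains no words")

    # Count alphabetic characters only.
    letters = sum(ch.isalpha() for ch in text)

    # Treat each run of '.', '!' or '?' as one sentence boundary (at least 1).
    sentences = max(1, len(re.findall(r"[.!?]+", text)))

    letters_per_100_words = letters / len(words) * 100
    sentences_per_100_words = sentences / len(words) * 100
    return 0.0588 * letters_per_100_words - 0.296 * sentences_per_100_words - 15.8


if __name__ == "__main__":
    simple = "The cat sat. The dog ran."
    dense = "Retrieval augmented generation incorporates external documents."
    # The denser sentence should receive the higher grade-level score.
    print(coleman_liau_index(simple) < coleman_liau_index(dense))
```

Scoring a question and each candidate response this way gives one possible basis for checking whether a response is pitched at a similar level to the question.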
Original language | English |
---|---|
Title of host publication | 2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) |
Publisher | IEEE |
ISBN (Electronic) | 979-8-3503-6021-9 |
ISBN (Print) | 979-8-3503-6022-6 |
Publication status | Published (in print/issue) - 20 Mar 2024 |
Event | 2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) - Letterkenny, Ireland; Duration: 7 Dec 2023 → 8 Dec 2023; Conference number: 2023 |
Publication series
Name | 2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) |
---|---|
Publisher | IEEE Control Society |
Conference
Conference | 2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) |
---|---|
Abbreviated title | AICS |
Country/Territory | Ireland |
City | Letterkenny |
Period | 7/12/23 → 8/12/23 |
Bibliographical note
Publisher Copyright: © 2023 IEEE.
Keywords
- Computer science
- Databases
- Semantics
- Knowledge based systems
- Education
- User interfaces
- Chatbots
- chatbot
- Langchain
- Readability metrics
- large language model
- retrieval augmented generation
- Flesch-Kincaid
- Coleman-Liau
- vector database
- GPT-4
- Retrieval Augmented Generation Assessment
- semantic search