ChatPapers: An AI Chatbot for Interacting with Academic Research

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A growing and significant number of computer science related papers are being published; hence it is challenging to keep up with the latest research. This paper describes the development of a large language model (LLM) augmentation chatbot and user interface that provides responses to research queries in the domain of computer science. Around 200,000 computer science research papers from arXiv were embedded, resulting in ~11 million vectors (based on ‘chunks’ from the papers). Each vector is comprised of 384 numbers/dimensions. Technologies used include Langchain, a Vector Database, and Semantic Searching with document / query embeddings. The chatbot was tested using 30 sample questions that could be asked by computer science students across several topics and from different education levels (i.e., BSc, MSc and PhD level). The responses from this chatbot were compared with those from GPT-4. The responses with and without prompting were also compared. Readability metrics (Flesch-Kincaid and Coleman-Liau) were used to compare the responses from this LLM with GPT-4. Retrieval Augmented Generation Assessment (RAGAS), a novel LLM self-evaluation method was used to evaluate the system. We observed that the developed system provides more suitable responses to the user based on the readability level at which the questions were asked.
Original languageEnglish
Title of host publication2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS)
PublisherIEEE
ISBN (Electronic)979-8-3503-6021-9
ISBN (Print)979-8-3503-6022-6
DOIs
Publication statusPublished (in print/issue) - 20 Mar 2024
Event2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) - Letterkenny, Ireland
Duration: 7 Dec 20238 Dec 2023
Conference number: 2023

Publication series

Name2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS)
PublisherIEEE Control Society

Conference

Conference2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS)
Abbreviated titleAICS
Country/TerritoryIreland
CityLetterkenny
Period7/12/238/12/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • Computer science
  • Databases
  • Semantics
  • Knowledge based systems
  • Education
  • User interfaces
  • Chatbots
  • chatbot
  • Langchain
  • Readability metrics
  • large language model
  • retrieval augmented generation
  • Flesch-Kincaid
  • Coleman-Liau
  • vector database
  • GPT-4
  • Retrieval Augmented Generation Assessment
  • semantic search

Fingerprint

Dive into the research topics of 'ChatPapers: An AI Chatbot for Interacting with Academic Research'. Together they form a unique fingerprint.

Cite this