Towards machine learning based text categorization in the financial domain

Frederic Voigt, Jose Alcaraz Calero, Keshav Dahal, Qi Wang, Kai Von Luck, Peer Stelldinger

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Downloads (Pure)

Abstract

Large language models (LLMs) have demonstrated remarkable success in the field of natural language processing (NLP). Despite their origins in NLP, these algorithms possess the theoretical capability to process any data type represented in an NLP-like format. In this study, we use stock data to illustrate three methodologies for processing regression data with LLMs, employing tokenization and contextualized embeddings. By leveraging the well-known LLM algorithm Bidirectional Encoder Representations from Transformers (BERT) [1], we apply quantitative stock price prediction methodologies to predict stock prices and stock price movements, showcasing the versatility and potential of LLMs in financial data analysis.
Original languageEnglish
Title of host publicationProceedings of the 6th International Conference on Cybernetics, Cognition and Machine Learning Applications
PublisherIEEE
Publication statusAccepted/In press - 5 Aug 2024
Event6th International Conference on Cybernetics, Cognition and Machine Learning Applications - Nordakademie University of Applied Sciences, Germany
Duration: 19 Oct 202420 Oct 2024
https://www.intdatacon.com/

Conference

Conference6th International Conference on Cybernetics, Cognition and Machine Learning Applications
Abbreviated titleICCCMLA 2024
Country/TerritoryGermany
Period19/10/2420/10/24
Internet address

Keywords

  • finance
  • quantitative stock price prediction
  • natural language processing
  • stock movement prediction
  • fintech
  • machine learning
  • large language models

Fingerprint

Dive into the research topics of 'Towards machine learning based text categorization in the financial domain'. Together they form a unique fingerprint.

Cite this