A machine learning approach to domain specific dictionary generation. An economic time series framework
Stellenbosch Working Paper Series No. WP06/2021Publication date: March 2021
Author(s):
This paper aims to offer an alternative to the manually labour intensive process of constructing a domain specific lexicon or dictionary through the operationalization of subjective information processing. This paper builds on current empirical literature by (a) constructing a domain specific dictionary for various economic confidence indices, (b) introducing a novel weighting schema of text tokens that account for time dependence; and (c) operationalising subjective information processing of text data using machine learning. The results show that sentiment indices constructed from machine generated dictionaries have a better fit with multiple indicators of economic activity than @loughran2011liability's manually constructed dictionary. Analysis shows a lower RMSE for the domain specific dictionaries in a five year holdout sample period from 2012 to 2017. The results also justify the time series weighting design used to overcome the p>>n problem, commonly found when working with economic time series and text data.
JEL Classification:C32, C45, C53, C55
Keywords:Sentometrics, Machine learning, Domain-specific dictionaries
Notes:Data download: Generated Dictionaries
Download: PDF (738 KB)Login
(for staff & registered students)
BER Weekly
25 Apr 2025 Budget 3.0 loading; Trump starts to walk back tariff threats and Fed bullying (for now)This week was marked by policy reversals and clarifications both in SA and abroad, as policymakers confronted the consequences of their decisions. In the US, the administration softened its previously hardline stance on tariffs and downplayed earlier critiques of the US Federal Reserve (Fed). At home, SA’s National Treasury retracted its VAT increase...
Read the full issue
BER Weekly
25 Apr 2025 Budget 3.0 loading; Trump starts to walk back tariff threats and Fed bullying (for now)This week was marked by policy reversals and clarifications both in SA and abroad, as policymakers confronted the consequences of their decisions. In the US, the administration softened its previously hardline stance on tariffs and downplayed earlier critiques of the US Federal Reserve (Fed). At home, SA’s National Treasury retracted its VAT increase...
Read the full issue