If investors are detectives seeking clues for outperformance in the US large-cap equity market, natural language processing is a team of tireless assistants.
Scouring company financial statements, listening to management calls, visiting facilities, analyzing competitors—an analyst’s daily life is filled from morning to night. And a river of incoming information, much of it unstructured, flows faster than ever. That means a lot of reading—and possibly less time for synthesizing data.
As fundamental analysts dig deep into companies and their business operations, conversations with management teams are a key source of information. Earnings calls are particularly important—a direct communication channel to company leaders, offering insights into a firm’s financial health, strategic direction and market trends.
Earnings-call transcripts may be fertile ground for insights—from forward-looking statements to key performance metrics, and from operational updates to market sentiment. Subtle clues matter too: tone, language and level of detail may help analysts gauge management's confidence and potential future performance. And our research suggests that incorporating these factors may lead to more informed investment decisions.
Sentiment: The “Mood” of a Message Matters
The sentiment metric is an intuitive but powerful indicator that captures a document’s tone: is it relatively positive, negative or neutral?
Sentiment assessed from earnings-call transcripts may impact future stock prices. Our analysis suggests that the stocks of companies whose communications score higher on sentiment have tended to outperform following the calls. But given the sheer volume of earnings calls, manually evaluating sentiment can be daunting.
This is where, in our view, natural language processing (NLP) sentiment analysis may be a useful tool for analysts.
“Bag of Words” and “Context-Aware”: NLP to the Rescue
NLP, a branch of artificial intelligence (AI), can be a powerful resource to help analysts extract insights from mountains of documentation. Instead of hiring a team of interns to comb for clues, NLP can do the job faster and more efficiently.
We think it makes sense to capture sentiment using two NLP approaches: “bag of words” and “context-aware.” Bag of words provides a baseline for assessing text by counting the number of positive and negative words in a document. This is highly intuitive, but also simplistic and subject to manipulation.
The context-aware approach, leveraging models such as BERT, GPT and LLaMA, analyzes the construction of sentences and their context within the overall document. These models measure aspects such as sentiment by analyzing overall language patterns, making their analysis more accurate and contextually relevant than simple counts of positive or negative words.
Applying both the bag-of-words and context-aware approaches to the earnings transcripts of US large-cap companies (Display) from 2010 to 2023 reveals a large sentiment dip in mid-2020. This bout of pessimism was related to the impact of the COVID-19 pandemic and its global economic aftershocks on company financials.
That downturn aside, both metrics indicate improving sentiment since 2010—more so using the bag-of-words approach. We haven’t yet analyzed this trend in detail, but other studies suggest that company executives are adapting to the age of AI by communicating with more positive words and phrases. And the sharper increase in the average bag-of-words sentiment suggests that firms are incorporating positive words more often.