Developing an Automated Sentiment Analysis Service for Julius Bär

Developing an Automated Sentiment Analysis Service for Julius Bär

Through this case study, discover:

Context & Challenges

Our client is Julius Bär, the leading Swiss wealth management group. Julius Bär has been managing client assets for over 130 years and is headquartered in Zurich with a presence in over 25 countries and around 60 locations.

In order to be able to react quickly, especially to important events in the financial environment, banks have global monitoring of financial news. At Julius Bär, financial news is available in-house in several languages. The quality and completeness of the monitoring are crucial, for example, in order to check the validity of current investment recommendations or to identify relevant risks at an early stage as part of ongoing risk monitoring.

In most cases, this involves accessing a large number of different news streams and then attempting to combine them and manually “distill” the essential information relevant to decision-making. Due to the high manual share, however, this process is time and personnel intensive as well as highly repetitive. This applies to an even greater extent to the manual condensation of information, e.g., to the level of individual actors – this is necessary to be able to evaluate news over time.

Thanks to rapid advances in modern NLP as well as artificial intelligence (AI), it is possible today to automate the extraction and summarization of high-quality information. The backbone of such analysis are gigantic deep-learning language models such as GPT-3, which can automatically summarize, classify, translate, and even autonomously generate texts. They are based on a transformer architecture, which enables them to reproduce human ways of thinking and patterns of reasoning.

By using these transformer models, sentiment assessment as well as data compilation and pre-processing can be almost completely automated, while humans can play to their strengths in interpreting trends and, if necessary, deriving actions. Transformer models using financial news are therefore applied in many different contexts.

To get a clear idea of the type of situation that can influence investment strategies, check out the Wirecard scandal.

One of the key challenges in the present case comprised the fact that the financial news data was multilingual, and it was insufficient to predict an overall sentiment score for every news article. Instead, the sentiment scores had to be aggregated for each individual company.

Investigating on the project feasibility, it became clear that out-of-box technologies or solutions could not be used in the present context. The main challenges were:

Our Approach

In order to address the top challenges described and develop a sustainable solution we especially took care of:

Key Benefits

Team involved on this project

One NLP expert and one Data Scientist collaborated with our client’s business and tech teams for six months on this NLP project (automated sentiment analysis service).

Technologies & Tools

Making the right tools selection in such a project is definitely no easy task. In fact, it requires a NLP expert able to analyze the plethora of open and commercial tools and technologies available on the market. Below are some insights on how our experts helped our client decide on which tools should be used for their sentiment analysis service:

fasttext has a small performance gain towards langdetect but is much faster.

Spacy and flair english models have a similar kind of performance while spacy has slightly higher inference speed on CPU vs. Flair. But the NLP pipeline concept of spacy is much more suited to easily build-up a custom end-2-end NLP pipeline with any custom component with any kind of NLP step to be integrated. It offers more integrations with other NLP tools and has more detailed documentation. Hence spacy is selected as an NER and pipeline tool over flair.

Sklearn character level TF-IDF vectorization combined with Logistic Regression is fastestet and reaches higher F1 scores on the inhouse financial news documents compared toFinBERT, but custom finetuned BERT is much better – but with too slow inference time. Distilbert is a good compromise in speed and performance.

More NLP Case Studies

Newsletter subscription