Automating the creation of news summaries by using Natural Language Processing

What if journalists could spend less time summarizing and more time creating? At VRT, innovation meets AI to explore how technology can support newsrooms in delivering faster, high-quality content.

Catch up quickly
VRT partnered with ML6 to improve AI-driven news summarization using advanced NLP models. ML6 guided the innovation team in enhancing model training with a sequential approach: starting from a pretrained multilingual BART model, fine-tuning on English-to-Dutch summaries, and finally integrating VRT’s own data. The result is a tool that speeds up summary creation, suggests alternative wordings, and supports journalists in their work without replacing their creativity. A pilot with VRT journalists confirmed the value of combining human insight with AI to deliver high-quality results.
About this client
The VRT is the national public-service broadcaster for the Flemish Community of Belgium. With its three television channels, five radio stations, and various digital channels, the VRT reaches up to 90% of all Flemish people every week. The VRT NWS is the news service of the VRT and aims at keeping Flemish people informed about the national and international news through its diverse channels (such as the website, application and live tv broadcastings). The VRT has also an innovation department which continuously explores new technologies and applications for media purposes in close collaboration with its end-users.
Impact
The AI driven model is a powerful tool in assisting journalists in their writing work by speeding up the summary creation as well as suggest alternative wordings. In that way, human creativity and AI can enhance each other’s complementary strengths and produce high quality results.
Challenge
Before implementing the solution, VRT faced several hurdles in automating the creation of high-quality news summaries.
Time-Consuming Summaries
01News articles on the VRT NWS website generally consist of a short summary followed by the entire article. The summary acts as a condensed version of the article and captures the main points of the story. Because the creation of summaries is a repetitive and time-consuming process, the VRT innovation department explored the possibility of using Natural Language Processing to automate this activity.
Two approaches
02Two approaches can be used for that: extractive and abstractive summarization. The extractive summarization focuses on identifying the most important parts of the article and produces a set of sentences from the original text, while the abstractive method produces a new text based on the interpretation of the article. The VRT chose for the abstractive method, which is a more state-of-the-art method and provides more promising results.
Improving Model Performance
03In that way, the innovation department sought to develop trained models to enable the automatic creation of news summaries. Although they had already fine-tuned their models, they were not satisfied with the quality of the output. Therefore, ML6 provided in-depth technical advice on the training and on the deployment of the models to achieve the highest performance possible.
Working with ML6 is investing in our own people. We believe it’s important that we have internal knowledge, and through our collaboration we received a knowledge transfer in a very efficient way to bring our people to a higher level. We buy knowledge, we buy flexibility, we make an investment in our people towards the future.

Solution
To enhance model training and improve output quality, ML6 proposed a sequential way of working.
Starting with a Pretrained Model
01ML6 began from a pretrained multilingual NLP “BART model” (see paper).
Fine-Tuning Phase
02A first fine-tuning phase was done on English news summaries translated into Dutch.
Integrating VRT Data
03Finally, VRT data was integrated into the model. This way of complementing the client data with processed open-source data leads to a higher quality of the generated summaries.
Results
This type of Transformer model is a powerful tool in assisting journalists in their writing work. They can speed up the summary creation as well as suggest alternative wordings. At the same time, they can not replace the insights and the creativity of the journalists; which are human skills gained on a broader scale than the (limited) training data presented to the NLP model. With this in mind, it can be said that humans and AI can enhance each other’s complementary strengths and produce high quality results. As a final test of this theory and in order to evaluate the output quality of this tool, a journalist from the VRT tested some of the news summaries based on existing VRT articles, which enabled the VRT innovation to gain new insights and feedback.
Inspired?
Let’s connect and make it happen!
Ready to elevate your AI game? Schedule a meeting with us today and let’s craft a winning strategy together!
Cupcake ipsum dolor sit amet apple pie.
Frequently Asked Questions
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.