The project presents a massive overview of the Ukrainian conflict. After the last escalation, which took place in spring 2021, we came up with an idea to make an analysis of the conflict based on data of the number of ceasefire violations provided by the OSCE Special Monitoring Mission. By matching the timeline of ceasefire violations with the social and political events and media propaganda we found out that the conflict has transformed from real military operations into infowar. We concluded that at the moment this so called hybrid war is endless because of the benefits it brings on for
We have created a unique tool that demonstrates the real dynamics of the conflict in Donbas over the past 5 years. Research was shared with many media, including Ukrainian. Global Investigative Journalism Network included our material in the top 10 best investigations of August 2021 in CIS countries.
Unfortunately, OSCE doesn’t have an open access to the data of ceasefire violations, so we made huge work.
- We gathered more than 2000 daily reports in PDF format from the OSCE website using Python.
- These PDFs has different structure according to different publishing periods, so we created a programming tool to parse them and convert into large CSV files. This step also required some manual work to prove the recognition quality.
- Then we cleaned the data in Google Sheets, extracted number of ceasefire violations by day, grouped them by weeks and put them on the timeline.
- We also made massive research to establish events that could be causes or consequences of the escalations and truces.
- We decided that the best form to present the result of our work is scrollytelling format. It was very new for us.
What was the hardest part of this project?
Each step of the project was challenging, especially the work with parsed and recognized PDFs. It took many hours to prove them manually, fact-check, and ensure the high quality of the data we presented to our readers.
The scale of the database was ambiguous in terms of manual prooving, but we double-checked the result of parsing because over the past years, the OSCE mission has changed the structure of reports several times.
Also, we had very limited experience with such complicated front-end development and had no production team, so we learned on the go how to manage such a big project. Moreover, it took a while to develop the exclusive engine for our site to achieve the best performance of scrollytelling for better user experience neither using a desktop nor mobile devices.
On the other hand, we spent a long time figuring out how to make an interesting story and comprehensive analysis based on these data. We hope we achieved this goal.
What can others learn from this project?
This project had two interesting aspects: technical implementation and data interpretation. The conflict in Ukraine has been going on for more than 7 years and we have come across a huge amount of information on this topic. Despite this we have combined all our analytical findings and expert abstracts into a single structured data-driven story.