We love reading news, but who’s got the time to read every article? Luckily, the Global Database of Events, Language, and Tone (GDELT for short) scrapes tons of news articles and other sources of information. And yes, you can access its data via an API or through very convenient Python and R SDKs.

The GDELT documentation is pretty good, and you can easily find Python and R tutorials to get started with the API. You can also query the data on BigQuery if you want to analyze it. However, if you want to do more serious analysis, you will need your own copy of the raw data. We have started playing with the data; check our very WIP repo to see how to get a local copy of GDELT. It may also give you some tips for processing the raw data.
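
To give a rough idea of what "getting a local copy" can look like, here is a minimal sketch that is not taken from our repo. It assumes the GDELT 2.0 master file list lives at the URL below and that each line has three whitespace-separated fields (size, checksum, file URL), with file names that start with a `YYYYMMDDHHMMSS` timestamp followed by the file kind (`export`, `mentions` or `gkg`). The helper names `parse_master_list` and `download` are ours, purely for illustration.

```python
import os
from urllib.request import urlopen, urlretrieve

# Assumed location of the GDELT 2.0 master file list (check the GDELT docs).
MASTER_LIST_URL = "http://data.gdeltproject.org/gdeltv2/masterfilelist.txt"


def parse_master_list(text, day, kind="export"):
    """Return the file URLs for one day (YYYYMMDD) and one file kind.

    Assumes each line looks like "<size> <checksum> <url>".
    """
    urls = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        url = parts[2]
        name = url.rsplit("/", 1)[-1]
        if name.startswith(day) and f".{kind}." in name:
            urls.append(url)
    return urls


def download(url, dest_dir="gdelt_raw"):
    """Download one file into dest_dir, skipping files already present."""
    os.makedirs(dest_dir, exist_ok=True)
    target = os.path.join(dest_dir, url.rsplit("/", 1)[-1])
    if not os.path.exists(target):
        urlretrieve(url, target)
    return target


if __name__ == "__main__":
    # The listing covers every file ever published, so it can be large.
    with urlopen(MASTER_LIST_URL) as resp:
        listing = resp.read().decode("utf-8")
    for file_url in parse_master_list(listing, "20240101"):
        print(download(file_url))
```

Filtering the listing before downloading keeps the first experiment small: one day of `export` files is enough to explore the schema before you commit disk space to the full archive.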