Gdelt Mentions

I copy parts of the original mentions dataset from the gdelt project into a mysql database for further evaluations. Below my table definition. The definition of the fields is of cause identical.  Please also read the original paper:  

THE GDELT EVENT DATABASE DATA FORMAT CODEBOOK V2.0

From the paper:
” … Mentions table that records all mentions of each event. As an event is mentioned across multiple news reports, each of those mentions is recorded in the Mentions table, along with several key indicators about that mention, including the location within the article where the mention appeared (in the lead paragraph versus being buried at the bottom) and the “confidence” of the algorithms in their identification of the event from that specific news report. …”
 
For a lack of better knowledge I use the hash of GLOBALEVENTID, MentionTimeDate, MentionIdentifier, Actor2CharOffset, ActionCharOffset as primary key. Please let me know if there is a better one!

 

About ralf

I studied Geology at University Erlangen and got my PhD (bio-nanotechnology) at TU Dresden. In my spare time i program simulations and tinker around with data prediction methods. Frisbee is my favorite sport and i play guitar when my friends and i meet to make some music.
This entry was posted in datacoll, gdelt and tagged , . Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.