Newspapers (or TV programs, or events from a city) are often published as RSS feeds, and their posts are stored in the rss_posts table. You can ask simple questions like “Was my organization mentioned recently?” or “How often is a keyword used?”. (Technical preview; dynamic data updates will be implemented this year.)
Column Name | Datatype | Comment | Sample data |
---|---|---|---|
idrss | int | unique number | 12 |
source2_name | varchar(45) | original source of the content (can differ from rss source, e.g. TV program & TV station) | Bild |
source2_country | varchar(45) | country of the original source of the content (de, ch, at) | de |
rss_post_id | varchar(250) | id to prevent duplicate entries, given by source1 | http://www.bild.de/politik/inland/talkshow/illner-erdogan-und-todesstrafe-51378892.bild.html |
rss_title | mediumtext | Title of the post | Erdogan-Talk bei Illner - Todesstrafe-Abstimmung in Deutschland? |
rss_text | longtext | main content of the post | In der Runde trafen Erdogan-Fans und Kritiker des Despoten aufeinander. Eine explosive Mischung. Foto: ZDF |
rss_published | datetime | date the information became public | 2017-04-21 01:22:56 |
inserted | timestamp | timestamp when the post was inserted in the database | 2017-04-21 07:05:23 |
source2_type | varchar(45) | like newspaper, radio, webnews, TV, ... | newspaper |
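
The kind of question mentioned above ("Was my organization mentioned recently?") can be sketched as a keyword count over `rss_title` and `rss_text`. This is a minimal sketch, not the project's own code: it assumes an in-memory SQLite database as a stand-in for the real database, and the helper name `count_mentions` is hypothetical. Only the table and column names come from the table above.

```python
import sqlite3


def count_mentions(conn, keyword, since):
    """Count posts mentioning `keyword` in title or text, published on or after `since`.

    Hypothetical helper. `since` is a 'YYYY-MM-DD HH:MM:SS' string; '%' and '_'
    inside `keyword` act as LIKE wildcards.
    """
    pattern = f"%{keyword}%"
    row = conn.execute(
        """
        SELECT COUNT(*) FROM rss_posts
        WHERE (rss_title LIKE ? OR rss_text LIKE ?)
          AND rss_published >= ?
        """,
        (pattern, pattern, since),
    ).fetchone()
    return row[0]


# Assumption: SQLite stand-in with the columns of rss_posts, stored as text.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE rss_posts (
        idrss INTEGER PRIMARY KEY,
        source2_name TEXT,
        source2_country TEXT,
        rss_post_id TEXT,
        rss_title TEXT,
        rss_text TEXT,
        rss_published TEXT,
        inserted TEXT,
        source2_type TEXT
    )
    """
)

# The sample row from the table above.
conn.execute(
    "INSERT INTO rss_posts VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)",
    (
        12,
        "Bild",
        "de",
        "http://www.bild.de/politik/inland/talkshow/illner-erdogan-und-todesstrafe-51378892.bild.html",
        "Erdogan-Talk bei Illner - Todesstrafe-Abstimmung in Deutschland?",
        "In der Runde trafen Erdogan-Fans und Kritiker des Despoten aufeinander. "
        "Eine explosive Mischung. Foto: ZDF",
        "2017-04-21 01:22:56",
        "2017-04-21 07:05:23",
        "newspaper",
    ),
)

print(count_mentions(conn, "Erdogan", "2017-04-01 00:00:00"))
```

Against the real database the same `SELECT` should carry over largely unchanged, although the placeholder style depends on the database driver in use.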
The rss definition table controls which RSS posts are pulled from the web and where to get the desired fields from the posts. To add a source (e.g. a newspaper), just add a row to the definition table.
Column Name | Comment | Sample Data |
---|---|---|
idrssdefinition | primary key | 1 |
version | version of the definition, incremented in case the specification of the definition changes | 1 |
status | use this datasource or not | 1 |
source1_name_get | Where the source name is read from | 0 |
source1_name_value | If source1_name_get is 0, the value is taken from here; otherwise it is read from a location specified in the script (1 = defined rss field) | Bild |
source2_name_get | Like source1_name_get | 0 |
source2_name_value | Like source1_name_value | Bild |
source1_type_value | Like source1_name_value | newspaper |
source1_type_get | Like source1_name_get | 0 |
source1_country_get | Like source1_name_get | 0 |
source1_country_value | Like source1_name_value | de |
rssurl_get | Like source1_name_get | 0 |
rssurl_value | Like source1_name_value | http://www.bild.de/rssfeeds/vw-alles/vw-alles-26970192,sort=1,view=rss2.bild.xml |
rssid_get | Like source1_name_get | 1 |
rssid_value | Like source1_name_value | id |
rss_title_get | Like source1_name_get | 1 |
rss_title_value | Like source1_name_value | title |
rss_text_get | Like source1_name_get | 1 |
rss_text_value | Like source1_name_value | summary |
rss_published_get | Like source1_name_get | 1 |
rss_published_value | Like source1_name_value | published_parsed |
source2_type_value | Like source1_name_value | newspaper |
source2_type_get | Like source1_name_get | 0 |
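
The `*_get` / `*_value` pairs above can be read as a small lookup rule: with `_get` set to 0 the `_value` column holds the literal value, and with `_get` set to 1 the `_value` column names a field of the RSS post. The following is a minimal sketch of that rule under stated assumptions, not the collector's actual script. The function name `resolve_field` is hypothetical, and the definition row and the RSS post are modeled as plain dictionaries. Modes other than 0 and 1 are not described here, so the sketch rejects them.

```python
def resolve_field(definition, entry, field):
    """Resolve one field of a post from its `<field>_get` / `<field>_value` pair.

    Hypothetical helper: `definition` is one row of the definition table,
    `entry` is one parsed RSS post, both modeled as dicts (an assumption).
    """
    mode = int(definition[f"{field}_get"])
    value = definition[f"{field}_value"]
    if mode == 0:
        # 0: the definition row itself holds the literal value.
        return value
    if mode == 1:
        # 1: the value names the RSS field to read from the post.
        return entry.get(value)
    raise ValueError(f"unsupported {field}_get mode: {mode}")


# A subset of the sample definition row from the table above.
definition = {
    "source2_name_get": 0,
    "source2_name_value": "Bild",
    "rss_title_get": 1,
    "rss_title_value": "title",
    "rss_text_get": 1,
    "rss_text_value": "summary",
}

# Assumed shape of a parsed RSS post, using the sample data from rss_posts.
entry = {
    "title": "Erdogan-Talk bei Illner - Todesstrafe-Abstimmung in Deutschland?",
    "summary": "In der Runde trafen Erdogan-Fans und Kritiker des Despoten aufeinander.",
}

post = {
    field: resolve_field(definition, entry, field)
    for field in ("source2_name", "rss_title", "rss_text")
}
print(post["source2_name"], "|", post["rss_title"])
```

Keeping the rule in one function means a new source only needs a new definition row, which matches the intent stated above.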