Bsc Marcel Michels
BSc defense Marcel Michels
Metadata
- Date: 10 September 2018 (Monday)
- Time: 2:00 pm
- Room: B 016
- Presenter: Marcel Michels
- 1st reviewer: Ralf Lämmel
- 2nd reviewer: Marcel Heinz
Title
Information Extraction from Wikipedia’s Software Language Infoboxes
Abstract
Wikipedia is one of the world's biggest knowledge bases. Besides the articles' plain text there are infoboxes which are a way to structurally summarize the article's content. This thesis aims at extracting those infoboxes and normalize the given data. The outcome of the data processing is supposed to build a base so that articles can be compared to each other. To achieve this goal the structure and contents of those infoboxes will be analyzed to identify trouble makers within the wiki markup and it will be explained how these problems are solved in the implementation.
page revision: 0, last edited: 03 Sep 2018 13:32