mysql - Wikipedia: dump article IDs and their categories
I want to make a MySQL database with every Wikipedia article ID and its category ID (the most general category). I saw that Wikipedia gives an entire dump, and a few others with the links between categories. I also saw there is MediaWiki, but I can't manage to find the right query to send.
But nonetheless, I can't find how to dump one big file with each article ID and its category ID. How should I do it? How much data should I expect?
Wikipedia provides dumps of all of its data. The one you want is categorylinks.sql, which contains the list of category names for each article ID (that table refers to categories by name, not by ID). You will probably also want page.sql, which contains the map from article ID to title.
To work with the dumps, you can import them into a local MySQL database, or use a library that parses the dumps directly, like the one I wrote for .NET.
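Once both tables are loaded, the article-to-category mapping is a single join. The following is only a rough sketch of that query, not a definitive recipe. It assumes the classic MediaWiki column names (page_id, page_namespace, page_title, cl_from, cl_to) and that namespace 0 means articles; check the CREATE TABLE statements at the top of the dump files you download, since the schema can differ between versions. An in-memory SQLite database with a few made-up rows stands in for the local MySQL database so the example runs on its own.

```python
import sqlite3


def build_demo_db():
    """Create a tiny stand-in for the imported dumps (rows are made up)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE page (
            page_id INTEGER PRIMARY KEY,
            page_namespace INTEGER,
            page_title TEXT
        );
        CREATE TABLE categorylinks (
            cl_from INTEGER,  -- ID of the page that is in the category
            cl_to TEXT        -- category name (assumed classic schema)
        );
        INSERT INTO page VALUES
            (1, 0, 'MySQL'),
            (2, 0, 'Python_(programming_language)'),
            (3, 14, 'Databases');
        INSERT INTO categorylinks VALUES
            (1, 'Relational_database_management_systems'),
            (1, 'Free_software'),
            (2, 'Programming_languages'),
            (3, 'Computing');
        """
    )
    return conn


# Namespace 0 is assumed to hold articles; page 3 above is a category page
# (namespace 14) and is filtered out.
ARTICLE_CATEGORIES_SQL = """
    SELECT p.page_id, cl.cl_to
    FROM page AS p
    JOIN categorylinks AS cl ON cl.cl_from = p.page_id
    WHERE p.page_namespace = 0
    ORDER BY p.page_id, cl.cl_to
"""


def article_categories(conn):
    """Return (article_id, category_name) pairs, one row per membership."""
    return conn.execute(ARTICLE_CATEGORIES_SQL).fetchall()


if __name__ == "__main__":
    for article_id, category in article_categories(build_demo_db()):
        print(article_id, category)
```

The SELECT itself should carry over to MySQL largely unchanged; only the connection code would differ.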
But each article can be in several categories, and there is no notion of a primary category or anything like that. So, if you want one category for each article, you have to figure out how to choose it yourself.
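As one illustration of what "figuring it out yourself" could look like, here is a small sketch that keeps, for each article, the category with the most members, on the assumption that a bigger category is a more general one. That heuristic is my own guess, not something Wikipedia defines, and the function name pick_one_category and the sample data are hypothetical. The input is a list of (article ID, category name) pairs such as you would get from categorylinks.

```python
from collections import Counter


def pick_one_category(links):
    """Choose one category per article from (article_id, category) pairs.

    Heuristic (an assumption, not a Wikipedia rule): prefer the category
    with the most members in the input; break ties alphabetically so the
    result is deterministic.
    """
    links = list(links)
    # Number of articles that belong to each category.
    size = Counter(category for _, category in links)

    best = {}
    for article_id, category in links:
        current = best.get(article_id)
        # Compare by (-size, name): larger categories sort first,
        # and equal sizes fall back to alphabetical order.
        if current is None or (-size[category], category) < (-size[current], current):
            best[article_id] = category
    return best


if __name__ == "__main__":
    sample = [
        (1, "Free_software"),
        (1, "Databases"),
        (2, "Databases"),
        (3, "Free_software"),
        (3, "Databases"),
    ]
    print(pick_one_category(sample))
```

Other rules are just as defensible, for example walking up the category hierarchy to some fixed set of top-level categories, so treat this as a starting point only.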