User talk:FactoProphyl
where can I find assistance with or advice about wikipedia page queries
I'm trying to obtain a list of English-born people born after 1950 who have Wikipedia pages. Ideally I'd like to include those who were schooled in England specifically.
Ideally I'd like to have the result as a flat file of some sort which could link to their Wikipedia pages, so that as a start I could correlate birth place with birth location, school and so on.
I have been looking at https://en.wikipedia.org/wiki/Special:Export https://petscan.wmflabs.org/ https://en.wikipedia.org/wiki/Wikipedia:Request_a_query and https://query.wikidata.org/ and so on. My results have been pitiful: I don't know SQL, I don't know how many results to expect, and I need the search to be verifiable as it's for study. I downloaded a dump as well. I know the wiki pages and what I would like from them, such as infobox and text data. That's about it. Can I ask someone if this is possible, how I might go about it and, if so, where I should ask?
FactoProphyl (talk) 11:36, 17 June 2018 (UTC)
- Hey, see if this can help you. If you need more help turn on the {{help me}} template. ‐‐1997kB (talk) 12:18, 17 June 2018 (UTC)
Thank you, @1997kB: I had tried it; I forgot to mention that. I had issues with it, probably because I don't know what I'm doing. But it also searches Wikidata, which, unless I'm mistaken, is a different dataset than Wikimedia and its various dumps. The results differ between that and Quarry.
Secondly, it seems to me that not every wikipedia page contains the marker/field_descriptor/template tag that would facilitate this type of search because, for example, biographical pages exist where the person's birth-date is mentioned in the body text without being otherwise 'tagged' or that information being in the infobox. At times no infobox exists at all.
So, as far as I could tell, all searches constructed this way look for the required marker/field_descriptor/template tag in order to return a result [for that field].
To me this meant that I was missing an awful lot of people. Is such an assumption wrong?
Perhaps it would be better to reformulate this: having stumbled through these query pages, I came to the conclusion that, to get the information I would like out of Wikipedia pages, I would have to construct something that searches for text triggers such as "was born in", "went to school at", "england".
That sounds pretty stupid, so where am I going wrong? In short, why can't I get a list of people who were born [or raised, ideally] in England, aged 70 or less, dead or alive? FactoProphyl (talk) 15:57, 17 June 2018 (UTC)
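To make the "text trigger" idea above concrete, here is a minimal Python sketch. It is only an illustration under stated assumptions: the patterns, the function names (`extract_facts`, `matches_study`) and the sample sentences are all hypothetical, and real articles phrase these facts in far more ways than three regular expressions can catch.

```python
import re

# One or more capitalised words, e.g. "Leeds" or "Example Grammar School".
PLACE = r"[A-Z][a-z'-]+(?: [A-Z][a-z'-]+)*"

# Hypothetical trigger phrases taken from the question above.
BORN_IN = re.compile(rf"\bwas born in ({PLACE}(?:, {PLACE})*)")
BORN_YEAR = re.compile(r"\bborn\b[^.]*?\b(1[89]\d\d|20\d\d)\b")
SCHOOL = re.compile(rf"\bwent to school at ({PLACE})")


def extract_facts(text):
    """Return whatever the trigger phrases find; missing facts are None."""
    place = BORN_IN.search(text)
    year = BORN_YEAR.search(text)
    school = SCHOOL.search(text)
    return {
        "birth_place": place.group(1) if place else None,
        "birth_year": int(year.group(1)) if year else None,
        "school": school.group(1) if school else None,
    }


def matches_study(facts):
    """True only when the text itself says: born in England, after 1950."""
    place, year = facts["birth_place"], facts["birth_year"]
    return place is not None and "England" in place and year is not None and year > 1950


# Invented sample sentences, not taken from any real article.
tagged = ("Jane Doe (born 3 May 1961) is an author. She was born in Leeds, "
          "England and went to school at Example Grammar School.")
untagged = "John Smith is a footballer who grew up near Canterbury."

print(extract_facts(tagged), matches_study(extract_facts(tagged)))
print(extract_facts(untagged), matches_study(extract_facts(untagged)))
```

The second sample shows the concern raised above: a person whose article never uses one of the trigger phrases is silently missed, so any count obtained this way is a lower bound rather than a complete list.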
- I'm not sure Wikipedia is constructed in such a way as to make this quest of yours simple. No automated query that is based on looking at the wording of articles is going to be easy; there is just too much diversity in how articles are written.
- What is relatively easy is to extract articles that are members of a set of categories. This depends on articles being categorized in the way you want. For instance, while there are categories for people educated at Oxford, by college, there is no current category that gathers Oxford and Cambridge along with other English universities and colleges. So you'd need to list all of the birth years of interest and all of the educational institutions of interest, then find pages that are included in one category from the first set and in at least one category of the second set. But only the more extensive articles are likely to identify where a person went to school, and not all of them will have already had a suitable category attached.
- So I don't know how to offer much help at getting the data set you are asking for, given the current state of things. — jmcgnh(talk) (contribs) 17:00, 17 June 2018 (UTC)
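The category approach described in this reply can be sketched roughly as follows, in Python. This is a minimal illustration, not a tested pipeline: it assumes the category memberships have already been downloaded into a dictionary by some other means, and the alumni category names, the person names and the helper names (`intersect_category_sets`, `to_flat_file`) are hypothetical.

```python
import csv
import io
from urllib.parse import quote


def intersect_category_sets(members, birth_categories, school_categories):
    """Pages in at least one birth-year category AND at least one school category."""
    born = set().union(*(members.get(c, set()) for c in birth_categories))
    schooled = set().union(*(members.get(c, set()) for c in school_categories))
    return sorted(born & schooled)


def to_flat_file(titles):
    """Render titles as CSV text with a link back to each Wikipedia page."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["title", "url"])
    for title in titles:
        url = "https://en.wikipedia.org/wiki/" + quote(title.replace(" ", "_"))
        writer.writerow([title, url])
    return buffer.getvalue()


# Hypothetical membership data: category name -> set of article titles.
members = {
    "1951 births": {"Person A", "Person B"},
    "1952 births": {"Person C"},
    "Alumni of Example College, Oxford": {"Person B", "Person D"},
    "Alumni of Example College, Cambridge": {"Person C"},
}

birth_categories = ["1951 births", "1952 births"]
school_categories = [
    "Alumni of Example College, Oxford",
    "Alumni of Example College, Cambridge",
]

selected = intersect_category_sets(members, birth_categories, school_categories)
print(selected)  # ['Person B', 'Person C']
print(to_flat_file(selected))
```

Person A and Person D drop out because each appears in only one of the two sets, which mirrors the caveat above: an article that lacks a suitable education category will not be found, however well it fits otherwise.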
Blocked for socking
I have indefinitely blocked you for persistently and abusively editing while logged out in violation of WP:SOCK. See WP:GAB for your appeal rights.--Bbb23 (talk) 17:01, 4 December 2019 (UTC)
- Just in case any admin considers withdrawing TPA, the user's access to WP:UTRS has been withdrawn. As this is a CU block it cannot be reviewed by a patrolling UTRS admin and the user needs to go to ArbCom as confirmed by Beeblebrox.
- FactoProphyl, I did send you an email about this and my advice remains the same. I'm not one of the panel of editors for them so I cannot speak for them, however, if you email them, I would strongly recommend that you address your socking. You didn't address that when you contacted UTRS. If you're not familiar with ArbCom, I'd suggest that you read about them before making contact.-- 5 albert square (talk) 23:43, 16 January 2020 (UTC)