Wikipedia:Bots/Requests for approval/Bibcode Bot
- The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was Approved.
Coders:
- Δ (talk · contribs) coded the first chunk of this bot, and abandoned the project.
- Snottywong (talk · contribs) took over and will do the rest of the "major" coding.
- Headbomb (talk · contribs) [aka me] took care of the journal detection / JJJJJ assignment stuff (see below) and will keep making tweaks as needed.
Operator:
Time filed: 05:13, Friday April 15, 2011 (UTC)
Automatic or Manual: Automatic
Programming language(s): Python / Pywikipedia
Source code available: When a stable version is around.
Function overview: Finds bibcodes and dois for {{citation}} and {{cite journal}}. See also the BOTREQ.
Links to relevant discussions (where appropriate):WP Astronomy, WP Astronomical Objects, WP Physics, WP Space, WP Solar System, and the Relativity taskforce have been noticed. Response is either unanimous support, or indifference, no one gave feedback, most likely because this is uncontroversial and similar to what User:Citation bot already does, or maybe because the BOTREQ was rather technical. Either way, no one bothered to object. Headbomb {talk / contribs / physics / books} 05:13, 15 April 2011 (UTC)[reply]
Edit period(s): One big run after every database dumps, with possibly a few smaller runs following logic tweaks. If it doesn't have the time to finish between dumps, then it'll essentially be continuous.
Estimated number of pages affected: The theoretical upper limit is all pages with a {{citation}} and {{cite journal}}. A more realistic scenario is every astronomy- and physics-related page on Wikipedia (~50K), plus a few odd pages which cites a astronomy/physics journal for some weird reason (~5? ~10K? ~25K?) and most {{cite doi}} templates built by User:Citation bot. The number of affected pages should go down drastically following the first few runs, since there is a ~10 year backlog of stuff to deal with at the moment. The bot would at first go through human-compiled lists mostly focus on articles related to astronomy & physics articles. Once its astronomy & physics workload is reduced drastically, it could possibly move on to all pages with {{citation}} and {{cite journal}}.
Exclusion compliant (Y/N): Not implemented yet, but should be compliant with it.
Already has a bot flag (Y/N): Nope.
Function details:
A bibcode is in the YYYYJJJJJVVVVMPPPPA format, such as Bibcode:1982PhRvL..49.1804A, where YYYY is the year, JJJJJ is a journal code, VVVV is the volume, M is a special character, PPPP are pages, and A is the first letter of the first author's last name. The bot tries to...
- Look for
|doi=foo
in {{citation}}/{{cite journal}} and tries to find the associated bibcode, if it exists, by querying the ADSABS database. If it find one, it adds|bibcode=bar
. - Look for
|bibcode=foo
in {{citation}}/{{cite journal}} and tries to find the associated doi, if it exists, by querying the ADSABS database. If it find one, it adds|doi=bar
. - If a search for the bibcode comes up empty-handed, it tries to build a bibcode from
|year=
/|date=
,|journal=
/|work=
,|volume=
,|page=
/|pages=
,|last=
/|last1=
- If the bibcode is valid, it adds
|bibcode=
. If the bibcode has an associated doi, it also adds|doi=
. - If not, it makes some additional guesses (varies A from A to Z, if that fails, it varies M from A-Z, then from a-z).
- If it finds a valid bibcode, it adds
|bibcode=
. If the bibcode has an associated doi, it also adds|doi=
.
- If it finds a valid bibcode, it adds
- If the bibcode is valid, it adds
The bot does not touch anything other than |doi=
and |bibcode=
. If the scope expands beyond this, there will be another BRFA.
Headbomb {talk / contribs / physics / books} 05:13, 15 April 2011 (UTC)[reply]
Discussion
[edit]Could you post an example of before and after the bibcode addition and what that means for citation/user? Is there a point of reference that bibcode presence in the citation is beneficial in all cases and wanted by the community to be implemented? — HELLKNOWZ ▎TALK 14:26, 15 April 2011 (UTC)[reply]
- See sample diff. And I know of no cases where a bibcode is harmful or unwanted, as for the "wanted by the community", it is (see links to relevant discussions above). It's basically similar to User:Citation bot, only with a limited scope. Headbomb {talk / contribs / physics / books} 14:37, 15 April 2011 (UTC)[reply]
- Those had no responses. Anyway, Approved for trial (30 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. — HELLKNOWZ ▎TALK 14:45, 15 April 2011 (UTC)[reply]
Trial complete. Everything went flawlessly, save some timeout errors which make the bot crash from time to time [doesn't affect Wikipedia, just annoying on this side of things]. All bibcodes and DOIs were the correct ones. Logic is continuously being refined to cover more and more journals, but that's something that can only be taken care of over time. Headbomb {talk / contribs / physics / books} 03:07, 19 April 2011 (UTC)[reply]
- Update
- I've done a recoding of the whole bot, to make it more efficient with queries, and to also make it cover
|arxiv=
(which is also present in the ADSABS database, so might as well retrieve it too [this is uncontroversial]) as well as|bibcode=
and|doi=
. I could be making a BRFA 2 for the arxiv thing, but, since I've recoded the whole thing and it's not yet approved, and it's a very small expansion of the scope of this bot, could I get another 30 trial edits?. Headbomb {talk / contribs / physics / books} 19:37, 21 April 2011 (UTC)[reply]
Approved for extended trial (30 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Still no responses on talk pages. — HELLKNOWZ ▎TALK 19:43, 21 April 2011 (UTC)[reply]
Also, would the operator(s) please acknowledge points of Wikipedia:Bot policy#Bots operated by multiple users. — HELLKNOWZ ▎TALK 19:48, 21 April 2011 (UTC)[reply]
- Only I operate the bot. If that changes, BAG will be notified. Headbomb {talk / contribs / physics / books} 19:49, 21 April 2011 (UTC)[reply]
- O.K. — HELLKNOWZ ▎TALK 19:50, 21 April 2011 (UTC)[reply]
- FYI - I have access to portions of the code, but have no intention of running the code, and I don't have access to the User:Bibcode Bot account. —SW— soliloquize 20:51, 21 April 2011 (UTC)[reply]
- O.K. — HELLKNOWZ ▎TALK 19:50, 21 April 2011 (UTC)[reply]
Trial complete. everything was a smooth ride. Bot is ready for action and flagging. Headbomb {talk / contribs / physics / books} 00:43, 22 April 2011 (UTC)[reply]
- Approved. MBisanz talk 04:14, 23 April 2011 (UTC)[reply]
- The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.