Wikipedia:Bots/Requests for approval/DaxServerBot I
- The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at Wikipedia:Bots/Noticeboard. The result of the discussion was Withdrawn by operator.
Operator: DaxServer (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 20:14, Thursday, April 8, 2021 (UTC)
Automatic, Supervised, or Manual: supervised
Programming language(s): Python (pywikibot, mwparserfromhell)
Source code available: https://github.com/SrihariThalla/pywikibot-scripts/tree/main/wiki_citations_archival
Function overview: Update citations with links to domain "articles.timesofindia.indiatimes.com" to "timesofindia.indiatimes.com" as the contents on the former domain were moved to the latter with no redirects and URL path change, thus there is no way of guessing using any kind of algorithms
Links to relevant discussions (where appropriate): [[1]]
Edit period(s): Until all possible links are updated
Estimated number of pages affected: 500+
Exclusion compliant (Yes/No): Yes
Already has a bot flag (Yes/No): No
Function details: The bot needs inputs from the operator and cannot work as it involves a Google search.
The bot would accept the article page, retrieves content and scans for citations (Cite news and Cite web) to the former domain. The operator now has to do a Google search with the title of the link. Most of the time the article is found on the new domain. The operator now has to input the new URL. The bot would check if there is an archived version on the Wayback Machine and update it as well, along with the url-status.
New URL - [3]
Discussion
[edit]Disclosure: The source code was originally written to add archival-urls to citations (news and web) primarily to articles related to WP:INDIA. The script will be modified to reflect the task proposed here. Earlier, I was unaware of the bot proposal system, and has used the script (contributions from script). The edit summary says "Add archival urls to citations" or "Add archived urls to citations". The script was tested on my sandbox first and only then used to update articles on main namespace. I have since then stopped editing using it. I apologize for the unapproved use, if the Bot policy was violated. -- DaxServer (talk) 20:24, 8 April 2021 (UTC)[reply]
- GreenC, is this bot necessary or can your bot handle the transition? Primefac (talk) 13:09, 9 April 2021 (UTC)[reply]
- Based on the conversation at URLREQ since there is no way to safely automate determination of the new URL, will run WaybackMedic to archive all the links and any it can not (
{{dead link}}
) make a list for manual or semi-automed process (how not yet determined). The links exist in about 10,000 articles and total of around 20,000 links. -- GreenC 14:04, 12 April 2021 (UTC)[reply]- The @GreenC bot has finished adding the
{{dead link}}
to these links. I withdraw this request. -- DaxServer (talk) 11:47, 17 April 2021 (UTC)[reply]
- The @GreenC bot has finished adding the
- Based on the conversation at URLREQ since there is no way to safely automate determination of the new URL, will run WaybackMedic to archive all the links and any it can not (
- The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at Wikipedia:Bots/Noticeboard.