Wikipedia:Link rot/URL change requests/Archives/2022/September
This is an archive of past discussions about Wikipedia:Link rot. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current main page. |
www.co.summit.oh.us
http://www.co.summit.oh.us/ now goes to https://www.eyemg.com/, the website of some web design firm. The correct address appears to be https://co.summitoh.net/. The link is used on Summit County, Ohio as well as a number of related articles. RTao (talk) 04:56, 30 August 2022 (UTC)
- @RTao: this is used on 16 pages. Please do it manually, not a bot job. Due to the work required. -- GreenC 06:00, 30 August 2022 (UTC)
- @GreenC: Ah, I wasn't aware. I'll do that. Thanks for letting me know. RTao (talk) 06:10, 30 August 2022 (UTC)
- Thanks. Plus you'll be able to do a better job with manual review. -- GreenC 14:58, 30 August 2022 (UTC)
- @GreenC: Ah, I wasn't aware. I'll do that. Thanks for letting me know. RTao (talk) 06:10, 30 August 2022 (UTC)
FABLE
Not a direct request, but thought people here would be interested in User:FABLEBot/New URLs for permanently dead external links * Pppery * it has begun... 20:53, 31 August 2022 (UTC)
lexico.com, oxforddictionaries.com
Lexico.com, formerly at oxforddictionaries.com (and askoxford.com before that), has been redirected to Dictionary.com, which does not provide the content that was available on Lexico, so citations need to be replaced with archives. Not sure what to do with {{Cite Lexico}}, but I assume subst'ing the transclusions and adding archive links is the way to go. Nardog (talk) 02:16, 26 August 2022 (UTC)
- @Nardog: For the template see edit Special:Diff/1055780831/1106756789 .. it's a hack solution since it doesn't deal with missing archives or ability to control timestamps, but better than nothing. Ideally every instance would be converted to a
{{cite dictionary}}
a standardized format that regular tools can maintain without custom coding. For the rest, I think all three should be processed as dead. If no archive exists add a{{dead link}}
. -- GreenC 05:52, 30 August 2022 (UTC)- @GreenC: So... can you help? Nardog (talk) 05:45, 1 September 2022 (UTC)
- Special:Diff/1098865079/1107817768 - Not done yet. -- GreenC 06:01, 1 September 2022 (UTC)
- As for {{cite Lexico}}, I think it should be expanded to a CS1 template ({{cite web}} or {{cite dictionary}}) with the URL that would have been used as of the date provided in
|access-date=
or, if absent, when the template was inserted. Nardog (talk) 05:49, 1 September 2022 (UTC)
- @GreenC: So... can you help? Nardog (talk) 05:45, 1 September 2022 (UTC)
Done
- Convert 1,036
{{Cite Oxford Dictionaries}}
to{{Cite dictionary}}
. Example - Add archives or mark dead three domains. Added aprx 3,018 new archives. Example
- Update changes in the IABot database. Example
@Nardog: If see anything else let me know. , thanks. -- GreenC 17:30, 2 September 2022 (UTC)
- Thank you!
- The entry for purple patch was in fact archived at [1], with an underscore instead of the plus sign (which IIRC simply redirected to the canonical URL). I assume most (all?) phrases are affected by this.
- Were previous URL schemes (
https://www.oxforddictionaries.com/definition/english/...
,https://www.oxforddictionaries.com/definition/american_english/...
,https://en.oxforddictionaries.com/definition/...
,https://en.oxforddictionaries.com/definition/us/...
) not used for old transclusions (or transclusions with old access dates)? I believe they should be, at least in|url=
, because otherwise a citation would say the source was retrieved before it existed. - developer.oxforddictionaries.com and premium.oxforddictionaries.com are still live, so (though these specific subdomains are rarely cited) you might want to exclude them or, perhaps preferably, stop assuming all subdomains are dead.
- Nardog (talk) 21:55, 2 September 2022 (UTC)
- -- GreenC 02:53, 3 September 2022 (UTC)
- Actually the one in Pendekar is a soft-404. -- GreenC 16:23, 3 September 2022 (UTC)
- Oh, so you simply ignored
|access(-)date=
? That seems... inadvisable. Nardog (talk) 11:31, 4 September 2022 (UTC)- No I didn't ignore it. However there is no guarantee the Wayback retrieved one close to it. The problem is I don't now what your talking about "transclusion", I honestly can not follow what your saying above at all. -- GreenC 15:57, 4 September 2022 (UTC)
- Here's an example from Zymogen. It's a two-step process first it converts
{{OxfordDictionaries.com|access-date=2016-01-24|zymogen}}
to{{Cite dictionary |url=https://web.archive.org/web/20160124000000/http://www.lexico.com/definition/zymogen |title=zymogen |dictionary=[[Lexico|Oxford Dictionaries]] UK English Dictionary |publisher=[[Oxford University Press]]}}
.. the snapshot is the access-date with 6 trailing 0's. If you try that URL, the Wayback redirects to https://web.archive.org/web/20200322182724/https://www.lexico.com/definition/zymogen .. that's the final URL. This is typical, most of them ended up at 2020 and 2021. -- GreenC 16:14, 4 September 2022 (UTC)- An access date and an archive date are two separate things. The former is when the information was verified, the latter is when the archive was made. The resulting
{{Cite dictionary}}
should retain|access-date=2016-01-24
and say the dictionary is "Oxford Dictionaries", not "Lexico" (which didn't exist in 2016), much like the version before the bot edited it, and|url=
should be set tohttps://www.oxforddictionaries.com/definition/english/...
. Nardog (talk) 02:10, 5 September 2022 (UTC)- Alright, three issues:
|access-date=
: I didn't transfer because I don't think it's important once the link is dead and archived. Some might disagree but I think it's a tradeoff with clutter and ease of reading comprehension.|url=
: The{{OxfordDictionaries.com}}
was producing lexico.com and that's what the conversion did 1:1.|dictionary=Lexico
. I wrote code to use|dictionary=Oxford Dictionaries
if the access-date is older than 2019-06-11 mirroring how the template worked, as you see in the above intermediary step. There was another bit of code, after the archive URL was finally settled on, that looked at the snapshot date of the archive and if it was later than 2019-06-11 the name was changed to Lexico. As it should be since both the url the archive URL is Lexico, not Oxford.
- In theory could restore the old template as it previously existed - according to logs there are 432 with a pre-2019-06-11 access-date - then re-run the conversion with the new assumptions programmed in ie. use the correct
|access-date=
,|url=
and|dictionary=
. I'll need to understand when to use the two variations of oxforddictionaries.com you listed above (ie. www vs. en) . And which one's to do this for: only those with an old access-date, or anything that uses a template name of "Oxford Dictionaries":{{OxfordDictionaries.com}}
,{{Oxford Dictionaries}}
,{{Cite Oxford Dictionaries}}
, or anything with old access-date and the Oxford name.. there are a number of permutations and assumptions here. -- GreenC 04:19, 5 September 2022 (UTC)
- Alright, three issues:
- An access date and an archive date are two separate things. The former is when the information was verified, the latter is when the archive was made. The resulting
- Here's an example from Zymogen. It's a two-step process first it converts
- No I didn't ignore it. However there is no guarantee the Wayback retrieved one close to it. The problem is I don't now what your talking about "transclusion", I honestly can not follow what your saying above at all. -- GreenC 15:57, 4 September 2022 (UTC)
www.iihf.com/competition
The "www.iihf.com/competition" URL is dead. Many of these references can no longer be recovered. However, there are two exceptions:
- www.iihf.com/competition/385/ (←space or end of URL)
- www.iihf.com/competition/385/statistics.html
These can be recovered under a new name:
- stats.iihf.com/Hydra/385/ (without statistics.html)
Put any number from 1 to 999 there. I hope this can be done. Thanks, Maiō T. (talk) 17:15, 18 June 2022 (UTC)
For URL www.iihf.com/competition/609/ it can convert to either one:
I think webarchive is best first choice since it has a header and looks like the intended page. It has to be verified though as sometimes it works for one both or neither. -- GreenC 17:55, 16 July 2022 (UTC)
- Done Processed 209 pages, migrated 365 URLs per above, added 19 archive URLs. Example. -- GreenC 19:36, 16 July 2022 (UTC)
@GreenC: Thank you, I almost forgot about this request. I only remembered it today when I needed to write a new one. Sorry about that. I checked here regularly for the first month to see if you had replied. Were there any problems with it? Maiō T. (talk) 15:46, 11 September 2022 (UTC)
- Sometimes it takes me a while to get to the request, as it takes time to focus on it. No problems that I recall. -- GreenC 16:17, 11 September 2022 (UTC)
Replacing link to a broken archive
Hello
I noticed that this link to 2012 Guyana census downloads a broken archive:
It should be repalced by this one:
(notice the capital B in "Population_By_Village")
I think it is present on most Guyana settlements pages (and maybe some other articles about Guyana), but I wasn't able to make a list.
Regards, Şÿℵדαχ₮ɘɼɾ๏ʁ 16:30, 14 September 2022 (UTC)
- User:SyntaxTerror, this is done it edited 137 pages. There was one page Guyana that had an archive URL that was deleted. -- GreenC 20:13, 14 September 2022 (UTC)
asp → aspx
I would need to fix URLs with .asp extension. References in several articles no longer display these pages. For example,
https://www.eurobasket.com/United-Kingdom/basketball-National-Team.asp?Age=16 is wrong, and
https://www.eurobasket.com/United-Kingdom/basketball-National-Team.aspx?Age=16 is correct. Interestingly, both versions (asp & aspx) work on non-European sites. Thank you for your efforts. Maiō T. (talk) 15:45, 11 September 2022 (UTC)
- Hi Maiō T. Can you confirm this is just for, but all of, eurobasket.com that contain a .asp ? Looks like around 6,700 pages. -- GreenC 16:27, 11 September 2022 (UTC)
- Maiō T. , something has changed because both asp and aspx return content now. The content is different and I don't know which is preferred. -- GreenC 20:11, 14 September 2022 (UTC)
- @GreenC: The "...National-Team.asp?Age=16" pages don't exist so the program redirects them to the main page (with adult men's national team). The "...National-Team.aspx?Age=16" pages are correct; they deal with the under-16 national team. So the "aspx" version is preferred. Maiō T. (talk) 12:29, 16 September 2022 (UTC)
- OK it is done. -- GreenC 01:17, 17 September 2022 (UTC)
- @GreenC: Thank you, good job! Maiō T. (talk) 12:32, 17 September 2022 (UTC)
- OK it is done. -- GreenC 01:17, 17 September 2022 (UTC)
- @GreenC: The "...National-Team.asp?Age=16" pages don't exist so the program redirects them to the main page (with adult men's national team). The "...National-Team.aspx?Age=16" pages are correct; they deal with the under-16 national team. So the "aspx" version is preferred. Maiō T. (talk) 12:29, 16 September 2022 (UTC)
Emporis.com links
As of last week, it seems that Emporis has been shut down, and all of its links have gone dead. Every single link to the website now leads to an error page that looks like this. I'm not sure how many articles are affected by Emporis's shutdown, but I believe this issue affects thousands of links. – Epicgenius (talk) 13:57, 20 September 2022 (UTC)
- @Epicgenius, Jklamo, and BD2412: I finished most of them last night per initial request at User_talk:BrownHairedGirl#Emporis.com_has_gone,_but_is_preserved. This will be the new "official thread". There's another 20% I need to custom program for, the Wayback Machine has them but they are kind of hidden from API view. In addition there is a request at Wikipedia_talk:WikiProject_Skyscrapers#Emporis_end to convert the
{{Emporis}}
template which I'll be working on. -- GreenC 15:12, 20 September 2022 (UTC)- Great work - barnstars and medals all around! BD2412 T 16:48, 20 September 2022 (UTC)
- Great work, @GreenC. And there's that old 80:20 rule poking its annoying head in again to make more work for you. BrownHairedGirl (talk) • (contribs) 17:50, 20 September 2022 (UTC)
- Right! hah whoever invented that rule much prefer 95/5 with the 5 being so hard you can safely skip it. -- GreenC 20:57, 20 September 2022 (UTC)
Report
- Converted 1,204
{{Emporis}}
to{{Cite web}}
: Example - Added archive-url to 7,097 citations: Example
- Wayback CDX trawling that saved 251 citations: Example
Anything else, let me know. -- GreenC 19:48, 22 September 2022 (UTC)
Kalki links
The domain is kalkionline.com. All links are dead, so anything tagged as url-status=live
may be changed to dead. Kailash29792 (talk) 04:59, 24 September 2022 (UTC)
- User:Kailash29792, thanks for the reminder! I just ran the first 20 articles. In Special:Contributions/GreenC_bot from Thudikkum Karangal to Magalir Mattum (1994 film). Can you take a look and provide any feedback before proceeding further? I see a bunch don't have archive.today available. I could add the
{{dead link}}
now, and go back later to add the archive if or when it becomes available. -- GreenC 03:35, 25 September 2022 (UTC)- Yeah, and I regret not having archived them before. Thankfully, the Internet Archive has a bunch of them. You may continue tagging the dead links while I manually replace the links. Kailash29792 (talk) 04:38, 25 September 2022 (UTC)
- I can provide a list of the dead links, if that helps. -- GreenC 14:42, 25 September 2022 (UTC)
- Sure. Kailash29792 (talk) 15:12, 25 September 2022 (UTC)
- OK. Also there are about a dozen edits like this with commented out URLs with a wayback link with a 1899 date. It's an artifact how the bot works internally, normally I'd fix those by hand, but I think you created those comments so I am not going to worry about it, if that's alright, all I would do is remove the wayback portion. of the link. -- GreenC 17:35, 25 September 2022 (UTC)
- Wikipedia:Link rot/cases/kalkionline.com - list of 307 dead links -- GreenC 17:57, 25 September 2022 (UTC)
- @Kailash29792: It's done. Let me know if/when you want to run it again, based on archive availability -- GreenC 18:32, 25 September 2022 (UTC)
- Sure. Kailash29792 (talk) 15:12, 25 September 2022 (UTC)
- I can provide a list of the dead links, if that helps. -- GreenC 14:42, 25 September 2022 (UTC)
- Yeah, and I regret not having archived them before. Thankfully, the Internet Archive has a bunch of them. You may continue tagging the dead links while I manually replace the links. Kailash29792 (talk) 04:38, 25 September 2022 (UTC)