Jump to content

User:Fabrickator/christianpost pages display as blank

From Wikipedia, the free encyclopedia

This page 'christianpost pages display as blank' originally was a section of User:Fabrickator/sandbox

Here's an example of such an url:

https://www.christianpost.com/article/20070810/28848_Bibles,_Crucifixes_Not_Allowed_into_Saudi_Arabia

The thing to look for is that the last component of the local path begins with a 5-digit number.

To find a working url, do a search based on the text following that number, e.g. Bibles,_Crucifixes_Not_Allowed_into_Saudi_Arabia.

Candidate pages are shown below. Part of this effort includes identifying alternate archived urls, specifically showing earlier renditions of an article. These are usually to be found on "archive.today" a.k.a. "archive.is" and "archive.li". Since the same "stories" may be referenced from different Wikipedia articles, the preliminary part of the effort will be to enumerate the affected "stories" in each Wikipedia article.

Here is a description of problems encountered with links for christianpost.com, largely associated with the fact that the format of article links changed at some point in time. In the older links, the final component of the link consists of a numeric article id followed by the article title; in the newer links, the final component of the link consists of the article titled followed by the numeric article id. In some cases, the final component contains only the numeric article id. At some point, the older format links were evidently converted to the newer format, with a redirect going to the new link format. However, the redirect of older links was removed at some point, and older links, rather than returning a 404 error, return an empty page. At other times, these links redirected to the Christian Post home page. So in some cases, wayback has archived the Christian Post home page, causing archive.today to use that as an archive copy. Additionally, the robots.txt file for christianpost.com has been updated to block IABot, causing wayback to refuse to return pages that may have previously been archived.

With that background, here goes:

  1. old format links return blank pages, and IABot fails to detect these links as broken, since they don't get a 404 error
  2. pages previously archived by wayback are no longer accessible
  3. in some cases, pages archived by archive.today are actually an archive of the christianpost.com home page
  4. search on title sometimes fails to find live link on christianpost.com

When search turns up a "directory page" on an "archive.today" site, it is necessary to check google's cached page to find the actual link.

Note there are some links to subsdomains of christianost.com, to which the wayback resetrictions may not necessarily apply

  • blogs.christianpost.com
  • breathecast.christianpost.com
  • chinese.christianpost.com
  • crossmap.christianpost.com
  • dev.christianpost.com
  • espanol.christianpost.com
  • global.christianpost.com
  • gnli.christianpost.com
  • ipost.christianpost.com
  • m.christianpost.com
  • portugues.christianpost.com
  • portuguese.christianpost.com
  • sg.christianpost.com
  • world.christianpost.com

These are pages with obsolete (old-style) links that need updating:

Here are relevant links to earlier renditions of the "stories" (these are mostly archived versions of articles posted at christianpost.com, but christianitytoday.com and gospelherald.com should also be considered as potential sources since they frequently post the same stories. This is a list of links to the various stories, organized by the story. These are mostly for stories from christianpost.com, including archived copies, but christianitytoday.com and gospelherald.com should also be considered as potential sources since they frequently post the same stories as as posted at christianpost.com.

(ordered by date of story)