User:Chlod/Tools/JTWC Archiver
Original author(s) | Chlod Alejandro |
---|---|
Developer(s) | Contributors to the JTWC Archiver |
Initial release | September 28, 2020 |
Stable release | 1.1.0 / October 6, 2020 |
Repository | https://github.com/ChlodAlejandro/jtwc-archiver |
Written in | JavaScript |
Engine | Node.js |
Type | Web scraper |
License | |
Website | https://wiki.chlod.net/jtwc |
The JTWC Archiver is a Node.js script that parses the Joint Typhoon Warning Center's RSS feed and archives whichever version of each product is available at the time. This way, JTWC warnings that were not immediately added to an article can still be used later. The script runs once every 10 minutes and archives each new product.
Because these files are archived straight from the source, they are effectively identical to the actual JTWC warnings, and you can use them as a basis for citation.
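As a rough illustration of the scraping step, the sketch below pulls product links out of a copy of the feed text. It is not taken from the archiver's source: the function name `extractProductUrls` is hypothetical, and the assumption that products are linked under `https://www.metoc.navy.mil/jtwc/products/` is inferred only from the example URL in the Usage section.

```javascript
// Hypothetical helper, not the JTWC Archiver's actual code.
// Assumption: product links share the URL prefix seen in the Usage example
// and end in .txt or .gif.
const PRODUCT_LINK =
  /https:\/\/www\.metoc\.navy\.mil\/jtwc\/products\/([A-Za-z0-9_-]+)\.(txt|gif)/g;

// Returns each distinct product URL found in the feed text, in order of
// first appearance.
function extractProductUrls(rssText) {
  const seen = new Set();
  for (const match of rssText.matchAll(PRODUCT_LINK)) {
    seen.add(match[0]);
  }
  return [...seen];
}
```

A scheduler (for example, a cron entry firing every 10 minutes) could pass the downloaded feed to a function like this and then fetch any product that has changed since the previous run.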
Usage
You can browse the Tropical Cyclone Formation Alert and Tropical Cyclone Warning texts and images at https://wiki.chlod.net/jtwc.
To cite a JTWC warning, use the following:
- Tropical Storm 15W (Kujira) Warning No. 10 (Report). United States Joint Typhoon Warning Center. 29 September 2020. Archived from the original on 29 September 2020. Retrieved 29 September 2020.
{{Cite web|last=|first=|date={{subst:date}}|title=Tropical Storm 15W (Kujira) Warning No. 10|url=https://www.metoc.navy.mil/jtwc/products/wp1520web.txt|url-status=dead|archive-url=https://wiki.chlod.net/jtwc/text/2020-09-29-0200-wp1520web.txt|archive-date={{subst:date}}|access-date={{subst:date}}|publisher=United States Joint Typhoon Warning Center}}
You may choose to drop the publisher parameter entirely, like I do.
- "Tropical Storm 15W (Kujira) Warning No. 10". United States Joint Typhoon Warning Center. 29 September 2020. Archived from the original on 29 September 2020. Retrieved 29 September 2020.
{{Cite web|last=|first=|date={{subst:date}}|title=Tropical Storm 15W (Kujira) Warning No. 10|url=https://www.metoc.navy.mil/jtwc/products/wp1520web.txt|url-status=dead|archive-url=https://wiki.chlod.net/jtwc/text/2020-09-29-0200-wp1520web.txt|archive-date={{subst:date}}|access-date={{subst:date}}|publisher=United States Joint Typhoon Warning Center}}
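If you cite many warnings, the template can be generated rather than typed by hand. The following is a minimal sketch, assuming the original and archive URLs always follow the pattern in the example above; `buildCiteWeb` and its option names are hypothetical and are not part of the archiver.

```javascript
// Hypothetical helper: assembles a {{Cite web}} call using the same
// parameters as the example above. The URL prefixes are copied from that
// example and are assumed to hold for other products.
function buildCiteWeb({ title, product, archiveStamp, date, urlStatus = "dead" }) {
  const params = [
    ["date", date],
    ["title", title],
    ["url", `https://www.metoc.navy.mil/jtwc/products/${product}`],
    ["url-status", urlStatus],
    ["archive-url", `https://wiki.chlod.net/jtwc/text/${archiveStamp}-${product}`],
    ["archive-date", date],
    ["access-date", date],
    ["publisher", "United States Joint Typhoon Warning Center"],
  ];
  const body = params.map(([key, value]) => `${key}=${value}`).join("|");
  return `{{Cite web|${body}}}`;
}

// Example call using the values from the citation above.
const kujira = buildCiteWeb({
  title: "Tropical Storm 15W (Kujira) Warning No. 10",
  product: "wp1520web.txt",
  archiveStamp: "2020-09-29-0200",
  date: "{{subst:date}}",
});
```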
At the end of each year, most of the links are replaced with more resilient backups. See the Periodic archiving section below for details.
Output
In this tree, the JTWC Archiver was run while the hypothetical wp4220 system was active, and its products were archived on September 28, 2020 at 00:00 UTC.
- Working directory
  - jtwc.rss: The latest copy of the JTWC RSS. This is used to check if there were any updates to the JTWC bulletin.
  - jtwc_products: The folder containing all archived JTWC products.
    - gif: The folder containing graphics for TCFAs and TCWs.
    - jmv: The folder containing JMV 3.0 data.
    - prog: The folder containing tropical cyclone prognostic reasonings.
    - text: The folder containing the TCFA and TCW warnings.
      - 2020-09-28-0000-abioweb.txt: The advisories for the ABIO sector (the Indian Ocean). Since this text file is always provided when a system in that sector is active, it will be archived as well.
      - 2020-09-28-0000-abpwweb.txt: The advisories for the ABPW sector (the Pacific Ocean). Since this text file is always provided when a system in that sector is active, it will be archived as well.
      - 2020-09-28-0000-wp4220web.txt: The TCFA/TCW text for wp4220 exactly at the moment of archiving.
      - latest-abioweb.txt: The latest ABIO advisory. This file is overwritten when a new advisory is issued.
      - latest-abpwweb.txt: The latest ABPW advisory. This file is overwritten when a new advisory is issued.
      - latest-wp4220.txt: The latest TCFA/TCW text for wp4220. This file is overwritten when a new warning is issued.
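The dated file names in the tree above appear to follow a `YYYY-MM-DD-HHmm-<product>` pattern in UTC. The sketch below reproduces that pattern; `archiveNames` is a hypothetical helper, and the use of UTC is an assumption based on the "00:00 UTC" example. The tree lists `latest-wp4220.txt` for the system file, so the real script may name its "latest" copies slightly differently from what this sketch produces.

```javascript
// Hypothetical helper reproducing the naming pattern shown in the tree.
// Assumption: timestamps in file names are in UTC.
function archiveNames(product, when) {
  const pad = (n) => String(n).padStart(2, "0");
  const stamp = [
    when.getUTCFullYear(),
    pad(when.getUTCMonth() + 1), // getUTCMonth() is zero-based
    pad(when.getUTCDate()),
    pad(when.getUTCHours()) + pad(when.getUTCMinutes()),
  ].join("-");
  return {
    dated: `${stamp}-${product}`, // permanent copy, never overwritten
    latest: `latest-${product}`,  // overwritten on every new issuance
  };
}

// Example: the hypothetical wp4220 warning archived on 2020-09-28 00:00 UTC.
const names = archiveNames("wp4220web.txt", new Date(Date.UTC(2020, 8, 28, 0, 0)));
// names.dated is "2020-09-28-0000-wp4220web.txt"
```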
url-status
The url-status parameter in {{Cite web}} should always be either dead (which emphasizes the archived version over the original) or unfit (which hides the original entirely). The links that lead to a specific warning are time-sensitive and will change, so the archived version is preferred over the live version (which may already point to an entirely different system).
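For anyone checking citations in bulk, this rule is easy to test mechanically. The function below is a minimal sketch and is not part of the archiver: `hasPreferredUrlStatus` is a hypothetical name, and the regular expression only handles simple, un-nested parameter values.

```javascript
// Hypothetical lint helper: returns true when a citation's url-status
// parameter is "dead" or "unfit", and false when it is missing or set to
// anything else.
function hasPreferredUrlStatus(citation) {
  const match = /\|\s*url-status\s*=\s*([^|}]*)/.exec(citation);
  if (!match) return false;
  const value = match[1].trim();
  return value === "dead" || value === "unfit";
}
```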
Periodic archiving
At the end of each year, all collected products are uploaded to the Internet Archive for permanent storage. There are three main reasons for this:
- The Internet Archive is a better-known (and thus more widely trusted) website, which helps alleviate SELFPUB and privacy-related concerns (even though the archiver website is just an Apache directory browser).
- Though the total file size for a year of operation is not significant, there might come a time when I am forced to clear out space on the server. If that happens, those files would permanently return Not Found and would no longer be accessible.
- The JTWC Archiver, despite being an archive, was initially designed as a temporary gathering place for bulletins while parts of an article had not yet been written. Since bulletins change rapidly while articles may not, there are instances where an old version is no longer available because it was overwritten. Though I can theoretically hold all bulletins up until, say, 2030, I'd like some assurance that the bulletins will be stored "forever" by a capable and established archiving service.
Though those bulletins will be archived, it does not mean that they will be removed from the website immediately. They're put on the Internet Archive for that exact reason: archiving. It's meant to be a backup in case things go south on my end.
A list of archives is provided below.
- 2020 (AT • NI • PC • WP) – Archived on January 12, 2021 – TXT • GIF • PROG
- Note: The archive project started on September 28, 2020, and thus, this year only has products beginning September 28, 2020.
- 2021 (AT • NI • PC • WP) – Archived on January 12, 2021 – TXT • GIF • PROG • JMV
- Note: JMV 3.0 data archiving started on May 16, 2021, and thus, this year has JMV 3.0 data only for products beginning May 16, 2021.
- 2022 (AT • NI • PC • WP) – Archived on March 5, 2023 – TXT • GIF • PROG • JMV
- 2023 (AT • NI • PC • WP) – Archived on September 2, 2024 – TXT • GIF • PROG • JMV