User:Dsprc/CIA

From Wikipedia, the free encyclopedia


201

97 pages of difficult-to-read PDF scans, named 00[01-97].pdf. The documents span the 000X to 00XX space (where X stands for any digit), so one can plug in any number from 0001 to 0097 and get a document.

First document: 0001.pdf is the first document of the set.

  • random docs in between to illustrate numbering:
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0009.pdf
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0010.pdf
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0011.pdf
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0031.pdf
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0036.pdf
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0072.pdf
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0084.pdf
* http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_0085.pdf
  • end random docs

Last document: 0097.pdf is the last document located in the above set using the 00[0X-XX].pdf numbering scheme (0098, 0099, and 0100 do not exist).
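
The numbering scheme above can be sketched as a pair of small Bash helpers. This is only an illustration: the function names doc_url and doc_exists are made up here, the base URL is copied from the links listed above, and nothing below guarantees the server still answers at that address. The existence check relies on wget's --spider mode, which checks a URL without saving the file.

```shell
#!/usr/bin/env bash
# Base URL copied from the links above: everything before the 4-digit suffix.
BASE='http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_'

# doc_url N: print the URL for document number N (hypothetical helper).
doc_url() {
    # 10#... forces base 10, so input such as "0009" is not read as octal.
    local n=$((10#$1))
    # BASE is passed as an argument rather than placed in the format string,
    # because it contains literal % characters.
    printf '%s%04d.pdf\n' "$BASE" "$n"
}

# doc_exists N: succeed if wget can reach document N (hypothetical helper).
# -q silences output; --spider checks the URL without downloading it.
doc_exists() {
    wget -q --spider "$(doc_url "$1")"
}

# Show the pattern for two sample numbers (no network access needed).
doc_url 9
doc_url 97

# Example probe around the end of the set (needs network access, so it is
# left commented out):
# for n in 97 98 99 100; do
#     if doc_exists "$n"; then echo "$n exists"; else echo "$n missing"; fi
# done
```

Zero-padding with printf keeps the helper independent of brace expansion, so the same function works for any single number typed by hand.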

Bulk download

To download all files in the set, one can use this Bash one-liner, which relies on the shell's for loop and brace expansion facilities (zero-padded ranges such as {0001..0097} need Bash 4 or later) and on GNU wget:

$ for X in {0001..0097}; do wget -c "http://www.foia.cia.gov/sites/default/files/document_conversions/1705143/DTPILLAR%20%20%20VOL.%201_$X.pdf"; done

This pulls every document in the range given inside the curly brackets (brace expansion). Files are saved with names like "DTPILLAR VOL. 1_0001.pdf" (with three spaces after "DTPILLAR", not the single space that wikimarkup displays). The annoying spaces in the filenames can be handled lazily with a utility such as detox:

$ detox *    # that simple

This converts the name into "DTPILLAR_VOL._1_0001.pdf".

Alternatively, one can build more complex regular expressions into the one-liner to clean up the names.
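
As a rough sketch of the regular-expression route, the snippet below collapses each run of spaces into a single underscore with sed, which gives the same result as the detox example for these particular filenames. The helper name clean_name is made up for illustration, and the loop assumes the downloaded PDFs sit in the current directory.

```shell
#!/usr/bin/env bash
# clean_name FILE: print FILE with every run of spaces replaced by one underscore.
# (Hypothetical helper; sed -E enables extended regular expressions.)
clean_name() {
    printf '%s\n' "$1" | sed -E 's/ +/_/g'
}

# Rename every PDF in the current directory whose name contains a space.
for f in *\ *.pdf; do
    [ -e "$f" ] || continue              # the glob matched nothing
    mv -n -- "$f" "$(clean_name "$f")"   # -n: never overwrite an existing file
done
```

Unlike detox, this touches only spaces, so any other awkward characters in a filename are left as they are.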

202

Possibly the same scheme

This set of documents was also found to have a similar numbering scheme, so it quite possibly uses the same 00[01-XX].pdf pattern as the previous set:

0xDEADBEEF

Random documents that mention Radio Free Asia but lack any discernible numbering pattern or scheme: