Filename Size (bytes)
academicearth.org-shallow-20181002-103008-d8f95-00000.warc.gz 1870298 download   job
academicearth.org-shallow-20181002-103008-d8f95-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103008-d8f95-meta.warc.gz 9093 download   job
academicearth.org-shallow-20181002-103008-d8f95-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103008-d8f95.json 288 download   job
academicearth.org-shallow-20181002-103039-7l8y3-00000.warc.gz 4006 download   job
academicearth.org-shallow-20181002-103039-7l8y3-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103039-7l8y3-meta.warc.gz 3455 download   job
academicearth.org-shallow-20181002-103039-7l8y3-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103039-7l8y3.json 289 download   job
academicearth.org-shallow-20181002-103054-5rtg4-00000.warc.gz 1982770 download   job
academicearth.org-shallow-20181002-103054-5rtg4-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103054-5rtg4-meta.warc.gz 8170 download   job
academicearth.org-shallow-20181002-103054-5rtg4-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103054-5rtg4.json 252 download   job
academicearth.org-shallow-20181002-103208-by1jj-00000.warc.gz 10055 download   job
academicearth.org-shallow-20181002-103208-by1jj-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103208-by1jj-meta.warc.gz 3504 download   job
academicearth.org-shallow-20181002-103208-by1jj-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103208-by1jj.json 277 download   job
academicearth.org-shallow-20181002-103338-ahgs0-00000.warc.gz 10071 download   job
academicearth.org-shallow-20181002-103338-ahgs0-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103338-ahgs0-meta.warc.gz 3513 download   job
academicearth.org-shallow-20181002-103338-ahgs0-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103338-ahgs0.json 288 download   job
academicearth.org-shallow-20181002-103453-30sdv-00000.warc.gz 1850179 download   job
academicearth.org-shallow-20181002-103453-30sdv-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103453-30sdv-meta.warc.gz 9095 download   job
academicearth.org-shallow-20181002-103453-30sdv-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103453-30sdv.json 285 download   job
academicearth.org-shallow-20181002-103550-bh3xe-00000.warc.gz 4023 download   job
academicearth.org-shallow-20181002-103550-bh3xe-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103550-bh3xe-meta.warc.gz 3454 download   job
academicearth.org-shallow-20181002-103550-bh3xe-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103550-bh3xe.json 293 download   job
academicearth.org-shallow-20181002-103939-6eyze-00000.warc.gz 1241402 download   job
academicearth.org-shallow-20181002-103939-6eyze-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103939-6eyze-meta.warc.gz 7741 download   job
academicearth.org-shallow-20181002-103939-6eyze-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-103939-6eyze.json 256 download   job
academicearth.org-shallow-20181002-104011-2q2jf-00000.warc.gz 512269 download   job
academicearth.org-shallow-20181002-104011-2q2jf-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-104011-2q2jf-meta.warc.gz 5693 download   job
academicearth.org-shallow-20181002-104011-2q2jf-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-104011-2q2jf.json 289 download   job
academicearth.org-shallow-20181002-104137-6jbip-00000.warc.gz 2174564 download   job
academicearth.org-shallow-20181002-104137-6jbip-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-104137-6jbip-meta.warc.gz 9011 download   job
academicearth.org-shallow-20181002-104137-6jbip-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-104137-6jbip.json 282 download   job
academicearth.org-shallow-20181002-112946-vvqbj-00000.warc.gz 438736 download   job
academicearth.org-shallow-20181002-112946-vvqbj-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-112946-vvqbj-meta.warc.gz 5936 download   job
academicearth.org-shallow-20181002-112946-vvqbj-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-112946-vvqbj.json 275 download   job
academicearth.org-shallow-20181002-113140-5k7bc-00000.warc.gz 1800991 download   job
academicearth.org-shallow-20181002-113140-5k7bc-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-113140-5k7bc-meta.warc.gz 9224 download   job
academicearth.org-shallow-20181002-113140-5k7bc-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-113140-5k7bc.json 287 download   job
academicearth.org-shallow-20181002-113706-vrydp-00000.warc.gz 440266 download   job
academicearth.org-shallow-20181002-113706-vrydp-00000.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-113706-vrydp-meta.warc.gz 5905 download   job
academicearth.org-shallow-20181002-113706-vrydp-meta.warc.os.cdx.gz 0 download
academicearth.org-shallow-20181002-113706-vrydp.json 263 download   job
answers.google.com-shallow-20181002-105313-ed6wb-00000.warc.gz 24528 download   job
answers.google.com-shallow-20181002-105313-ed6wb-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105313-ed6wb-meta.warc.gz 3722 download   job
answers.google.com-shallow-20181002-105313-ed6wb-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105313-ed6wb.json 280 download   job
answers.google.com-shallow-20181002-105438-eqdgg-00000.warc.gz 52445 download   job
answers.google.com-shallow-20181002-105438-eqdgg-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105438-eqdgg-meta.warc.gz 3814 download   job
answers.google.com-shallow-20181002-105438-eqdgg-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105438-eqdgg.json 280 download   job
answers.google.com-shallow-20181002-105544-2d3fq-00000.warc.gz 27309 download   job
answers.google.com-shallow-20181002-105544-2d3fq-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105544-2d3fq-meta.warc.gz 3805 download   job
answers.google.com-shallow-20181002-105544-2d3fq-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105544-2d3fq.json 280 download   job
answers.google.com-shallow-20181002-105746-cplgi-00000.warc.gz 23706 download   job
answers.google.com-shallow-20181002-105746-cplgi-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105746-cplgi-meta.warc.gz 3799 download   job
answers.google.com-shallow-20181002-105746-cplgi-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105746-cplgi.json 280 download   job
answers.google.com-shallow-20181002-105831-74s1j-00000.warc.gz 32482 download   job
answers.google.com-shallow-20181002-105831-74s1j-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105831-74s1j-meta.warc.gz 3807 download   job
answers.google.com-shallow-20181002-105831-74s1j-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-105831-74s1j.json 285 download   job
answers.google.com-shallow-20181002-115342-eql6u-00000.warc.gz 30302 download   job
answers.google.com-shallow-20181002-115342-eql6u-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-115342-eql6u-meta.warc.gz 3804 download   job
answers.google.com-shallow-20181002-115342-eql6u-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-115342-eql6u.json 280 download   job
answers.google.com-shallow-20181002-115427-5e4jd-00000.warc.gz 26707 download   job
answers.google.com-shallow-20181002-115427-5e4jd-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-115427-5e4jd-meta.warc.gz 3765 download   job
answers.google.com-shallow-20181002-115427-5e4jd-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-115427-5e4jd.json 280 download   job
answers.google.com-shallow-20181002-115725-3wq8v-00000.warc.gz 24341 download   job
answers.google.com-shallow-20181002-115725-3wq8v-00000.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-115725-3wq8v-meta.warc.gz 3806 download   job
answers.google.com-shallow-20181002-115725-3wq8v-meta.warc.os.cdx.gz 0 download
answers.google.com-shallow-20181002-115725-3wq8v.json 285 download   job
archiveteam_archivebot_go_20181002120002.cdx.gz 86817715 download
archiveteam_archivebot_go_20181002120002.cdx.idx 90438 download
archiveteam_archivebot_go_20181002120002_archive.torrent 46663 download
archiveteam_archivebot_go_20181002120002_files.xml 0 download
archiveteam_archivebot_go_20181002120002_meta.sqlite 433152 download
archiveteam_archivebot_go_20181002120002_meta.xml 758 download
cdn02.dayviews.com-shallow-20181002-113231-ecqsu-00000.warc.gz 7110 download   job
cdn02.dayviews.com-shallow-20181002-113231-ecqsu-00000.warc.os.cdx.gz 0 download
cdn02.dayviews.com-shallow-20181002-113231-ecqsu.json 397 download   job
edjimy.tian.yam.com-inf-20180930-121100-b0h9h-00014.warc.gz 5369342732 download   job
edjimy.tian.yam.com-inf-20180930-121100-b0h9h-00014.warc.os.cdx.gz 0 download
edjimy.tian.yam.com-inf-20180930-121100-b0h9h-00015.warc.gz 5369832824 download   job
edjimy.tian.yam.com-inf-20180930-121100-b0h9h-00015.warc.os.cdx.gz 0 download
edjimy.tian.yam.com-inf-20180930-121100-b0h9h-00016.warc.gz 5375393343 download   job
edjimy.tian.yam.com-inf-20180930-121100-b0h9h-00016.warc.os.cdx.gz 0 download
en.wikipedia.org-shallow-20181002-111101-d85v5-00000.warc.gz 352715 download   job
en.wikipedia.org-shallow-20181002-111101-d85v5-00000.warc.os.cdx.gz 0 download
faculty.mdanderson.org-shallow-20181002-110418-1lkex-00000.warc.gz 14925352 download   job
faculty.mdanderson.org-shallow-20181002-110418-1lkex-00000.warc.os.cdx.gz 0 download
faculty.mdanderson.org-shallow-20181002-110418-1lkex-meta.warc.gz 13572 download   job
faculty.mdanderson.org-shallow-20181002-110418-1lkex-meta.warc.os.cdx.gz 0 download
faculty.mdanderson.org-shallow-20181002-110418-1lkex.json 284 download   job
greensing.ning.com-shallow-20181002-102251-c70iy-00000.warc.gz 14225 download   job
greensing.ning.com-shallow-20181002-102251-c70iy-00000.warc.os.cdx.gz 0 download
greensing.ning.com-shallow-20181002-102251-c70iy-meta.warc.gz 3447 download   job
greensing.ning.com-shallow-20181002-102251-c70iy-meta.warc.os.cdx.gz 0 download
greensing.ning.com-shallow-20181002-102251-c70iy.json 252 download   job
iuoma-network.ning.com-shallow-20181002-102521-1b96z-00000.warc.gz 8000546 download   job
iuoma-network.ning.com-shallow-20181002-102521-1b96z-00000.warc.os.cdx.gz 0 download
iuoma-network.ning.com-shallow-20181002-102521-1b96z-meta.warc.gz 22410 download   job
iuoma-network.ning.com-shallow-20181002-102521-1b96z-meta.warc.os.cdx.gz 0 download
iuoma-network.ning.com-shallow-20181002-102521-1b96z.json 256 download   job
legacy.zam.com-inf-20180914-182710-10rv3-00051.warc.gz 5373828987 download   job
legacy.zam.com-inf-20180914-182710-10rv3-00051.warc.os.cdx.gz 0 download
legislature.camera.it-shallow-20181002-104657-c3dbm-00000.warc.gz 46115 download   job
legislature.camera.it-shallow-20181002-104657-c3dbm-00000.warc.os.cdx.gz 0 download
legislature.camera.it-shallow-20181002-104657-c3dbm-meta.warc.gz 4312 download   job
legislature.camera.it-shallow-20181002-104657-c3dbm-meta.warc.os.cdx.gz 0 download
legislature.camera.it-shallow-20181002-104657-c3dbm.json 394 download   job
letyourhairdownin.ning.com-shallow-20181002-102411-98g9h-00000.warc.gz 344610 download   job
letyourhairdownin.ning.com-shallow-20181002-102411-98g9h-00000.warc.os.cdx.gz 0 download
letyourhairdownin.ning.com-shallow-20181002-102411-98g9h-meta.warc.gz 5302 download   job
letyourhairdownin.ning.com-shallow-20181002-102411-98g9h-meta.warc.os.cdx.gz 0 download
letyourhairdownin.ning.com-shallow-20181002-102411-98g9h.json 261 download   job
letyourhairdownin.ning.com-shallow-20181002-102420-df0s0-00000.warc.gz 926386 download   job
letyourhairdownin.ning.com-shallow-20181002-102420-df0s0-00000.warc.os.cdx.gz 0 download
letyourhairdownin.ning.com-shallow-20181002-102420-df0s0-meta.warc.gz 10906 download   job
letyourhairdownin.ning.com-shallow-20181002-102420-df0s0-meta.warc.os.cdx.gz 0 download
letyourhairdownin.ning.com-shallow-20181002-102420-df0s0.json 260 download   job
news.sky.com-shallow-20181002-102842-4qint-00000.warc.gz 835323 download   job
news.sky.com-shallow-20181002-102842-4qint-00000.warc.os.cdx.gz 0 download
news.sky.com-shallow-20181002-102842-4qint-meta.warc.gz 7229 download   job
news.sky.com-shallow-20181002-102842-4qint-meta.warc.os.cdx.gz 0 download
news.sky.com-shallow-20181002-102842-4qint.json 327 download   job
orpheus.network-inf-20181002-113330-7mms3-00000.warc.gz 48178 download   job
orpheus.network-inf-20181002-113330-7mms3-00000.warc.os.cdx.gz 0 download
orpheus.network-inf-20181002-113330-7mms3-meta.warc.gz 3795 download   job
orpheus.network-inf-20181002-113330-7mms3-meta.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-090441-6f22x-00000.warc.gz 421683 download   job
pastebin.com-shallow-20181002-090441-6f22x-00000.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-090441-6f22x-meta.warc.gz 6049 download   job
pastebin.com-shallow-20181002-090441-6f22x-meta.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-090441-6f22x.json 255 download   job
pastebin.com-shallow-20181002-090513-83sb8-00000.warc.gz 5159 download   job
pastebin.com-shallow-20181002-090513-83sb8-00000.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-090513-83sb8-meta.warc.gz 3412 download   job
pastebin.com-shallow-20181002-090513-83sb8-meta.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-090513-83sb8.json 259 download   job
pastebin.com-shallow-20181002-091010-dajc6-00000.warc.gz 419559 download   job
pastebin.com-shallow-20181002-091010-dajc6-00000.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-091010-dajc6-meta.warc.gz 6087 download   job
pastebin.com-shallow-20181002-091010-dajc6-meta.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-091010-dajc6.json 255 download   job
pastebin.com-shallow-20181002-091215-dadnd-00000.warc.gz 4015 download   job
pastebin.com-shallow-20181002-091215-dadnd-00000.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-091215-dadnd-meta.warc.gz 3413 download   job
pastebin.com-shallow-20181002-091215-dadnd-meta.warc.os.cdx.gz 0 download
pastebin.com-shallow-20181002-091215-dadnd.json 258 download   job
runescape.wikia.com-inf-20180929-232416-780ml-00006.warc.gz 5368949679 download   job
runescape.wikia.com-inf-20180929-232416-780ml-00006.warc.os.cdx.gz 0 download
telehack.com-inf-20181002-112907-96m46-00000.warc.gz 29015421 download   job
telehack.com-inf-20181002-112907-96m46-00000.warc.os.cdx.gz 0 download
telehack.com-inf-20181002-112907-96m46-meta.warc.gz 32796 download   job
telehack.com-inf-20181002-112907-96m46-meta.warc.os.cdx.gz 0 download
telehack.com-inf-20181002-112907-96m46.json 241 download   job
twitter.com-shallow-20181002-111241-1k88m-00000.warc.gz 2571194 download   job
twitter.com-shallow-20181002-111241-1k88m-00000.warc.os.cdx.gz 0 download
upload.wikimedia.org-shallow-20181002-091606-6zqb4-00000.warc.gz 7912 download   job
upload.wikimedia.org-shallow-20181002-091606-6zqb4-00000.warc.os.cdx.gz 0 download
upload.wikimedia.org-shallow-20181002-091606-6zqb4-meta.warc.gz 3528 download   job
upload.wikimedia.org-shallow-20181002-091606-6zqb4-meta.warc.os.cdx.gz 0 download
upload.wikimedia.org-shallow-20181002-091606-6zqb4.json 292 download   job
upload.wikimedia.org-shallow-20181002-091607-8f41z-00000.warc.gz 2681111 download   job
upload.wikimedia.org-shallow-20181002-091607-8f41z-00000.warc.os.cdx.gz 0 download
upload.wikimedia.org-shallow-20181002-091607-8f41z-meta.warc.gz 3558 download   job
upload.wikimedia.org-shallow-20181002-091607-8f41z-meta.warc.os.cdx.gz 0 download
upload.wikimedia.org-shallow-20181002-091607-8f41z.json 301 download   job
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00016.warc.gz 2151342495 download   job
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00016.warc.os.cdx.gz 0 download
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00017.warc.gz 2147745650 download   job
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00017.warc.os.cdx.gz 0 download
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00018.warc.gz 2147928582 download   job
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00018.warc.os.cdx.gz 0 download
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00019.warc.gz 2147707070 download   job
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00019.warc.os.cdx.gz 0 download
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00020.warc.gz 2147515371 download   job
urls-pastebin.com-QQgd3Mnw-inf-20181001-062849-j5tuw-00020.warc.os.cdx.gz 0 download
urls-pastebin.com-WexC0XMB-inf-20181001-051840-5pljq-00001.warc.gz 5374532873 download   job
urls-pastebin.com-WexC0XMB-inf-20181001-051840-5pljq-00001.warc.os.cdx.gz 0 download
urls-pastebin.com-WexC0XMB-inf-20181001-051840-5pljq-00002.warc.gz 389305015 download   job
urls-pastebin.com-WexC0XMB-inf-20181001-051840-5pljq-00002.warc.os.cdx.gz 0 download
urls-pastebin.com-WexC0XMB-inf-20181001-051840-5pljq-meta.warc.gz 6361397 download   job
urls-pastebin.com-WexC0XMB-inf-20181001-051840-5pljq-meta.warc.os.cdx.gz 0 download
urls-pastebin.com-WexC0XMB-inf-20181001-051840-5pljq.json 282 download   job
urls-pastebin.com-YXuifWtQ-shallow-20181002-090710-djvhg-00000.warc.gz 4497088 download   job
urls-pastebin.com-YXuifWtQ-shallow-20181002-090710-djvhg-00000.warc.os.cdx.gz 0 download
urls-pastebin.com-YXuifWtQ-shallow-20181002-090710-djvhg-meta.warc.gz 12913 download   job
urls-pastebin.com-YXuifWtQ-shallow-20181002-090710-djvhg-meta.warc.os.cdx.gz 0 download
urls-pastebin.com-YXuifWtQ-shallow-20181002-090710-djvhg-urls.txt 410 download
urls-pastebin.com-YXuifWtQ-shallow-20181002-090710-djvhg.json 290 download   job
urls-pastebin.com-i5eURG1R-inf-20181001-112326-2coac-00006.warc.gz 2155779681 download   job
urls-pastebin.com-i5eURG1R-inf-20181001-112326-2coac-00006.warc.os.cdx.gz 0 download
urls-pastebin.com-i5eURG1R-inf-20181001-112326-2coac-00007.warc.gz 2147788265 download   job
urls-pastebin.com-i5eURG1R-inf-20181001-112326-2coac-00007.warc.os.cdx.gz 0 download
urls-pastebin.com-i5eURG1R-inf-20181001-112326-2coac-00008.warc.gz 2156832755 download   job
urls-pastebin.com-i5eURG1R-inf-20181001-112326-2coac-00008.warc.os.cdx.gz 0 download
urls-transfer.sh--EFF-tweets-shallow-20181002-073933-8eo01-00000.warc.gz 1342910363 download   job
urls-transfer.sh--EFF-tweets-shallow-20181002-073933-8eo01-00000.warc.os.cdx.gz 5921552 download
urls-transfer.sh--EFF-tweets-shallow-20181002-073933-8eo01-meta.warc.gz 3167274 download   job
urls-transfer.sh--EFF-tweets-shallow-20181002-073933-8eo01-meta.warc.os.cdx.gz 0 download
urls-transfer.sh--EFF-tweets-shallow-20181002-073933-8eo01-urls.txt 897904 download
urls-transfer.sh--EFF-tweets-shallow-20181002-073933-8eo01.json 297 download   job
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00000.warc.gz 536881917 download   job
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00000.warc.os.cdx.gz 622252 download
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00001.warc.gz 536914113 download   job
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00001.warc.os.cdx.gz 0 download
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00002.warc.gz 536908053 download   job
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00002.warc.os.cdx.gz 0 download
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00003.warc.gz 536896843 download   job
urls-transfer.sh--akiman7-tweets-shallow-20181002-081516-es1k4-00003.warc.os.cdx.gz 0 download
urls-transfer.sh--gekikawa_wa-tweets-shallow-20181002-081501-4ab6s.json 313 download   job
urls-transfer.sh--primeraair-tweets-shallow-20181002-095002-umxwd-00000.warc.gz 1535156 download   job
urls-transfer.sh--primeraair-tweets-shallow-20181002-095002-umxwd-00000.warc.os.cdx.gz 0 download
urls-transfer.sh--primeraair-tweets-shallow-20181002-095002-umxwd-meta.warc.gz 8292 download   job
urls-transfer.sh--primeraair-tweets-shallow-20181002-095002-umxwd-meta.warc.os.cdx.gz 0 download
urls-transfer.sh--primeraair-tweets-shallow-20181002-095002-umxwd-urls.txt 464 download
urls-transfer.sh--primeraair-tweets-shallow-20181002-095002-umxwd.json 311 download   job
urls-transfer.sh-AirPrimeraVA-tweets-shallow-20181002-100832-ppdsy-00000.warc.gz 1033973 download   job
urls-transfer.sh-AirPrimeraVA-tweets-shallow-20181002-100832-ppdsy-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-AirPrimeraVA-tweets-shallow-20181002-100832-ppdsy-meta.warc.gz 6306 download   job
urls-transfer.sh-AirPrimeraVA-tweets-shallow-20181002-100832-ppdsy-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-AirPrimeraVA-tweets-shallow-20181002-100832-ppdsy-urls.txt 177 download
urls-transfer.sh-AirPrimeraVA-tweets-shallow-20181002-100832-ppdsy.json 316 download   job
urls-transfer.sh-DontFlyPrimera-tweets-shallow-20181002-100607-cfo46-00000.warc.gz 1031559 download   job
urls-transfer.sh-DontFlyPrimera-tweets-shallow-20181002-100607-cfo46-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-DontFlyPrimera-tweets-shallow-20181002-100607-cfo46-meta.warc.gz 6384 download   job
urls-transfer.sh-DontFlyPrimera-tweets-shallow-20181002-100607-cfo46-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-DontFlyPrimera-tweets-shallow-20181002-100607-cfo46-urls.txt 124 download
urls-transfer.sh-DontFlyPrimera-tweets-shallow-20181002-100607-cfo46.json 318 download   job
urls-transfer.sh-FILE-shallow-20181002-090211-arsjd-00000.warc.gz 2488 download   job
urls-transfer.sh-FILE-shallow-20181002-090211-arsjd-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-FILE-shallow-20181002-090211-arsjd-meta.warc.gz 3320 download   job
urls-transfer.sh-FILE-shallow-20181002-090211-arsjd-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-FILE-shallow-20181002-090211-arsjd-urls.txt 0 download
urls-transfer.sh-FILE-shallow-20181002-090211-arsjd.json 272 download   job
urls-transfer.sh-Primera_unfAIR-tweets-shallow-20181002-100643-bbkl7-00000.warc.gz 7265711 download   job
urls-transfer.sh-Primera_unfAIR-tweets-shallow-20181002-100643-bbkl7-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-Primera_unfAIR-tweets-shallow-20181002-100643-bbkl7-meta.warc.gz 11593 download   job
urls-transfer.sh-Primera_unfAIR-tweets-shallow-20181002-100643-bbkl7-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-Primera_unfAIR-tweets-shallow-20181002-100643-bbkl7-urls.txt 6014 download
urls-transfer.sh-Primera_unfAIR-tweets-shallow-20181002-100643-bbkl7.json 318 download   job
urls-transfer.sh-Primeraarescum-tweets-shallow-20181002-090610-1oh0i-00000.warc.gz 10273543 download   job
urls-transfer.sh-Primeraarescum-tweets-shallow-20181002-090610-1oh0i-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-Primeraarescum-tweets-shallow-20181002-090610-1oh0i-meta.warc.gz 15767 download   job
urls-transfer.sh-Primeraarescum-tweets-shallow-20181002-090610-1oh0i-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-Primeraarescum-tweets-shallow-20181002-090610-1oh0i-urls.txt 8556 download
urls-transfer.sh-Primeraarescum-tweets-shallow-20181002-090610-1oh0i.json 318 download   job
urls-transfer.sh-air_primera-tweets-shallow-20181002-100503-9qacn-00000.warc.gz 4771367 download   job
urls-transfer.sh-air_primera-tweets-shallow-20181002-100503-9qacn-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-air_primera-tweets-shallow-20181002-100503-9qacn-meta.warc.gz 11129 download   job
urls-transfer.sh-air_primera-tweets-shallow-20181002-100503-9qacn-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-air_primera-tweets-shallow-20181002-100503-9qacn-urls.txt 3127 download
urls-transfer.sh-air_primera-tweets-shallow-20181002-100503-9qacn.json 312 download   job
urls-transfer.sh-custome84712881-tweets-shallow-20181002-090519-84j3i-00000.warc.gz 2517 download   job
urls-transfer.sh-custome84712881-tweets-shallow-20181002-090519-84j3i-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-custome84712881-tweets-shallow-20181002-090519-84j3i-meta.warc.gz 3363 download   job
urls-transfer.sh-custome84712881-tweets-shallow-20181002-090519-84j3i-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-custome84712881-tweets-shallow-20181002-090519-84j3i-urls.txt 0 download
urls-transfer.sh-custome84712881-tweets-shallow-20181002-090519-84j3i.json 320 download   job
urls-transfer.sh-donotflyprimera-tweets-shallow-20181002-090544-dn3q6-00000.warc.gz 21716860 download   job
urls-transfer.sh-donotflyprimera-tweets-shallow-20181002-090544-dn3q6-00000.warc.os.cdx.gz 0 download
urls-transfer.sh-donotflyprimera-tweets-shallow-20181002-090544-dn3q6-meta.warc.gz 27949 download   job
urls-transfer.sh-donotflyprimera-tweets-shallow-20181002-090544-dn3q6-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-donotflyprimera-tweets-shallow-20181002-090544-dn3q6-urls.txt 17325 download
urls-transfer.sh-donotflyprimera-tweets-shallow-20181002-090544-dn3q6.json 320 download   job
urls-transfer.sh-mozilla-addons-url-list.txt-shallow-20180930-021207-akifc-00025.warc.gz 2147486404 download   job
urls-transfer.sh-mozilla-addons-url-list.txt-shallow-20180930-021207-akifc-00025.warc.os.cdx.gz 0 download
urls-transfer.sh-primeraair-tweets-shallow-20181002-085940-o6zr3-00000.warc.gz 42697044 download   job
urls-transfer.sh-primeraair-tweets-shallow-20181002-085940-o6zr3-00000.warc.os.cdx.gz 103011 download
urls-transfer.sh-primeraair-tweets-shallow-20181002-085940-o6zr3-meta.warc.gz 59519 download   job
urls-transfer.sh-primeraair-tweets-shallow-20181002-085940-o6zr3-meta.warc.os.cdx.gz 0 download
urls-transfer.sh-primeraair-tweets-shallow-20181002-085940-o6zr3-urls.txt 34727 download
urls-transfer.sh-primeraair-tweets-shallow-20181002-085940-o6zr3.json 310 download   job
uwaterloo.ca-shallow-20181002-111714-3u2fs-meta.warc.gz 10895 download   job
uwaterloo.ca-shallow-20181002-111714-3u2fs-meta.warc.os.cdx.gz 0 download
voat.co-inf-20180912-032959-8n20d-00065.warc.gz 5370858155 download   job
voat.co-inf-20180912-032959-8n20d-00065.warc.os.cdx.gz 0 download
worldufospace.ning.com-shallow-20181002-102319-7u39z-00000.warc.gz 14274 download   job
worldufospace.ning.com-shallow-20181002-102319-7u39z-00000.warc.os.cdx.gz 0 download
worldufospace.ning.com-shallow-20181002-102319-7u39z-meta.warc.gz 3466 download   job
worldufospace.ning.com-shallow-20181002-102319-7u39z-meta.warc.os.cdx.gz 0 download
worldufospace.ning.com-shallow-20181002-102319-7u39z.json 256 download   job
www.680news.com-shallow-20181002-092454-b2s7r-00000.warc.gz 1995746 download   job
www.680news.com-shallow-20181002-092454-b2s7r-00000.warc.os.cdx.gz 0 download
www.680news.com-shallow-20181002-092454-b2s7r-meta.warc.gz 12070 download   job
www.680news.com-shallow-20181002-092454-b2s7r-meta.warc.os.cdx.gz 0 download
www.680news.com-shallow-20181002-092454-b2s7r.json 310 download   job
www.bay12forums.com-inf-20180923-091105-ab2ji-00036.warc.gz 2306158191 download   job
www.bay12forums.com-inf-20180923-091105-ab2ji-00036.warc.os.cdx.gz 0 download
www.bbc.com-shallow-20181002-092808-1h9ed-00000.warc.gz 7915907 download   job
www.bbc.com-shallow-20181002-092808-1h9ed-00000.warc.os.cdx.gz 0 download
www.bbc.com-shallow-20181002-092808-1h9ed-meta.warc.gz 14350 download   job
www.bbc.com-shallow-20181002-092808-1h9ed-meta.warc.os.cdx.gz 0 download
www.bbc.com-shallow-20181002-092808-1h9ed.json 268 download   job
www.betaarchive.com-shallow-20181002-101012-220dh-00000.warc.gz 50551 download   job
www.betaarchive.com-shallow-20181002-101012-220dh-00000.warc.os.cdx.gz 0 download
www.betaarchive.com-shallow-20181002-101012-220dh-meta.warc.gz 3960 download   job
www.betaarchive.com-shallow-20181002-101012-220dh-meta.warc.os.cdx.gz 0 download
www.betaarchive.com-shallow-20181002-101012-220dh.json 536 download   job
www.businessinsider.com.au-shallow-20181002-102507-903dl-00000.warc.gz 153844811 download   job
www.businessinsider.com.au-shallow-20181002-102507-903dl-00000.warc.os.cdx.gz 0 download
www.businessinsider.com.au-shallow-20181002-102507-903dl-meta.warc.gz 22205 download   job
www.businessinsider.com.au-shallow-20181002-102507-903dl-meta.warc.os.cdx.gz 0 download
www.businessinsider.com.au-shallow-20181002-102507-903dl.json 327 download   job
www.cnbc.com-shallow-20181002-102109-cen9t-00000.warc.gz 10486278 download   job
www.cnbc.com-shallow-20181002-102109-cen9t-00000.warc.os.cdx.gz 0 download
www.cnbc.com-shallow-20181002-102109-cen9t-meta.warc.gz 21920 download   job
www.cnbc.com-shallow-20181002-102109-cen9t-meta.warc.os.cdx.gz 0 download
www.cnbc.com-shallow-20181002-102109-cen9t.json 307 download   job
www.freeroms.com-inf-20180816-234655-4ln71-00217.warc.gz 5384683782 download   job
www.freeroms.com-inf-20180816-234655-4ln71-00217.warc.os.cdx.gz 13989 download
www.freeroms.com-inf-20180816-234655-4ln71-00218.warc.gz 5394392150 download   job
www.freeroms.com-inf-20180816-234655-4ln71-00218.warc.os.cdx.gz 22275 download
www.groklaw.net-shallow-20181002-101430-a8m0w-00000.warc.gz 107796 download   job
www.groklaw.net-shallow-20181002-101430-a8m0w-00000.warc.os.cdx.gz 857 download
www.groklaw.net-shallow-20181002-101430-a8m0w-meta.warc.gz 3879 download   job
www.groklaw.net-shallow-20181002-101430-a8m0w-meta.warc.os.cdx.gz 0 download
www.groklaw.net-shallow-20181002-101430-a8m0w.json 284 download   job
www.hhmi.org-shallow-20181002-110510-5rxr9-meta.warc.gz 10694 download   job
www.hhmi.org-shallow-20181002-110510-5rxr9-meta.warc.os.cdx.gz 0 download
www.independent.co.uk-shallow-20181002-092354-b3z44-00000.warc.gz 36671164 download   job
www.independent.co.uk-shallow-20181002-092354-b3z44-00000.warc.os.cdx.gz 22376 download
www.independent.co.uk-shallow-20181002-092354-b3z44-meta.warc.gz 16857 download   job
www.independent.co.uk-shallow-20181002-092354-b3z44-meta.warc.os.cdx.gz 47 download
www.independent.co.uk-shallow-20181002-092354-b3z44.json 376 download   job
www.lds.org-inf-20180925-030149-5t6yn-00092.warc.gz 5382886294 download   job
www.lds.org-inf-20180925-030149-5t6yn-00092.warc.os.cdx.gz 1727 download
www.lds.org-inf-20180925-030149-5t6yn-00093.warc.gz 8123706679 download   job
www.lds.org-inf-20180925-030149-5t6yn-00093.warc.os.cdx.gz 1173 download
www.lds.org-inf-20180925-205550-e9g84-00167.warc.gz 5418157407 download   job
www.lds.org-inf-20180925-205550-e9g84-00167.warc.os.cdx.gz 6918 download
www.lds.org-inf-20180925-205550-e9g84-00169.warc.gz 5391353269 download   job
www.lds.org-inf-20180925-205550-e9g84-00169.warc.os.cdx.gz 16232 download
www.lds.org-inf-20180925-205550-e9g84-00170.warc.gz 5441137115 download   job
www.lds.org-inf-20180925-205550-e9g84-00170.warc.os.cdx.gz 9124 download
www.lds.org-inf-20180925-205550-e9g84-00171.warc.gz 5465044054 download   job
www.lds.org-inf-20180925-205550-e9g84-00171.warc.os.cdx.gz 9220 download
www.lds.org-inf-20180925-205550-e9g84-00172.warc.gz 5370940565 download   job
www.lds.org-inf-20180925-205550-e9g84-00172.warc.os.cdx.gz 8260 download
www.lds.org-inf-20180925-205550-e9g84-00173.warc.gz 5463406657 download   job
www.lds.org-inf-20180925-205550-e9g84-00173.warc.os.cdx.gz 6339 download
www.lds.org-inf-20180929-013437-s21ic-00055.warc.gz 7024938369 download   job
www.lds.org-inf-20180929-013437-s21ic-00055.warc.os.cdx.gz 4671 download
www.lds.org-inf-20180929-013437-s21ic-00056.warc.gz 6139155236 download   job
www.lds.org-inf-20180929-013437-s21ic-00056.warc.os.cdx.gz 1195 download
www.lds.org-inf-20180929-013437-s21ic-00057.warc.gz 8167849127 download   job
www.lds.org-inf-20180929-013437-s21ic-00057.warc.os.cdx.gz 892 download
www.linkedin.com-shallow-20181002-092107-9sotj-00000.warc.gz 9366 download   job
www.linkedin.com-shallow-20181002-092107-9sotj-00000.warc.os.cdx.gz 252 download
www.linkedin.com-shallow-20181002-092107-9sotj-meta.warc.gz 3507 download   job
www.linkedin.com-shallow-20181002-092107-9sotj-meta.warc.os.cdx.gz 47 download
www.linkedin.com-shallow-20181002-092107-9sotj.json 270 download   job
www.manxradio.com-shallow-20181002-102427-bwgak-00000.warc.gz 807176 download   job
www.manxradio.com-shallow-20181002-102427-bwgak-00000.warc.os.cdx.gz 3711 download
www.manxradio.com-shallow-20181002-102427-bwgak-meta.warc.gz 5784 download   job
www.manxradio.com-shallow-20181002-102427-bwgak-meta.warc.os.cdx.gz 47 download
www.manxradio.com-shallow-20181002-102427-bwgak.json 357 download   job
www.ning.com-shallow-20181002-102229-eiv38-00000.warc.gz 3121149 download   job
www.ning.com-shallow-20181002-102229-eiv38-00000.warc.os.cdx.gz 9634 download
www.ning.com-shallow-20181002-102229-eiv38-meta.warc.gz 8541 download   job
www.ning.com-shallow-20181002-102229-eiv38-meta.warc.os.cdx.gz 47 download
www.ning.com-shallow-20181002-102229-eiv38.json 247 download   job
www.ning.com-shallow-20181002-112447-6utcb-00000.warc.gz 2353012 download   job
www.ning.com-shallow-20181002-112447-6utcb-00000.warc.os.cdx.gz 7449 download
www.ning.com-shallow-20181002-112447-6utcb-meta.warc.gz 7638 download   job
www.ning.com-shallow-20181002-112447-6utcb-meta.warc.os.cdx.gz 47 download
www.ning.com-shallow-20181002-112447-6utcb.json 311 download   job
www.nobelprize.org-shallow-20181002-111204-4e3sy-meta.warc.gz 9709 download   job
www.nobelprize.org-shallow-20181002-111204-4e3sy-meta.warc.os.cdx.gz 47 download
www.nyx.cz-inf-20180929-112346-da0pa-00006.warc.gz 2147523376 download   job
www.nyx.cz-inf-20180929-112346-da0pa-00006.warc.os.cdx.gz 3709784 download
www.oireachtas.ie-shallow-20181002-102811-d1zk2-00000.warc.gz 1668480 download   job
www.oireachtas.ie-shallow-20181002-102811-d1zk2-00000.warc.os.cdx.gz 5344 download
www.oireachtas.ie-shallow-20181002-102811-d1zk2-meta.warc.gz 6510 download   job
www.oireachtas.ie-shallow-20181002-102811-d1zk2-meta.warc.os.cdx.gz 47 download
www.oireachtas.ie-shallow-20181002-102811-d1zk2.json 274 download   job
www.onlyinyourstate.com-inf-20181001-233423-ar52z-00000.warc.gz 5368757569 download   job
www.onlyinyourstate.com-inf-20181001-233423-ar52z-00000.warc.os.cdx.gz 1650808 download
www.racked.com-inf-20180923-152706-1zhut-00088.warc.gz 2147488999 download   job
www.racked.com-inf-20180923-152706-1zhut-00088.warc.os.cdx.gz 1356958 download
www.racked.com-inf-20180923-152706-1zhut-00089.warc.gz 2147498944 download   job
www.racked.com-inf-20180923-152706-1zhut-00089.warc.os.cdx.gz 2761125 download
www.senato.it-shallow-20181002-104555-6h3ej-00000.warc.gz 7158378 download   job
www.senato.it-shallow-20181002-104555-6h3ej-00000.warc.os.cdx.gz 14990 download
www.senato.it-shallow-20181002-104555-6h3ej-meta.warc.gz 11724 download   job
www.senato.it-shallow-20181002-104555-6h3ej-meta.warc.os.cdx.gz 47 download
www.senato.it-shallow-20181002-104555-6h3ej.json 284 download   job
www.standard.co.uk-shallow-20181002-092255-c7gps-00000.warc.gz 3113370 download   job
www.standard.co.uk-shallow-20181002-092255-c7gps-00000.warc.os.cdx.gz 9702 download
www.standard.co.uk-shallow-20181002-092255-c7gps-meta.warc.gz 9686 download   job
www.standard.co.uk-shallow-20181002-092255-c7gps-meta.warc.os.cdx.gz 47 download
www.standard.co.uk-shallow-20181002-092255-c7gps.json 377 download   job
www.telegraph.co.uk-shallow-20181002-092240-c1ji4-00000.warc.gz 6581874 download   job
www.telegraph.co.uk-shallow-20181002-092240-c1ji4-00000.warc.os.cdx.gz 37713 download
www.telegraph.co.uk-shallow-20181002-092240-c1ji4-meta.warc.gz 31901 download   job
www.telegraph.co.uk-shallow-20181002-092240-c1ji4-meta.warc.os.cdx.gz 47 download
www.telegraph.co.uk-shallow-20181002-092240-c1ji4.json 321 download   job
www.theguardian.com-shallow-20181002-092201-d2fom-00000.warc.gz 530079 download   job
www.theguardian.com-shallow-20181002-092201-d2fom-00000.warc.os.cdx.gz 4076 download
www.theguardian.com-shallow-20181002-092201-d2fom-meta.warc.gz 6699 download   job
www.theguardian.com-shallow-20181002-092201-d2fom-meta.warc.os.cdx.gz 47 download
www.theguardian.com-shallow-20181002-092201-d2fom.json 336 download   job
www.travelandleisure.com-shallow-20181002-092155-924qr-00000.warc.gz 1194428 download   job
www.travelandleisure.com-shallow-20181002-092155-924qr-00000.warc.os.cdx.gz 4378 download
www.travelandleisure.com-shallow-20181002-092155-924qr-meta.warc.gz 6168 download   job
www.travelandleisure.com-shallow-20181002-092155-924qr-meta.warc.os.cdx.gz 47 download
www.travelandleisure.com-shallow-20181002-092155-924qr.json 307 download   job
www.whitehouse.gov-shallow-20181002-101728-99rq1-00000.warc.gz 3816 download   job
www.whitehouse.gov-shallow-20181002-101728-99rq1-00000.warc.os.cdx.gz 264 download
www.whitehouse.gov-shallow-20181002-101728-99rq1-meta.warc.gz 3493 download   job
www.whitehouse.gov-shallow-20181002-101728-99rq1-meta.warc.os.cdx.gz 47 download
www.whitehouse.gov-shallow-20181002-101728-99rq1.json 340 download   job
www.whitehouse.gov-shallow-20181002-101742-2t5xv-00000.warc.gz 1982456 download   job
www.whitehouse.gov-shallow-20181002-101742-2t5xv-00000.warc.os.cdx.gz 8316 download
www.whitehouse.gov-shallow-20181002-101742-2t5xv-meta.warc.gz 8235 download   job
www.whitehouse.gov-shallow-20181002-101742-2t5xv-meta.warc.os.cdx.gz 47 download
www.whitehouse.gov-shallow-20181002-101742-2t5xv.json 371 download   job
www.whitehouse.gov-shallow-20181002-101813-9ghpk-00000.warc.gz 1992063 download   job
www.whitehouse.gov-shallow-20181002-101813-9ghpk-00000.warc.os.cdx.gz 8274 download
www.whitehouse.gov-shallow-20181002-101813-9ghpk-meta.warc.gz 8129 download   job
www.whitehouse.gov-shallow-20181002-101813-9ghpk-meta.warc.os.cdx.gz 47 download
www.whitehouse.gov-shallow-20181002-101813-9ghpk.json 348 download   job
www.writing.com-inf-20180916-180157-3qe7c-00015.warc.gz 5368724651 download   job
www.writing.com-inf-20180916-180157-3qe7c-00015.warc.os.cdx.gz 14085704 download
www2.mfour.med.kyoto-u.ac.jp-shallow-20181002-120907-2qrpe-meta.warc.gz 3916 download   job
www2.mfour.med.kyoto-u.ac.jp-shallow-20181002-120907-2qrpe-meta.warc.os.cdx.gz 47 download