Item archiveteam_archivebot_go_20160117200002

View on Internet Archive

Filename Size
00000_Header.png 798954 download
00000_Header_thumb.jpg 4653 download
__ia_thumb.jpg 9272 download
america.aljazeera.com-inf-20160113-202519-26xyb-00002.warc.gz 5368712204 download   job
america.aljazeera.com-inf-20160113-202519-26xyb-00002.warc.gz.png 89294 download
america.aljazeera.com-inf-20160113-202519-26xyb-00002.warc.gz_thumb.jpg 2571 download
america.aljazeera.com-inf-20160113-202519-26xyb-00002.warc.os.cdx.gz 1972977 download
america.aljazeera.com-inf-20160113-202519-26xyb-00003.warc.gz 5368823728 download   job
america.aljazeera.com-inf-20160113-202519-26xyb-00003.warc.gz.png 88115 download
america.aljazeera.com-inf-20160113-202519-26xyb-00003.warc.gz_thumb.jpg 2561 download
america.aljazeera.com-inf-20160113-202519-26xyb-00003.warc.os.cdx.gz 1960953 download
america.aljazeera.com-inf-20160113-202519-26xyb-00004.warc.gz 5369181462 download   job
america.aljazeera.com-inf-20160113-202519-26xyb-00004.warc.gz.png 60948 download
america.aljazeera.com-inf-20160113-202519-26xyb-00004.warc.gz_thumb.jpg 1915 download
america.aljazeera.com-inf-20160113-202519-26xyb-00004.warc.os.cdx.gz 2132045 download
america.aljazeera.com-shallow-20160117-025243-4kpp5-00000.warc.gz 8193902 download   job
america.aljazeera.com-shallow-20160117-025243-4kpp5-00000.warc.gz.png 101364 download
america.aljazeera.com-shallow-20160117-025243-4kpp5-00000.warc.gz_thumb.jpg 2611 download
america.aljazeera.com-shallow-20160117-025243-4kpp5-00000.warc.os.cdx.gz 8496 download
america.aljazeera.com-shallow-20160117-025243-4kpp5-meta.warc.gz 8880 download   job
america.aljazeera.com-shallow-20160117-025243-4kpp5-meta.warc.os.cdx.gz 47 download
america.aljazeera.com-shallow-20160117-025243-4kpp5.json 311 download   job
archiveteam_archivebot_go_20160117200002.cdx.gz 63082815 download
archiveteam_archivebot_go_20160117200002.cdx.idx 67562 download
archiveteam_archivebot_go_20160117200002_archive.torrent 683204 download
archiveteam_archivebot_go_20160117200002_files.xml 0 download
archiveteam_archivebot_go_20160117200002_meta.sqlite 445440 download
archiveteam_archivebot_go_20160117200002_meta.xml 1005 download
arstechnica.com-shallow-20160117-181815-huat9-00000.warc.gz 1901338 download   job
arstechnica.com-shallow-20160117-181815-huat9-00000.warc.gz.png 67615 download
arstechnica.com-shallow-20160117-181815-huat9-00000.warc.gz_thumb.jpg 1953 download
arstechnica.com-shallow-20160117-181815-huat9-00000.warc.os.cdx.gz 9619 download
arstechnica.com-shallow-20160117-181815-huat9-meta.warc.gz 9382 download   job
arstechnica.com-shallow-20160117-181815-huat9-meta.warc.os.cdx.gz 47 download
arstechnica.com-shallow-20160117-181815-huat9.json 332 download   job
college-campuses.tumblr.com-inf-20160116-201809-3jo21-00000.warc.gz 1912812568 download   job
college-campuses.tumblr.com-inf-20160116-201809-3jo21-00000.warc.gz.png 297710 download
college-campuses.tumblr.com-inf-20160116-201809-3jo21-00000.warc.gz_thumb.jpg 3366 download
college-campuses.tumblr.com-inf-20160116-201809-3jo21-00000.warc.os.cdx.gz 1564861 download
college-campuses.tumblr.com-inf-20160116-201809-3jo21-meta.warc.gz 22301828 download   job
college-campuses.tumblr.com-inf-20160116-201809-3jo21-meta.warc.os.cdx.gz 47 download
college-campuses.tumblr.com-inf-20160116-201809-3jo21.json 254 download   job
delimiter.com.au-shallow-20160117-005705-az5hu-00000.warc.gz 3835359 download   job
delimiter.com.au-shallow-20160117-005705-az5hu-00000.warc.os.cdx.gz 24654 download
delimiter.com.au-shallow-20160117-005705-az5hu-meta.warc.gz 18096 download   job
delimiter.com.au-shallow-20160117-005705-az5hu-meta.warc.os.cdx.gz 47 download
delimiter.com.au-shallow-20160117-005705-az5hu.json 329 download   job
disabled-learning.tumblr.com-inf-20160116-061011-dyj26-00000.warc.gz 194529041 download   job
disabled-learning.tumblr.com-inf-20160116-061011-dyj26-00000.warc.gz.png 134293 download
disabled-learning.tumblr.com-inf-20160116-061011-dyj26-00000.warc.gz_thumb.jpg 2183 download
disabled-learning.tumblr.com-inf-20160116-061011-dyj26-00000.warc.os.cdx.gz 1297113 download
disabled-learning.tumblr.com-inf-20160116-061011-dyj26-meta.warc.gz 57050632 download   job
disabled-learning.tumblr.com-inf-20160116-061011-dyj26-meta.warc.os.cdx.gz 47 download
disabled-learning.tumblr.com-inf-20160116-061011-dyj26.json 255 download   job
ds.ccc.de-inf-20160116-042534-brm2s-00000.warc.gz 294252813 download   job
ds.ccc.de-inf-20160116-042534-brm2s-00000.warc.gz.png 507897 download
ds.ccc.de-inf-20160116-042534-brm2s-00000.warc.gz_thumb.jpg 5983 download
ds.ccc.de-inf-20160116-042534-brm2s-00000.warc.os.cdx.gz 23560 download
ds.ccc.de-inf-20160116-042534-brm2s-meta.warc.gz 16306 download   job
ds.ccc.de-inf-20160116-042534-brm2s-meta.warc.os.cdx.gz 47 download
ds.ccc.de-inf-20160116-042534-brm2s.json 251 download   job
feeds.feedburner.com-shallow-20160116-014223-3wjdi-00000.warc.gz 174226 download   job
feeds.feedburner.com-shallow-20160116-014223-3wjdi-00000.warc.os.cdx.gz 236 download
feeds.feedburner.com-shallow-20160116-014223-3wjdi-meta.warc.gz 3177 download   job
feeds.feedburner.com-shallow-20160116-014223-3wjdi-meta.warc.os.cdx.gz 47 download
feeds.feedburner.com-shallow-20160116-014223-3wjdi.json 270 download   job
github.com-shallow-20160115-182049-6ay4e-meta.warc.gz 5920 download   job
github.com-shallow-20160115-182049-6ay4e-meta.warc.os.cdx.gz 47 download
hackaday.com-shallow-20160116-042853-bkk7v-00000.warc.gz 1450184 download   job
hackaday.com-shallow-20160116-042853-bkk7v-00000.warc.gz.png 49421 download
hackaday.com-shallow-20160116-042853-bkk7v-00000.warc.gz_thumb.jpg 1588 download
hackaday.com-shallow-20160116-042853-bkk7v-00000.warc.os.cdx.gz 6019 download
hackaday.com-shallow-20160116-042853-bkk7v-meta.warc.gz 7140 download   job
hackaday.com-shallow-20160116-042853-bkk7v-meta.warc.os.cdx.gz 47 download
hackaday.com-shallow-20160116-042853-bkk7v.json 304 download   job
i.4cdn.org-shallow-20160117-004906-8vz1h-00000.warc.gz 75899 download   job
i.4cdn.org-shallow-20160117-004906-8vz1h-00000.warc.os.cdx.gz 222 download
i.4cdn.org-shallow-20160117-004906-8vz1h-meta.warc.gz 3132 download   job
i.4cdn.org-shallow-20160117-004906-8vz1h-meta.warc.os.cdx.gz 47 download
i.4cdn.org-shallow-20160117-004906-8vz1h.json 261 download   job
i100.independent.co.uk-shallow-20160116-044102-e1ct5-00000.warc.gz 3924542 download   job
i100.independent.co.uk-shallow-20160116-044102-e1ct5-00000.warc.gz.png 350904 download
i100.independent.co.uk-shallow-20160116-044102-e1ct5-00000.warc.gz_thumb.jpg 3686 download
i100.independent.co.uk-shallow-20160116-044102-e1ct5-00000.warc.os.cdx.gz 4532 download
i100.independent.co.uk-shallow-20160116-044102-e1ct5-meta.warc.gz 5916 download   job
i100.independent.co.uk-shallow-20160116-044102-e1ct5-meta.warc.os.cdx.gz 47 download
i100.independent.co.uk-shallow-20160116-044102-e1ct5.json 356 download   job
icantdrawworthshit.tumblr.com-inf-20160116-194136-dhq80-00000.warc.gz 994046821 download   job
icantdrawworthshit.tumblr.com-inf-20160116-194136-dhq80-00000.warc.gz.png 301085 download
icantdrawworthshit.tumblr.com-inf-20160116-194136-dhq80-00000.warc.gz_thumb.jpg 2473 download
icantdrawworthshit.tumblr.com-inf-20160116-194136-dhq80-00000.warc.os.cdx.gz 779527 download
icantdrawworthshit.tumblr.com-inf-20160116-194136-dhq80-meta.warc.gz 9675120 download   job
icantdrawworthshit.tumblr.com-inf-20160116-194136-dhq80-meta.warc.os.cdx.gz 47 download
icantdrawworthshit.tumblr.com-inf-20160116-194136-dhq80.json 256 download   job
iffund.net-inf-20160115-223250-ccaf2-00000.warc.gz 289518253 download   job
iffund.net-inf-20160115-223250-ccaf2-00000.warc.gz.png 109249 download
iffund.net-inf-20160115-223250-ccaf2-00000.warc.gz_thumb.jpg 3378 download
iffund.net-inf-20160115-223250-ccaf2-00000.warc.os.cdx.gz 153316 download
iffund.net-inf-20160115-223250-ccaf2-meta.warc.gz 96614 download   job
iffund.net-inf-20160115-223250-ccaf2-meta.warc.os.cdx.gz 47 download
iffund.net-inf-20160115-223250-ccaf2.json 238 download   job
indiewebcamp.com-shallow-20160116-073340-aan31-00000.warc.gz 574331 download   job
indiewebcamp.com-shallow-20160116-073340-aan31-00000.warc.gz.png 224969 download
indiewebcamp.com-shallow-20160116-073340-aan31-00000.warc.gz_thumb.jpg 3434 download
indiewebcamp.com-shallow-20160116-073340-aan31-00000.warc.os.cdx.gz 4505 download
indiewebcamp.com-shallow-20160116-073340-aan31-meta.warc.gz 5808 download   job
indiewebcamp.com-shallow-20160116-073340-aan31-meta.warc.os.cdx.gz 47 download
indiewebcamp.com-shallow-20160116-073340-aan31.json 256 download   job
ironychan.tumblr.com-shallow-20160116-030905-btm94-00000.warc.gz 1958072 download   job
ironychan.tumblr.com-shallow-20160116-030905-btm94-00000.warc.gz.png 188877 download
ironychan.tumblr.com-shallow-20160116-030905-btm94-00000.warc.gz_thumb.jpg 2593 download
ironychan.tumblr.com-shallow-20160116-030905-btm94-00000.warc.os.cdx.gz 7403 download
ironychan.tumblr.com-shallow-20160116-030905-btm94-meta.warc.gz 7431 download   job
ironychan.tumblr.com-shallow-20160116-030905-btm94-meta.warc.os.cdx.gz 47 download
ironychan.tumblr.com-shallow-20160116-030905-btm94.json 307 download   job
just-open-the-book.tumblr.com-inf-20160117-060008-cq2wz-00000.warc.gz 557930465 download   job
just-open-the-book.tumblr.com-inf-20160117-060008-cq2wz-00000.warc.os.cdx.gz 2222235 download
just-open-the-book.tumblr.com-inf-20160117-060008-cq2wz-meta.warc.gz 6405630 download   job
just-open-the-book.tumblr.com-inf-20160117-060008-cq2wz-meta.warc.os.cdx.gz 47 download
just-open-the-book.tumblr.com-inf-20160117-060008-cq2wz.json 256 download   job
kingofdersecest.tumblr.com-shallow-20160116-194033-s56bp-00000.warc.gz 1138223 download   job
kingofdersecest.tumblr.com-shallow-20160116-194033-s56bp-00000.warc.gz.png 115212 download
kingofdersecest.tumblr.com-shallow-20160116-194033-s56bp-00000.warc.gz_thumb.jpg 3154 download
kingofdersecest.tumblr.com-shallow-20160116-194033-s56bp-00000.warc.os.cdx.gz 3422 download
kingofdersecest.tumblr.com-shallow-20160116-194033-s56bp-meta.warc.gz 5252 download   job
kingofdersecest.tumblr.com-shallow-20160116-194033-s56bp-meta.warc.os.cdx.gz 47 download
kingofdersecest.tumblr.com-shallow-20160116-194033-s56bp.json 320 download   job
last-starfighter.tumblr.com-inf-20160117-045536-9peal-00000.warc.gz 2000237641 download   job
last-starfighter.tumblr.com-inf-20160117-045536-9peal-00000.warc.gz.png 46558 download
last-starfighter.tumblr.com-inf-20160117-045536-9peal-00000.warc.gz_thumb.jpg 2520 download
last-starfighter.tumblr.com-inf-20160117-045536-9peal-00000.warc.os.cdx.gz 800002 download
last-starfighter.tumblr.com-inf-20160117-045536-9peal-meta.warc.gz 19476795 download   job
last-starfighter.tumblr.com-inf-20160117-045536-9peal-meta.warc.os.cdx.gz 47 download
last-starfighter.tumblr.com-inf-20160117-045536-9peal.json 254 download   job
libgen.io-inf-20160116-061420-9kfh0-00000.warc.gz 104270949 download   job
libgen.io-inf-20160116-061420-9kfh0-00000.warc.gz.png 205244 download
libgen.io-inf-20160116-061420-9kfh0-00000.warc.gz_thumb.jpg 2444 download
libgen.io-inf-20160116-061420-9kfh0-00000.warc.os.cdx.gz 71222 download
libgen.io-inf-20160116-061420-9kfh0-meta.warc.gz 37907 download   job
libgen.io-inf-20160116-061420-9kfh0-meta.warc.os.cdx.gz 47 download
libgen.io-inf-20160116-061420-9kfh0.json 268 download   job
lwn.net-shallow-20160116-062049-9ycwi-00000.warc.gz 40283 download   job
lwn.net-shallow-20160116-062049-9ycwi-00000.warc.gz.png 235408 download
lwn.net-shallow-20160116-062049-9ycwi-00000.warc.gz_thumb.jpg 3690 download
lwn.net-shallow-20160116-062049-9ycwi-00000.warc.os.cdx.gz 655 download
lwn.net-shallow-20160116-062049-9ycwi-meta.warc.gz 3410 download   job
lwn.net-shallow-20160116-062049-9ycwi-meta.warc.os.cdx.gz 47 download
lwn.net-shallow-20160116-062049-9ycwi.json 280 download   job
mail.mozilla.org-shallow-20160116-074320-2t37s-00000.warc.gz 5730 download   job
mail.mozilla.org-shallow-20160116-074320-2t37s-00000.warc.gz.png 144866 download
mail.mozilla.org-shallow-20160116-074320-2t37s-00000.warc.gz_thumb.jpg 2837 download
mail.mozilla.org-shallow-20160116-074320-2t37s-00000.warc.os.cdx.gz 238 download
mail.mozilla.org-shallow-20160116-074320-2t37s-meta.warc.gz 3164 download   job
mail.mozilla.org-shallow-20160116-074320-2t37s-meta.warc.os.cdx.gz 47 download
mail.mozilla.org-shallow-20160116-074320-2t37s.json 288 download   job
mattwilkens.com-shallow-20160116-061720-7f9vw-00000.warc.gz 898144 download   job
mattwilkens.com-shallow-20160116-061720-7f9vw-00000.warc.gz.png 325737 download
mattwilkens.com-shallow-20160116-061720-7f9vw-00000.warc.gz_thumb.jpg 3361 download
mattwilkens.com-shallow-20160116-061720-7f9vw-00000.warc.os.cdx.gz 7762 download
mattwilkens.com-shallow-20160116-061720-7f9vw-meta.warc.gz 7963 download   job
mattwilkens.com-shallow-20160116-061720-7f9vw-meta.warc.os.cdx.gz 47 download
mattwilkens.com-shallow-20160116-061720-7f9vw.json 295 download   job
medium.com-shallow-20160115-214232-2q8jk-00000.warc.gz 8716918 download   job
medium.com-shallow-20160115-214232-2q8jk-00000.warc.gz.png 115992 download
medium.com-shallow-20160115-214232-2q8jk-00000.warc.gz_thumb.jpg 2655 download
medium.com-shallow-20160115-214232-2q8jk-00000.warc.os.cdx.gz 8641 download
medium.com-shallow-20160115-214232-2q8jk-meta.warc.gz 8549 download   job
medium.com-shallow-20160115-214232-2q8jk-meta.warc.os.cdx.gz 47 download
medium.com-shallow-20160115-214232-2q8jk.json 318 download   job
medium.com-shallow-20160116-073759-eiluq-00000.warc.gz 10750938 download   job
medium.com-shallow-20160116-073759-eiluq-00000.warc.gz.png 61478 download
medium.com-shallow-20160116-073759-eiluq-00000.warc.gz_thumb.jpg 1981 download
medium.com-shallow-20160116-073759-eiluq-00000.warc.os.cdx.gz 9022 download
medium.com-shallow-20160116-073759-eiluq-meta.warc.gz 9058 download   job
medium.com-shallow-20160116-073759-eiluq-meta.warc.os.cdx.gz 47 download
medium.com-shallow-20160116-073759-eiluq.json 289 download   job
neurodivergentexchange.tumblr.com-inf-20160116-202434-7oj1w-00000.warc.gz 42957203 download   job
neurodivergentexchange.tumblr.com-inf-20160116-202434-7oj1w-00000.warc.gz.png 103889 download
neurodivergentexchange.tumblr.com-inf-20160116-202434-7oj1w-00000.warc.gz_thumb.jpg 3412 download
neurodivergentexchange.tumblr.com-inf-20160116-202434-7oj1w-00000.warc.os.cdx.gz 189138 download
neurodivergentexchange.tumblr.com-inf-20160116-202434-7oj1w-meta.warc.gz 7178895 download   job
neurodivergentexchange.tumblr.com-inf-20160116-202434-7oj1w-meta.warc.os.cdx.gz 47 download
neurodivergentexchange.tumblr.com-inf-20160116-202434-7oj1w.json 260 download   job
np.reddit.com-shallow-20160117-011213-1up2r-00000.warc.gz 1730488 download   job
np.reddit.com-shallow-20160117-011213-1up2r-00000.warc.gz.png 413249 download
np.reddit.com-shallow-20160117-011213-1up2r-00000.warc.gz_thumb.jpg 4347 download
np.reddit.com-shallow-20160117-011213-1up2r-00000.warc.os.cdx.gz 8839 download
np.reddit.com-shallow-20160117-011213-1up2r-meta.warc.gz 8243 download   job
np.reddit.com-shallow-20160117-011213-1up2r-meta.warc.os.cdx.gz 47 download
np.reddit.com-shallow-20160117-011213-1up2r.json 337 download   job
paste.ee-shallow-20160117-005251-dxnql-00000.warc.gz 5443 download   job
paste.ee-shallow-20160117-005251-dxnql-00000.warc.os.cdx.gz 211 download
paste.ee-shallow-20160117-005251-dxnql-meta.warc.gz 3156 download   job
paste.ee-shallow-20160117-005251-dxnql-meta.warc.os.cdx.gz 47 download
paste.ee-shallow-20160117-005251-dxnql.json 245 download   job
pbs.twimg.com-shallow-20160117-005502-4oqlp-00000.warc.gz 214404 download   job
pbs.twimg.com-shallow-20160117-005502-4oqlp-00000.warc.os.cdx.gz 242 download
pbs.twimg.com-shallow-20160117-005502-4oqlp-meta.warc.gz 3156 download   job
pbs.twimg.com-shallow-20160117-005502-4oqlp-meta.warc.os.cdx.gz 47 download
pbs.twimg.com-shallow-20160117-005502-4oqlp.json 268 download   job
pix11.com-shallow-20160115-204624-2h2i0-00000.warc.gz 12884797 download   job
pix11.com-shallow-20160115-204624-2h2i0-00000.warc.gz.png 71694 download
pix11.com-shallow-20160115-204624-2h2i0-00000.warc.gz_thumb.jpg 2191 download
pix11.com-shallow-20160115-204624-2h2i0-00000.warc.os.cdx.gz 15652 download
pix11.com-shallow-20160115-204624-2h2i0-meta.warc.gz 13052 download   job
pix11.com-shallow-20160115-204624-2h2i0-meta.warc.os.cdx.gz 47 download
pix11.com-shallow-20160115-204624-2h2i0.json 298 download   job
praacticalaac.org-inf-20160114-172936-7d4q9-00000.warc.gz 5972467905 download   job
praacticalaac.org-inf-20160114-172936-7d4q9-00000.warc.os.cdx.gz 8296960 download
quaalud.es-inf-20160117-021203-3mksu-00000.warc.gz 1909193704 download   job
quaalud.es-inf-20160117-021203-3mksu-00000.warc.gz.png 39551 download
quaalud.es-inf-20160117-021203-3mksu-00000.warc.gz_thumb.jpg 1523 download
quaalud.es-inf-20160117-021203-3mksu-00000.warc.os.cdx.gz 514 download
quaalud.es-inf-20160117-021203-3mksu-meta.warc.gz 3469 download   job
quaalud.es-inf-20160117-021203-3mksu-meta.warc.os.cdx.gz 47 download
quaalud.es-inf-20160117-021203-3mksu.json 264 download   job
quaalud.es-shallow-20160117-021108-3mksu-00000.warc.gz 3871 download   job
quaalud.es-shallow-20160117-021108-3mksu-00000.warc.gz.png 39551 download
quaalud.es-shallow-20160117-021108-3mksu-00000.warc.gz_thumb.jpg 1523 download
quaalud.es-shallow-20160117-021108-3mksu-00000.warc.os.cdx.gz 221 download
quaalud.es-shallow-20160117-021108-3mksu-meta.warc.gz 3140 download   job
quaalud.es-shallow-20160117-021108-3mksu-meta.warc.os.cdx.gz 47 download
quaalud.es-shallow-20160117-021108-3mksu.json 268 download   job
sanfrancisco.cbslocal.com-shallow-20160116-073005-6xfn6-00000.warc.gz 4549093 download   job
sanfrancisco.cbslocal.com-shallow-20160116-073005-6xfn6-00000.warc.gz.png 500962 download
sanfrancisco.cbslocal.com-shallow-20160116-073005-6xfn6-00000.warc.gz_thumb.jpg 4963 download
sanfrancisco.cbslocal.com-shallow-20160116-073005-6xfn6-00000.warc.os.cdx.gz 29157 download
sanfrancisco.cbslocal.com-shallow-20160116-073005-6xfn6-meta.warc.gz 21684 download   job
sanfrancisco.cbslocal.com-shallow-20160116-073005-6xfn6-meta.warc.os.cdx.gz 47 download
sanfrancisco.cbslocal.com-shallow-20160116-073005-6xfn6.json 361 download   job
sci-hub.cc-shallow-20160116-063448-9y680-00000.warc.gz 61523 download   job
sci-hub.cc-shallow-20160116-063448-9y680-00000.warc.os.cdx.gz 219 download
sci-hub.cc-shallow-20160116-063448-9y680-meta.warc.gz 3127 download   job
sci-hub.cc-shallow-20160116-063448-9y680-meta.warc.os.cdx.gz 47 download
sci-hub.cc-shallow-20160116-063448-9y680.json 259 download   job
sfbay.ca-shallow-20160116-073028-4q1xl-00000.warc.gz 2408619 download   job
sfbay.ca-shallow-20160116-073028-4q1xl-00000.warc.os.cdx.gz 11119 download
sfbay.ca-shallow-20160116-073028-4q1xl-meta.warc.gz 11055 download   job
sfbay.ca-shallow-20160116-073028-4q1xl-meta.warc.os.cdx.gz 47 download
sfbay.ca-shallow-20160116-073028-4q1xl.json 308 download   job
techaeris.com-shallow-20160116-072948-dwnqi-00000.warc.gz 3483561 download   job
techaeris.com-shallow-20160116-072948-dwnqi-00000.warc.os.cdx.gz 15281 download
techaeris.com-shallow-20160116-072948-dwnqi-meta.warc.gz 12605 download   job
techaeris.com-shallow-20160116-072948-dwnqi-meta.warc.os.cdx.gz 47 download
techaeris.com-shallow-20160116-072948-dwnqi.json 318 download   job
techaeris.com-shallow-20160116-073108-8zshw-00000.warc.gz 3224120 download   job
techaeris.com-shallow-20160116-073108-8zshw-00000.warc.os.cdx.gz 14678 download
techaeris.com-shallow-20160116-073108-8zshw-meta.warc.gz 11778 download   job
techaeris.com-shallow-20160116-073108-8zshw-meta.warc.os.cdx.gz 47 download
techaeris.com-shallow-20160116-073108-8zshw.json 305 download   job
theconversation.com-shallow-20160116-063913-edecb-00000.warc.gz 2091663 download   job
theconversation.com-shallow-20160116-063913-edecb-00000.warc.gz.png 173867 download
theconversation.com-shallow-20160116-063913-edecb-00000.warc.gz_thumb.jpg 3680 download
theconversation.com-shallow-20160116-063913-edecb-00000.warc.os.cdx.gz 8696 download
theconversation.com-shallow-20160116-063913-edecb-meta.warc.gz 9462 download   job
theconversation.com-shallow-20160116-063913-edecb-meta.warc.os.cdx.gz 47 download
theconversation.com-shallow-20160116-063913-edecb.json 334 download   job
tonyarcieri.com-shallow-20160115-230506-6m4ec-00000.warc.gz 800413 download   job
tonyarcieri.com-shallow-20160115-230506-6m4ec-00000.warc.gz.png 415337 download
tonyarcieri.com-shallow-20160115-230506-6m4ec-00000.warc.gz_thumb.jpg 4084 download
tonyarcieri.com-shallow-20160115-230506-6m4ec-00000.warc.os.cdx.gz 2199 download
tonyarcieri.com-shallow-20160115-230506-6m4ec-meta.warc.gz 4582 download   job
tonyarcieri.com-shallow-20160115-230506-6m4ec-meta.warc.os.cdx.gz 47 download
tonyarcieri.com-shallow-20160115-230506-6m4ec.json 290 download   job
tucson.com-shallow-20160117-004526-bw0ks-00000.warc.gz 2776514 download   job
tucson.com-shallow-20160117-004526-bw0ks-00000.warc.os.cdx.gz 15532 download
tucson.com-shallow-20160117-004526-bw0ks-meta.warc.gz 13249 download   job
tucson.com-shallow-20160117-004526-bw0ks-meta.warc.os.cdx.gz 47 download
tucson.com-shallow-20160117-004526-bw0ks.json 360 download   job
twitter.com-inf-20160115-033220-2pppg-00000.warc.gz 2887962248 download   job
twitter.com-inf-20160115-033220-2pppg-00000.warc.os.cdx.gz 3538602 download
twitter.com-inf-20160115-033220-2pppg-meta.warc.gz 7464397 download   job
twitter.com-inf-20160115-033220-2pppg-meta.warc.os.cdx.gz 47 download
twitter.com-inf-20160115-033220-2pppg.json 245 download   job
twitter.com-shallow-20160117-005354-4d0x0-00000.warc.gz 4238343 download   job
twitter.com-shallow-20160117-005354-4d0x0-00000.warc.gz.png 338588 download
twitter.com-shallow-20160117-005354-4d0x0-00000.warc.gz_thumb.jpg 3698 download
twitter.com-shallow-20160117-005354-4d0x0-00000.warc.os.cdx.gz 8906 download
twitter.com-shallow-20160117-005354-4d0x0-meta.warc.gz 9054 download   job
twitter.com-shallow-20160117-005354-4d0x0-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20160117-005354-4d0x0.json 279 download   job
twitter.com-shallow-20160117-005514-91z6z-00000.warc.gz 3926557 download   job
twitter.com-shallow-20160117-005514-91z6z-00000.warc.gz.png 159442 download
twitter.com-shallow-20160117-005514-91z6z-00000.warc.gz_thumb.jpg 3395 download
twitter.com-shallow-20160117-005514-91z6z-00000.warc.os.cdx.gz 7605 download
twitter.com-shallow-20160117-005514-91z6z-meta.warc.gz 8079 download   job
twitter.com-shallow-20160117-005514-91z6z-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20160117-005514-91z6z.json 279 download   job
twitter.com-shallow-20160117-025440-djnnw-00000.warc.gz 6380475 download   job
twitter.com-shallow-20160117-025440-djnnw-00000.warc.gz.png 798954 download
twitter.com-shallow-20160117-025440-djnnw-00000.warc.gz_thumb.jpg 4653 download
twitter.com-shallow-20160117-025440-djnnw-00000.warc.os.cdx.gz 8548 download
twitter.com-shallow-20160117-025440-djnnw-meta.warc.gz 8825 download   job
twitter.com-shallow-20160117-025440-djnnw-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20160117-025440-djnnw.json 276 download   job
twitter.com-shallow-20160117-064113-5dcs5-00000.warc.gz 3301942 download   job
twitter.com-shallow-20160117-064113-5dcs5-00000.warc.gz.png 457307 download
twitter.com-shallow-20160117-064113-5dcs5-00000.warc.gz_thumb.jpg 3947 download
twitter.com-shallow-20160117-064113-5dcs5-00000.warc.os.cdx.gz 6853 download
twitter.com-shallow-20160117-064113-5dcs5-meta.warc.gz 7235 download   job
twitter.com-shallow-20160117-064113-5dcs5-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20160117-064113-5dcs5.json 275 download   job
virtuallyfun.superglobalmegacorp.com-shallow-20160117-005149-7ebr7-00000.warc.gz 583235 download   job
virtuallyfun.superglobalmegacorp.com-shallow-20160117-005149-7ebr7-00000.warc.gz.png 511616 download
virtuallyfun.superglobalmegacorp.com-shallow-20160117-005149-7ebr7-00000.warc.gz_thumb.jpg 3507 download
virtuallyfun.superglobalmegacorp.com-shallow-20160117-005149-7ebr7-00000.warc.os.cdx.gz 2749 download
virtuallyfun.superglobalmegacorp.com-shallow-20160117-005149-7ebr7-meta.warc.gz 4842 download   job
virtuallyfun.superglobalmegacorp.com-shallow-20160117-005149-7ebr7-meta.warc.os.cdx.gz 47 download
virtuallyfun.superglobalmegacorp.com-shallow-20160117-005149-7ebr7.json 289 download   job
worldsworstdevotee.tumblr.com-inf-20160117-044508-cqhrn-00000.warc.gz 60417101 download   job
worldsworstdevotee.tumblr.com-inf-20160117-044508-cqhrn-00000.warc.gz.png 106286 download
worldsworstdevotee.tumblr.com-inf-20160117-044508-cqhrn-00000.warc.gz_thumb.jpg 2010 download
worldsworstdevotee.tumblr.com-inf-20160117-044508-cqhrn-00000.warc.os.cdx.gz 92085 download
worldsworstdevotee.tumblr.com-inf-20160117-044508-cqhrn-meta.warc.gz 1659890 download   job
worldsworstdevotee.tumblr.com-inf-20160117-044508-cqhrn-meta.warc.os.cdx.gz 47 download
worldsworstdevotee.tumblr.com-inf-20160117-044508-cqhrn.json 256 download   job
writtenwaters.tumblr.com-inf-20160117-044722-9qboh-00000.warc.gz 7310665 download   job
writtenwaters.tumblr.com-inf-20160117-044722-9qboh-00000.warc.gz.png 34378 download
writtenwaters.tumblr.com-inf-20160117-044722-9qboh-00000.warc.gz_thumb.jpg 1967 download
writtenwaters.tumblr.com-inf-20160117-044722-9qboh-00000.warc.os.cdx.gz 22816 download
writtenwaters.tumblr.com-inf-20160117-044722-9qboh-meta.warc.gz 91289 download   job
writtenwaters.tumblr.com-inf-20160117-044722-9qboh-meta.warc.os.cdx.gz 47 download
writtenwaters.tumblr.com-inf-20160117-044722-9qboh.json 251 download   job
www.abc.net.au-shallow-20160117-005722-d87m0-00000.warc.gz 2034283 download   job
www.abc.net.au-shallow-20160117-005722-d87m0-00000.warc.gz.png 778209 download
www.abc.net.au-shallow-20160117-005722-d87m0-00000.warc.gz_thumb.jpg 5491 download
www.abc.net.au-shallow-20160117-005722-d87m0-00000.warc.os.cdx.gz 10239 download
www.abc.net.au-shallow-20160117-005722-d87m0-meta.warc.gz 9973 download   job
www.abc.net.au-shallow-20160117-005722-d87m0-meta.warc.os.cdx.gz 47 download
www.abc.net.au-shallow-20160117-005722-d87m0.json 285 download   job
www.abc.net.au-shallow-20160117-011505-710of-00000.warc.gz 37749988 download   job
www.abc.net.au-shallow-20160117-011505-710of-00000.warc.gz.png 234991 download
www.abc.net.au-shallow-20160117-011505-710of-00000.warc.gz_thumb.jpg 3818 download
www.abc.net.au-shallow-20160117-011505-710of-00000.warc.os.cdx.gz 5947 download
www.abc.net.au-shallow-20160117-011505-710of-meta.warc.gz 7469 download   job
www.abc.net.au-shallow-20160117-011505-710of-meta.warc.os.cdx.gz 47 download
www.abc.net.au-shallow-20160117-011505-710of.json 278 download   job
www.alchemistowl.org-inf-20160117-034744-5wfm5-00000.warc.gz 1224907 download   job
www.alchemistowl.org-inf-20160117-034744-5wfm5-00000.warc.gz.png 44263 download
www.alchemistowl.org-inf-20160117-034744-5wfm5-00000.warc.gz_thumb.jpg 1697 download
www.alchemistowl.org-inf-20160117-034744-5wfm5-00000.warc.os.cdx.gz 6397 download
www.alchemistowl.org-inf-20160117-034744-5wfm5-meta.warc.gz 7741 download   job
www.alchemistowl.org-inf-20160117-034744-5wfm5-meta.warc.os.cdx.gz 47 download
www.alchemistowl.org-inf-20160117-034744-5wfm5.json 259 download   job
www.aph.gov.au-shallow-20160117-011441-cinkl-00000.warc.gz 106613 download   job
www.aph.gov.au-shallow-20160117-011441-cinkl-00000.warc.os.cdx.gz 262 download
www.aph.gov.au-shallow-20160117-011441-cinkl-meta.warc.gz 3211 download   job
www.aph.gov.au-shallow-20160117-011441-cinkl-meta.warc.os.cdx.gz 47 download
www.aph.gov.au-shallow-20160117-011441-cinkl.json 311 download   job
www.babelstone.co.uk-inf-20160115-044608-4clw1-00000.warc.gz 5368929594 download   job
www.babelstone.co.uk-inf-20160115-044608-4clw1-00000.warc.gz.png 128514 download
www.babelstone.co.uk-inf-20160115-044608-4clw1-00000.warc.gz_thumb.jpg 3464 download
www.babelstone.co.uk-inf-20160115-044608-4clw1-00000.warc.os.cdx.gz 3464901 download
www.babelstone.co.uk-inf-20160115-044608-4clw1-00001.warc.gz 3332815236 download   job
www.babelstone.co.uk-inf-20160115-044608-4clw1-00001.warc.os.cdx.gz 548466 download
www.babelstone.co.uk-inf-20160115-044608-4clw1-meta.warc.gz 1978889 download   job
www.babelstone.co.uk-inf-20160115-044608-4clw1-meta.warc.os.cdx.gz 47 download
www.babelstone.co.uk-inf-20160115-044608-4clw1.json 270 download   job
www.belch.com-inf-20160113-054841-awqvk-00001.warc.gz 5368974067 download   job
www.belch.com-inf-20160113-054841-awqvk-00001.warc.gz.png 125607 download
www.belch.com-inf-20160113-054841-awqvk-00001.warc.gz_thumb.jpg 3544 download
www.belch.com-inf-20160113-054841-awqvk-00001.warc.os.cdx.gz 4892145 download
www.belch.com-inf-20160113-054841-awqvk-00002.warc.gz 5441350013 download   job
www.belch.com-inf-20160113-054841-awqvk-00002.warc.gz.png 168267 download
www.belch.com-inf-20160113-054841-awqvk-00002.warc.gz_thumb.jpg 3959 download
www.belch.com-inf-20160113-054841-awqvk-00002.warc.os.cdx.gz 3716437 download
www.bowiewonderworld.com-inf-20160111-163228-ewj57-00002.warc.gz 3823016576 download   job
www.bowiewonderworld.com-inf-20160111-163228-ewj57-00002.warc.gz.png 59935 download
www.bowiewonderworld.com-inf-20160111-163228-ewj57-00002.warc.gz_thumb.jpg 1727 download
www.bowiewonderworld.com-inf-20160111-163228-ewj57-00002.warc.os.cdx.gz 3663695 download
www.bowiewonderworld.com-inf-20160111-163228-ewj57-meta.warc.gz 10590301 download   job
www.bowiewonderworld.com-inf-20160111-163228-ewj57-meta.warc.os.cdx.gz 47 download
www.bowiewonderworld.com-inf-20160111-163228-ewj57.json 251 download   job
www.chicagotribune.com-shallow-20160116-042704-c810m-00000.warc.gz 1124079 download   job
www.chicagotribune.com-shallow-20160116-042704-c810m-00000.warc.gz.png 165173 download
www.chicagotribune.com-shallow-20160116-042704-c810m-00000.warc.gz_thumb.jpg 3389 download
www.chicagotribune.com-shallow-20160116-042704-c810m-00000.warc.os.cdx.gz 5407 download
www.chicagotribune.com-shallow-20160116-042704-c810m-meta.warc.gz 6994 download   job
www.chicagotribune.com-shallow-20160116-042704-c810m-meta.warc.os.cdx.gz 47 download
www.chicagotribune.com-shallow-20160116-042704-c810m.json 352 download   job
www.debian.org-shallow-20160116-073211-58bt5-00000.warc.gz 64103 download   job
www.debian.org-shallow-20160116-073211-58bt5-00000.warc.gz.png 164186 download
www.debian.org-shallow-20160116-073211-58bt5-00000.warc.gz_thumb.jpg 3625 download
www.debian.org-shallow-20160116-073211-58bt5-00000.warc.os.cdx.gz 1181 download
www.debian.org-shallow-20160116-073211-58bt5-meta.warc.gz 3684 download   job
www.debian.org-shallow-20160116-073211-58bt5-meta.warc.os.cdx.gz 47 download
www.debian.org-shallow-20160116-073211-58bt5.json 243 download   job
www.debian.org-shallow-20160116-073229-cqfhj-00000.warc.gz 41294 download   job
www.debian.org-shallow-20160116-073229-cqfhj-00000.warc.gz.png 183725 download
www.debian.org-shallow-20160116-073229-cqfhj-00000.warc.gz_thumb.jpg 3505 download
www.debian.org-shallow-20160116-073229-cqfhj-00000.warc.os.cdx.gz 825 download
www.debian.org-shallow-20160116-073229-cqfhj-meta.warc.gz 3499 download   job
www.debian.org-shallow-20160116-073229-cqfhj-meta.warc.os.cdx.gz 47 download
www.debian.org-shallow-20160116-073229-cqfhj.json 262 download   job
www.facebook.com-shallow-20160116-053901-8dy0d-00000.warc.gz 3295344 download   job
www.facebook.com-shallow-20160116-053901-8dy0d-00000.warc.gz.png 136063 download
www.facebook.com-shallow-20160116-053901-8dy0d-00000.warc.gz_thumb.jpg 2424 download
www.facebook.com-shallow-20160116-053901-8dy0d-00000.warc.os.cdx.gz 28947 download
www.facebook.com-shallow-20160116-053901-8dy0d-meta.warc.gz 22811 download   job
www.facebook.com-shallow-20160116-053901-8dy0d-meta.warc.os.cdx.gz 47 download
www.facebook.com-shallow-20160116-053901-8dy0d.json 287 download   job
www.halfdog.net-shallow-20160116-062432-8d8cc-00000.warc.gz 9613 download   job
www.halfdog.net-shallow-20160116-062432-8d8cc-00000.warc.gz.png 226826 download
www.halfdog.net-shallow-20160116-062432-8d8cc-00000.warc.gz_thumb.jpg 3729 download
www.halfdog.net-shallow-20160116-062432-8d8cc-00000.warc.os.cdx.gz 388 download
www.halfdog.net-shallow-20160116-062432-8d8cc-meta.warc.gz 3501 download   job
www.halfdog.net-shallow-20160116-062432-8d8cc-meta.warc.os.cdx.gz 47 download
www.halfdog.net-shallow-20160116-062432-8d8cc.json 297 download   job
www.irit.fr-shallow-20160116-063320-c5sz4-00000.warc.gz 450471 download   job
www.irit.fr-shallow-20160116-063320-c5sz4-00000.warc.os.cdx.gz 244 download
www.irit.fr-shallow-20160116-063320-c5sz4-meta.warc.gz 3152 download   job
www.irit.fr-shallow-20160116-063320-c5sz4-meta.warc.os.cdx.gz 47 download
www.irit.fr-shallow-20160116-063320-c5sz4.json 268 download   job
www.irit.fr-shallow-20160116-063326-15rce-00000.warc.gz 13162070 download   job
www.irit.fr-shallow-20160116-063326-15rce-00000.warc.os.cdx.gz 244 download
www.irit.fr-shallow-20160116-063326-15rce-meta.warc.gz 3158 download   job
www.irit.fr-shallow-20160116-063326-15rce-meta.warc.os.cdx.gz 47 download
www.irit.fr-shallow-20160116-063326-15rce.json 268 download   job
www.irle.berkeley.edu-inf-20160114-051614-19jmt-00001.warc.gz 1249596883 download   job
www.irle.berkeley.edu-inf-20160114-051614-19jmt-00001.warc.os.cdx.gz 2843924 download
www.irle.berkeley.edu-inf-20160114-051614-19jmt-meta.warc.gz 4100584 download   job
www.irle.berkeley.edu-inf-20160114-051614-19jmt-meta.warc.os.cdx.gz 47 download
www.irle.berkeley.edu-inf-20160114-051614-19jmt.json 250 download   job
www.medisafe.com-inf-20160116-061113-9sjuk-00000.warc.gz 1099721901 download   job
www.medisafe.com-inf-20160116-061113-9sjuk-00000.warc.gz.png 507249 download
www.medisafe.com-inf-20160116-061113-9sjuk-00000.warc.gz_thumb.jpg 5036 download
www.medisafe.com-inf-20160116-061113-9sjuk-00000.warc.os.cdx.gz 1884146 download
www.medisafe.com-inf-20160116-061113-9sjuk-meta.warc.gz 1243357 download   job
www.medisafe.com-inf-20160116-061113-9sjuk-meta.warc.os.cdx.gz 47 download
www.medisafe.com-inf-20160116-061113-9sjuk.json 243 download   job
www.mirror.co.uk-shallow-20160116-073046-e5u8n-00000.warc.gz 4005 download   job
www.mirror.co.uk-shallow-20160116-073046-e5u8n-00000.warc.os.cdx.gz 263 download
www.mirror.co.uk-shallow-20160116-073046-e5u8n-meta.warc.gz 3216 download   job
www.mirror.co.uk-shallow-20160116-073046-e5u8n-meta.warc.os.cdx.gz 47 download
www.mirror.co.uk-shallow-20160116-073046-e5u8n.json 325 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-00035.warc.gz 5402064876 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-00035.warc.gz.png 117968 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00035.warc.gz_thumb.jpg 4031 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00035.warc.os.cdx.gz 182623 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00036.warc.gz 5371983073 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-00036.warc.gz.png 117968 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00036.warc.gz_thumb.jpg 4031 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00036.warc.os.cdx.gz 125182 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00037.warc.gz 5381394490 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-00037.warc.gz.png 117968 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00037.warc.gz_thumb.jpg 4031 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00037.warc.os.cdx.gz 176888 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00038.warc.gz 5380786223 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-00038.warc.gz.png 117968 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00038.warc.gz_thumb.jpg 4031 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00038.warc.os.cdx.gz 229111 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00039.warc.gz 5374343019 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-00039.warc.gz.png 117968 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00039.warc.gz_thumb.jpg 4031 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00039.warc.os.cdx.gz 406753 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00040.warc.gz 1090424615 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-00040.warc.gz.png 117968 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00040.warc.gz_thumb.jpg 4031 download
www.nacionalrock.com-inf-20160107-173345-ere6e-00040.warc.os.cdx.gz 50347 download
www.nacionalrock.com-inf-20160107-173345-ere6e-meta.warc.gz 26818251 download   job
www.nacionalrock.com-inf-20160107-173345-ere6e-meta.warc.os.cdx.gz 47 download
www.nacionalrock.com-inf-20160107-173345-ere6e.json 248 download   job
www.offiziere.ch-inf-20160116-043645-9a1w3-00000.warc.gz 635448445 download   job
www.offiziere.ch-inf-20160116-043645-9a1w3-00000.warc.gz.png 385975 download
www.offiziere.ch-inf-20160116-043645-9a1w3-00000.warc.gz_thumb.jpg 4806 download
www.offiziere.ch-inf-20160116-043645-9a1w3-00000.warc.os.cdx.gz 1008026 download
www.offiziere.ch-inf-20160116-043645-9a1w3-meta.warc.gz 607057 download   job
www.offiziere.ch-inf-20160116-043645-9a1w3-meta.warc.os.cdx.gz 47 download
www.offiziere.ch-inf-20160116-043645-9a1w3.json 257 download   job
www.phoneshopbysainsburys.co.uk-shallow-20160117-174547-bu5qt-00000.warc.gz 5914468 download   job
www.phoneshopbysainsburys.co.uk-shallow-20160117-174547-bu5qt-00000.warc.gz.png 191403 download
www.phoneshopbysainsburys.co.uk-shallow-20160117-174547-bu5qt-00000.warc.gz_thumb.jpg 3937 download
www.phoneshopbysainsburys.co.uk-shallow-20160117-174547-bu5qt-00000.warc.os.cdx.gz 16126 download
www.phoneshopbysainsburys.co.uk-shallow-20160117-174547-bu5qt-meta.warc.gz 11336 download   job
www.phoneshopbysainsburys.co.uk-shallow-20160117-174547-bu5qt-meta.warc.os.cdx.gz 47 download
www.phoneshopbysainsburys.co.uk-shallow-20160117-174547-bu5qt.json 265 download   job
www.polygon.com-shallow-20160116-014106-eidx3-00000.warc.gz 39847448 download   job
www.polygon.com-shallow-20160116-014106-eidx3-00000.warc.gz.png 619279 download
www.polygon.com-shallow-20160116-014106-eidx3-00000.warc.gz_thumb.jpg 3420 download
www.polygon.com-shallow-20160116-014106-eidx3-00000.warc.os.cdx.gz 8930 download
www.polygon.com-shallow-20160116-014106-eidx3-meta.warc.gz 9288 download   job
www.polygon.com-shallow-20160116-014106-eidx3-meta.warc.os.cdx.gz 47 download
www.polygon.com-shallow-20160116-014106-eidx3.json 299 download   job
www.prnewswire.com-shallow-20160116-063752-62fo2-00000.warc.gz 1993353 download   job
www.prnewswire.com-shallow-20160116-063752-62fo2-00000.warc.gz.png 151218 download
www.prnewswire.com-shallow-20160116-063752-62fo2-00000.warc.gz_thumb.jpg 3276 download
www.prnewswire.com-shallow-20160116-063752-62fo2-00000.warc.os.cdx.gz 10303 download
www.prnewswire.com-shallow-20160116-063752-62fo2-meta.warc.gz 10545 download   job
www.prnewswire.com-shallow-20160116-063752-62fo2-meta.warc.os.cdx.gz 47 download
www.prnewswire.com-shallow-20160116-063752-62fo2.json 369 download   job
www.propublica.org-inf-20160116-044746-cjh5l-00000.warc.gz 67942716 download   job
www.propublica.org-inf-20160116-044746-cjh5l-00000.warc.gz.png 116782 download
www.propublica.org-inf-20160116-044746-cjh5l-00000.warc.gz_thumb.jpg 2849 download
www.propublica.org-inf-20160116-044746-cjh5l-00000.warc.os.cdx.gz 181366 download
www.propublica.org-inf-20160116-044746-cjh5l-meta.warc.gz 121000 download   job
www.propublica.org-inf-20160116-044746-cjh5l-meta.warc.os.cdx.gz 47 download
www.propublica.org-inf-20160116-044746-cjh5l.json 319 download   job
www.ratemyprofessors.com-shallow-20160116-072714-8pccm-00000.warc.gz 4317043 download   job
www.ratemyprofessors.com-shallow-20160116-072714-8pccm-00000.warc.gz.png 147648 download
www.ratemyprofessors.com-shallow-20160116-072714-8pccm-00000.warc.gz_thumb.jpg 3292 download
www.ratemyprofessors.com-shallow-20160116-072714-8pccm-00000.warc.os.cdx.gz 9598 download
www.ratemyprofessors.com-shallow-20160116-072714-8pccm-meta.warc.gz 9743 download   job
www.ratemyprofessors.com-shallow-20160116-072714-8pccm-meta.warc.os.cdx.gz 47 download
www.ratemyprofessors.com-shallow-20160116-072714-8pccm.json 279 download   job
www.reddit.com-inf-20160116-075051-axtlf-00000.warc.gz 300466020 download   job
www.reddit.com-inf-20160116-075051-axtlf-00000.warc.gz.png 251710 download
www.reddit.com-inf-20160116-075051-axtlf-00000.warc.gz_thumb.jpg 4174 download
www.reddit.com-inf-20160116-075051-axtlf-00000.warc.os.cdx.gz 544185 download
www.reddit.com-inf-20160116-075051-axtlf-meta.warc.gz 583772 download   job
www.reddit.com-inf-20160116-075051-axtlf-meta.warc.os.cdx.gz 47 download
www.reddit.com-inf-20160116-075051-axtlf.json 313 download   job
www.reddit.com-inf-20160117-005448-uwycz-00000.warc.gz 178525086 download   job
www.reddit.com-inf-20160117-005448-uwycz-00000.warc.gz.png 195610 download
www.reddit.com-inf-20160117-005448-uwycz-00000.warc.gz_thumb.jpg 3351 download
www.reddit.com-inf-20160117-005448-uwycz-00000.warc.os.cdx.gz 203982 download
www.reddit.com-inf-20160117-005448-uwycz-meta.warc.gz 159668 download   job
www.reddit.com-inf-20160117-005448-uwycz-meta.warc.os.cdx.gz 47 download
www.reddit.com-inf-20160117-005448-uwycz.json 314 download   job
www.reddit.com-inf-20160117-005749-aawpk-00000.warc.gz 45863494 download   job
www.reddit.com-inf-20160117-005749-aawpk-00000.warc.gz.png 240089 download
www.reddit.com-inf-20160117-005749-aawpk-00000.warc.gz_thumb.jpg 3517 download
www.reddit.com-inf-20160117-005749-aawpk-00000.warc.os.cdx.gz 138397 download
www.reddit.com-inf-20160117-005749-aawpk-meta.warc.gz 109560 download   job
www.reddit.com-inf-20160117-005749-aawpk-meta.warc.os.cdx.gz 47 download
www.reddit.com-inf-20160117-005749-aawpk.json 317 download   job
www.reddit.com-inf-20160117-010114-5ar31-00000.warc.gz 96397982 download   job
www.reddit.com-inf-20160117-010114-5ar31-00000.warc.gz.png 252873 download
www.reddit.com-inf-20160117-010114-5ar31-00000.warc.gz_thumb.jpg 3970 download
www.reddit.com-inf-20160117-010114-5ar31-00000.warc.os.cdx.gz 100220 download
www.reddit.com-inf-20160117-010114-5ar31-meta.warc.gz 81857 download   job
www.reddit.com-inf-20160117-010114-5ar31-meta.warc.os.cdx.gz 47 download
www.reddit.com-inf-20160117-010114-5ar31.json 318 download   job
www.reddit.com-inf-20160117-010135-blv6y-00000.warc.gz 119213676 download   job
www.reddit.com-inf-20160117-010135-blv6y-00000.warc.gz.png 244332 download
www.reddit.com-inf-20160117-010135-blv6y-00000.warc.gz_thumb.jpg 3895 download
www.reddit.com-inf-20160117-010135-blv6y-00000.warc.os.cdx.gz 190047 download
www.reddit.com-inf-20160117-010135-blv6y-meta.warc.gz 149540 download   job
www.reddit.com-inf-20160117-010135-blv6y-meta.warc.os.cdx.gz 47 download
www.reddit.com-inf-20160117-010135-blv6y.json 325 download   job
www.reddit.com-shallow-20160117-005628-c9jvx-00000.warc.gz 1748084 download   job
www.reddit.com-shallow-20160117-005628-c9jvx-00000.warc.gz.png 213192 download
www.reddit.com-shallow-20160117-005628-c9jvx-00000.warc.gz_thumb.jpg 3362 download
www.reddit.com-shallow-20160117-005628-c9jvx-00000.warc.os.cdx.gz 8684 download
www.reddit.com-shallow-20160117-005628-c9jvx-meta.warc.gz 8199 download   job
www.reddit.com-shallow-20160117-005628-c9jvx-meta.warc.os.cdx.gz 47 download
www.reddit.com-shallow-20160117-005628-c9jvx.json 328 download   job
www.reuters.com-shallow-20160116-044412-65uv2-00000.warc.gz 964941 download   job
www.reuters.com-shallow-20160116-044412-65uv2-00000.warc.gz.png 188607 download
www.reuters.com-shallow-20160116-044412-65uv2-00000.warc.gz_thumb.jpg 4133 download
www.reuters.com-shallow-20160116-044412-65uv2-00000.warc.os.cdx.gz 7645 download
www.reuters.com-shallow-20160116-044412-65uv2-meta.warc.gz 8203 download   job
www.reuters.com-shallow-20160116-044412-65uv2-meta.warc.os.cdx.gz 47 download
www.reuters.com-shallow-20160116-044412-65uv2.json 296 download   job
www.ruskeys.net-inf-20160116-024451-2yl6f-00000.warc.gz 393728844 download   job
www.ruskeys.net-inf-20160116-024451-2yl6f-00000.warc.gz.png 294622 download
www.ruskeys.net-inf-20160116-024451-2yl6f-00000.warc.gz_thumb.jpg 3448 download
www.ruskeys.net-inf-20160116-024451-2yl6f-00000.warc.os.cdx.gz 175201 download
www.ruskeys.net-inf-20160116-024451-2yl6f-meta.warc.gz 99706 download   job
www.ruskeys.net-inf-20160116-024451-2yl6f-meta.warc.os.cdx.gz 47 download
www.ruskeys.net-inf-20160116-024451-2yl6f.json 244 download   job
www.someecards.com-shallow-20160116-053841-1pe21-00000.warc.gz 2563075 download   job
www.someecards.com-shallow-20160116-053841-1pe21-00000.warc.gz.png 48100 download
www.someecards.com-shallow-20160116-053841-1pe21-00000.warc.gz_thumb.jpg 1567 download
www.someecards.com-shallow-20160116-053841-1pe21-00000.warc.os.cdx.gz 15966 download
www.someecards.com-shallow-20160116-053841-1pe21-meta.warc.gz 13110 download   job
www.someecards.com-shallow-20160116-053841-1pe21-meta.warc.os.cdx.gz 47 download
www.someecards.com-shallow-20160116-053841-1pe21.json 296 download   job
www.teenagewildlife.com-inf-20160111-163634-5vqqw-00001.warc.gz 5368721584 download   job
www.teenagewildlife.com-inf-20160111-163634-5vqqw-00001.warc.os.cdx.gz 9512654 download
www.theatlantic.com-shallow-20160116-061737-4svev-00000.warc.gz 4593818 download   job
www.theatlantic.com-shallow-20160116-061737-4svev-00000.warc.gz.png 369128 download
www.theatlantic.com-shallow-20160116-061737-4svev-00000.warc.gz_thumb.jpg 4348 download
www.theatlantic.com-shallow-20160116-061737-4svev-00000.warc.os.cdx.gz 10870 download
www.theatlantic.com-shallow-20160116-061737-4svev-meta.warc.gz 10017 download   job
www.theatlantic.com-shallow-20160116-061737-4svev-meta.warc.os.cdx.gz 47 download
www.theatlantic.com-shallow-20160116-061737-4svev.json 310 download   job
www.theforce.net-inf-20160101-005916-1660y-00010.warc.gz 5369108907 download   job
www.theforce.net-inf-20160101-005916-1660y-00010.warc.gz.png 43477 download
www.theforce.net-inf-20160101-005916-1660y-00010.warc.gz_thumb.jpg 2218 download
www.theforce.net-inf-20160101-005916-1660y-00010.warc.os.cdx.gz 3431414 download
www.theforce.net-inf-20160101-005916-1660y-00011.warc.gz 5369335770 download   job
www.theforce.net-inf-20160101-005916-1660y-00011.warc.os.cdx.gz 2955529 download
www.theguardian.com-shallow-20160116-183523-7qse7-00000.warc.gz 9223315 download   job
www.theguardian.com-shallow-20160116-183523-7qse7-00000.warc.gz.png 82015 download
www.theguardian.com-shallow-20160116-183523-7qse7-00000.warc.gz_thumb.jpg 3179 download
www.theguardian.com-shallow-20160116-183523-7qse7-00000.warc.os.cdx.gz 26931 download
www.theguardian.com-shallow-20160116-183523-7qse7-meta.warc.gz 19909 download   job
www.theguardian.com-shallow-20160116-183523-7qse7-meta.warc.os.cdx.gz 47 download
www.theguardian.com-shallow-20160116-183523-7qse7.json 328 download   job
www.thelocal.at-shallow-20160116-161855-ahwc0-00000.warc.gz 1402756 download   job
www.thelocal.at-shallow-20160116-161855-ahwc0-00000.warc.gz.png 383319 download
www.thelocal.at-shallow-20160116-161855-ahwc0-00000.warc.gz_thumb.jpg 4263 download
www.thelocal.at-shallow-20160116-161855-ahwc0-00000.warc.os.cdx.gz 8773 download
www.thelocal.at-shallow-20160116-161855-ahwc0-meta.warc.gz 8964 download   job
www.thelocal.at-shallow-20160116-161855-ahwc0-meta.warc.os.cdx.gz 47 download
www.thelocal.at-shallow-20160116-161855-ahwc0.json 303 download   job
www.vidarholen.net-inf-20160115-185418-7zn7b-00000.warc.gz 4175709 download   job
www.vidarholen.net-inf-20160115-185418-7zn7b-00000.warc.gz.png 164118 download
www.vidarholen.net-inf-20160115-185418-7zn7b-00000.warc.gz_thumb.jpg 3407 download
www.vidarholen.net-inf-20160115-185418-7zn7b-00000.warc.os.cdx.gz 23849 download
www.vidarholen.net-inf-20160115-185418-7zn7b-meta.warc.gz 16699 download   job
www.vidarholen.net-inf-20160115-185418-7zn7b-meta.warc.os.cdx.gz 47 download
www.vidarholen.net-inf-20160115-185418-7zn7b.json 260 download   job
www.zijderzin.nl-inf-20160116-060930-2fe8e-00000.warc.gz 15807400 download   job
www.zijderzin.nl-inf-20160116-060930-2fe8e-00000.warc.gz.png 486942 download
www.zijderzin.nl-inf-20160116-060930-2fe8e-00000.warc.gz_thumb.jpg 3337 download
www.zijderzin.nl-inf-20160116-060930-2fe8e-00000.warc.os.cdx.gz 10804 download
www.zijderzin.nl-inf-20160116-060930-2fe8e-meta.warc.gz 9099 download   job
www.zijderzin.nl-inf-20160116-060930-2fe8e-meta.warc.os.cdx.gz 47 download
www.zijderzin.nl-inf-20160116-060930-2fe8e.json 245 download   job