View on Internet Archive

Filename Size
8ch.net-inf-20170129-032323-4yjr9.json 255 download   job
acheaven.buwahaha.com-inf-20170130-170214-9j2b4.json 250 download   job
acpedia.org-inf-20170129-063707-81s4l-aborted-00000.warc.gz 170104090 download   job
acpedia.org-inf-20170129-063707-81s4l-aborted-00000.warc.os.cdx.gz 737051 download
acpedia.org-inf-20170129-063707-81s4l-aborted.json 237 download   job
aftermathnews.wordpress.com-shallow-20170130-200453-6vwsy.json 322 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00030.warc.gz 5470772974 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00030.warc.os.cdx.gz 4816941 download
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00031.warc.gz 5498550328 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00031.warc.os.cdx.gz 7740 download
applemuseum.bott.org-inf-20170130-053543-26ago.json 247 download   job
archive.pdp-11.org.ru-inf-20170130-032216-7xg43.json 250 download   job
archiveteam_archivebot_go_20170130210002.cdx.gz 38202087 download
archiveteam_archivebot_go_20170130210002.cdx.idx 41193 download
archiveteam_archivebot_go_20170130210002_archive.torrent 614576 download
archiveteam_archivebot_go_20170130210002_files.xml 0 download
archiveteam_archivebot_go_20170130210002_meta.sqlite 322560 download
archiveteam_archivebot_go_20170130210002_meta.xml 793 download
arstechnica.com-shallow-20170129-124649-2d9gi.json 336 download   job
assets.documentcloud.org-shallow-20170129-124633-4atht.json 322 download   job
bearingarms.com-inf-20170123-201858-8rqrv.json 244 download   job
bit.ly-shallow-20170128-235819-4eg5h.json 252 download   job
boards.4chan.org-inf-20170128-223430-e3zg5-00003.warc.gz 1379588557 download   job
boards.4chan.org-inf-20170128-223430-e3zg5-00003.warc.os.cdx.gz 389126 download
boards.4chan.org-inf-20170128-223430-e3zg5.json 248 download   job
chickenscrawlings.com-inf-20170128-125340-35l11-00000.warc.gz 910841778 download   job
chickenscrawlings.com-inf-20170128-125340-35l11-00000.warc.os.cdx.gz 1311319 download
chickenscrawlings.com-inf-20170128-125340-35l11-meta.warc.gz 938709 download   job
chickenscrawlings.com-inf-20170128-125340-35l11-meta.warc.os.cdx.gz 47 download
chickenscrawlings.com-inf-20170128-125340-35l11.json 246 download   job
cock.li-inf-20170128-224735-bjhsj.json 293 download   job
conservative-headlines.com-inf-20170123-202006-5iijs.json 254 download   job
democraciadigit.al-inf-20170129-171948-d0qi6.json 249 download   job
derethconservancy.org-inf-20170130-164927-2c6na.json 265 download   job
derethconservancy.org-inf-20170130-165126-11t8r.json 250 download   job
download.ddo.akamai.turbine.com-shallow-20170130-155955-dj77a.json 295 download   job
download.lavadomefive.com-inf-20170130-144124-5h5r7-00000.warc.gz 4601 download   job
download.lavadomefive.com-inf-20170130-144124-5h5r7-00000.warc.os.cdx.gz 230 download
download.lavadomefive.com-inf-20170130-144124-5h5r7-meta.warc.gz 3193 download   job
download.lavadomefive.com-inf-20170130-144124-5h5r7-meta.warc.os.cdx.gz 47 download
download.lavadomefive.com-inf-20170130-144124-5h5r7.json 270 download   job
download.lavadomefive.com-inf-20170130-144243-c6lqn-00000.warc.gz 14814 download   job
download.lavadomefive.com-inf-20170130-144243-c6lqn-00000.warc.os.cdx.gz 551 download
download.lavadomefive.com-inf-20170130-144243-c6lqn-meta.warc.gz 3500 download   job
download.lavadomefive.com-inf-20170130-144243-c6lqn-meta.warc.os.cdx.gz 47 download
download.lavadomefive.com-inf-20170130-144243-c6lqn.json 269 download   job
download.lavadomefive.com-inf-20170130-144311-806v2-00000.warc.gz 172377 download   job
download.lavadomefive.com-inf-20170130-144311-806v2-00000.warc.os.cdx.gz 2764 download
download.lavadomefive.com-inf-20170130-144311-806v2-meta.warc.gz 4578 download   job
download.lavadomefive.com-inf-20170130-144311-806v2-meta.warc.os.cdx.gz 47 download
download.lavadomefive.com-inf-20170130-144311-806v2.json 270 download   job
download.lavadomefive.com-inf-20170130-144420-6xne2-00000.warc.gz 798001105 download   job
download.lavadomefive.com-inf-20170130-144420-6xne2-00000.warc.os.cdx.gz 5482 download
download.lavadomefive.com-inf-20170130-144420-6xne2-meta.warc.gz 6207 download   job
download.lavadomefive.com-inf-20170130-144420-6xne2-meta.warc.os.cdx.gz 47 download
download.lavadomefive.com-inf-20170130-144420-6xne2.json 268 download   job
drmcninja.com-inf-20170129-150007-5gmas.json 241 download   job
edukado.net-inf-20170127-120244-7eqn1.json 241 download   job
falconio.weebly.com-inf-20170130-053950-251xe-00000.warc.gz 454702216 download   job
falconio.weebly.com-inf-20170130-053950-251xe-00000.warc.os.cdx.gz 374612 download
falconio.weebly.com-inf-20170130-053950-251xe-meta.warc.gz 242769 download   job
falconio.weebly.com-inf-20170130-053950-251xe-meta.warc.os.cdx.gz 47 download
falconio.weebly.com-inf-20170130-053950-251xe.json 244 download   job
francaisdefrance.wordpress.com-inf-20170128-070725-7nlkq-00003.warc.gz 1416334065 download   job
francaisdefrance.wordpress.com-inf-20170128-070725-7nlkq-00003.warc.os.cdx.gz 3938991 download
francaisdefrance.wordpress.com-inf-20170128-070725-7nlkq.json 259 download   job
ftp.ea.com-inf-20170130-154839-9juy9-00000.warc.gz 5565950384 download   job
ftp.ea.com-inf-20170130-154839-9juy9-00000.warc.os.cdx.gz 11319 download
ftp.ea.com-inf-20170130-154839-9juy9-00001.warc.gz 5374027303 download   job
ftp.ea.com-inf-20170130-154839-9juy9-00001.warc.os.cdx.gz 50296 download
ftp.ea.com-inf-20170130-154839-9juy9-00002.warc.gz 5434119963 download   job
ftp.ea.com-inf-20170130-154839-9juy9-00002.warc.os.cdx.gz 45018 download
ftp.ea.com-inf-20170130-154839-9juy9-00003.warc.gz 5368812355 download   job
ftp.ea.com-inf-20170130-154839-9juy9-00003.warc.os.cdx.gz 77148 download
ftp.kernel.org-inf-20170129-030157-93056-aborted-00000.warc.gz 534101509 download   job
ftp.kernel.org-inf-20170129-030157-93056-aborted-00000.warc.os.cdx.gz 3021 download
ftp.kernel.org-inf-20170129-030157-93056-aborted.json 239 download   job
ftp.kernel.org-inf-20170129-032013-93056-aborted-00152.warc.gz 702554500 download   job
ftp.kernel.org-inf-20170129-032013-93056-aborted-00152.warc.os.cdx.gz 818 download
geoftp.ibge.gov.br-inf-20170130-140527-asa8t.json 311 download   job
grep.geek-inf-20170130-042717-du2v9.json 239 download   job
gresillon.org-inf-20170129-132209-9sk4z.json 243 download   job
haozip.2345.com-inf-20170129-001418-934pu.json 245 download   job
horror.org-inf-20170130-111103-efbzo.json 238 download   job
kikaibunsho.web.fc2.com-inf-20170129-185827-sbk3l.json 253 download   job
koin.com-shallow-20170130-185037-biq6q.json 310 download   job
landing.google.com-inf-20170129-035103-22qoi.json 255 download   job
lawfareblog.com-shallow-20170130-161353-c3uig.json 326 download   job
lepeuple.be-inf-20170128-221949-34kei-00005.warc.gz 1419107891 download   job
lepeuple.be-inf-20170128-221949-34kei-00005.warc.os.cdx.gz 2329448 download
lepeuple.be-inf-20170128-221949-34kei.json 239 download   job
mamedev.emulab.it-inf-20170130-160955-1svuq.json 249 download   job
medium.com-inf-20170130-161842-a44ra.json 292 download   job
medium.com-shallow-20170130-172822-s5v38.json 291 download   job
mirrors.kernel.org-inf-20170129-030413-br32a-aborted-00000.warc.gz 86219795 download   job
mirrors.kernel.org-inf-20170129-030413-br32a-aborted-00000.warc.os.cdx.gz 2751 download
mirrors.kernel.org-inf-20170129-030413-br32a-aborted.json 243 download   job
montreal.ctvnews.ca-shallow-20170130-053724-b2pn4-00000.warc.gz 1952343 download   job
montreal.ctvnews.ca-shallow-20170130-053724-b2pn4-00000.warc.os.cdx.gz 14602 download
montreal.ctvnews.ca-shallow-20170130-053724-b2pn4-meta.warc.gz 11490 download   job
montreal.ctvnews.ca-shallow-20170130-053724-b2pn4-meta.warc.os.cdx.gz 47 download
montreal.ctvnews.ca-shallow-20170130-053724-b2pn4.json 306 download   job
mormonleaks.com-inf-20170129-073258-bqijm.json 245 download   job
mormonleaks.io-inf-20170129-073424-32484.json 245 download   job
motherboard.vice.com-shallow-20170130-181557-8d7kr.json 299 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00026.warc.gz 5376920696 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00026.warc.os.cdx.gz 7699923 download
pastebin.com-inf-20170129-031513-925w4.json 251 download   job
patches.ubi.com-inf-20170130-100317-35gp6.json 240 download   job
pdp-11.org.ru-inf-20170130-032505-a2x4h-00000.warc.gz 2096821324 download   job
pdp-11.org.ru-inf-20170130-032505-a2x4h-00000.warc.os.cdx.gz 23554 download
pdp-11.org.ru-inf-20170130-032505-a2x4h-meta.warc.gz 15958 download   job
pdp-11.org.ru-inf-20170130-032505-a2x4h-meta.warc.os.cdx.gz 47 download
pdp-11.org.ru-inf-20170130-032505-a2x4h.json 242 download   job
portland.backpage.com-inf-20170129-070952-ex20r.json 251 download   job
profesiulo.info-inf-20170129-222832-75vea.json 245 download   job
refusefascism.org-inf-20170126-010935-d1k3a.json 248 download   job
ripostelaique.com-inf-20170128-221828-bpp4i-00002.warc.gz 503388024 download   job
ripostelaique.com-inf-20170128-221828-bpp4i-00002.warc.os.cdx.gz 407558 download
ripostelaique.com-inf-20170128-221828-bpp4i.json 245 download   job
sakaiesperanto.web.fc2.com-inf-20170129-115220-7ck6s.json 256 download   job
sijmen.ruwhof.net-shallow-20170130-200702-bu28j.json 303 download   job
skoolkit.ca-inf-20170129-095829-16jx0.json 238 download   job
techcrunch.com-shallow-20170128-194836-adftb-00000.warc.gz 5875361 download   job
techcrunch.com-shallow-20170128-194836-adftb-00000.warc.os.cdx.gz 11431 download
techcrunch.com-shallow-20170128-194836-adftb-meta.warc.gz 10859 download   job
techcrunch.com-shallow-20170128-194836-adftb-meta.warc.os.cdx.gz 47 download
techcrunch.com-shallow-20170128-194836-adftb.json 298 download   job
the1709blog.blogspot.com-inf-20170127-162527-31ae6-00007.warc.gz 6458575630 download   job
the1709blog.blogspot.com-inf-20170127-162527-31ae6-00007.warc.os.cdx.gz 768779 download
the1709blog.blogspot.com-inf-20170127-162527-31ae6-meta.warc.gz 15911741 download   job
the1709blog.blogspot.com-inf-20170127-162527-31ae6-meta.warc.os.cdx.gz 47 download
thejackcat.com-inf-20170130-165856-8g6zp.json 243 download   job
theopporeport.com-inf-20170129-223123-4basv.json 340 download   job
tulis.co-inf-20170127-035206-e7nre-00000.warc.gz 892152191 download   job
tulis.co-inf-20170127-035206-e7nre-00000.warc.os.cdx.gz 3881743 download
tulis.co-inf-20170127-035206-e7nre-meta.warc.gz 3950123 download   job
tulis.co-inf-20170127-035206-e7nre-meta.warc.os.cdx.gz 47 download
tulis.co-inf-20170127-035206-e7nre.json 246 download   job
twitter.com-shallow-20170128-182050-em6rk-meta.warc.gz 7503 download   job
twitter.com-shallow-20170128-182050-em6rk-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170128-182050-em6rk.json 259 download   job
twitter.com-shallow-20170128-192526-6ev64.json 285 download   job
twitter.com-shallow-20170129-005842-dhy9b.json 278 download   job
twitter.com-shallow-20170129-064055-beusu.json 279 download   job
twitter.com-shallow-20170129-064156-659yl.json 277 download   job
twitter.com-shallow-20170129-081535-23ro3.json 281 download   job
twitter.com-shallow-20170129-195855-eh7ob.json 254 download   job
twitter.com-shallow-20170129-195924-6pkbh.json 280 download   job
twitter.com-shallow-20170130-031729-3b0pw.json 286 download   job
twitter.com-shallow-20170130-035731-14i6m-00000.warc.gz 45858 download   job
twitter.com-shallow-20170130-035731-14i6m-00000.warc.os.cdx.gz 241 download
twitter.com-shallow-20170130-035731-14i6m-meta.warc.gz 4351 download   job
twitter.com-shallow-20170130-035731-14i6m-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170130-035731-14i6m.json 282 download   job
twitter.com-shallow-20170130-035820-14i6m.json 282 download   job
twitter.com-shallow-20170130-070312-3sbug-00000.warc.gz 40689 download   job
twitter.com-shallow-20170130-070312-3sbug-00000.warc.os.cdx.gz 242 download
twitter.com-shallow-20170130-070312-3sbug-meta.warc.gz 5090 download   job
twitter.com-shallow-20170130-070312-3sbug-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170130-070312-3sbug.json 284 download   job
twitter.com-shallow-20170130-102548-55uhw-00000.warc.gz 2778588 download   job
twitter.com-shallow-20170130-102548-55uhw-00000.warc.os.cdx.gz 6146 download
twitter.com-shallow-20170130-102548-55uhw-meta.warc.gz 7657 download   job
twitter.com-shallow-20170130-102548-55uhw-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170130-102548-55uhw.json 276 download   job
twitter.com-shallow-20170130-152911-bezhq.json 279 download   job
twitter.com-shallow-20170130-194138-64pqj.json 280 download   job
upagainstthelaw.org-inf-20170128-233459-brp39.json 247 download   job
urls-pastebin.com-9489HBgt-inf-20170120-045859-4f0fy-urls.txt 3626 download
urls-pastebin.com-9489HBgt-inf-20170120-045859-4f0fy.json 282 download   job
urls-pastebin.com-GCLEKsNT-shallow-20170129-034412-ehzim-urls.txt 172 download
urls-pastebin.com-GCLEKsNT-shallow-20170129-034412-ehzim.json 287 download   job
urls-pastebin.com-tPi8TW0a-shallow-20170130-174516-7obw0-urls.txt 664 download
urls-pastebin.com-tPi8TW0a-shallow-20170130-174516-7obw0.json 285 download   job
urls-www.example.com-some-file.txt-inf-20170129-030934-byu50-00000.warc.gz 2519 download   job
urls-www.example.com-some-file.txt-inf-20170129-030934-byu50-00000.warc.os.cdx.gz 47 download
urls-www.example.com-some-file.txt-inf-20170129-030934-byu50-urls.txt 1270 download
urls-www.example.com-some-file.txt-inf-20170129-030934-byu50.json 289 download   job
usa.streetsblog.org-inf-20170125-225352-cc5x8-00016.warc.gz 208253877 download   job
usa.streetsblog.org-inf-20170125-225352-cc5x8-00016.warc.os.cdx.gz 302739 download
usa.streetsblog.org-inf-20170125-225352-cc5x8.json 249 download   job
usa.streetsblog.org-inf-20170128-222403-cc5x8-00000.warc.gz 5368742442 download   job
usa.streetsblog.org-inf-20170128-222403-cc5x8-00000.warc.os.cdx.gz 2697801 download
usa.streetsblog.org-inf-20170128-222403-cc5x8-00001.warc.gz 5368743796 download   job
usa.streetsblog.org-inf-20170128-222403-cc5x8-00001.warc.os.cdx.gz 1511831 download
usa.streetsblog.org-inf-20170128-222403-cc5x8-00002.warc.gz 5372002368 download   job
usa.streetsblog.org-inf-20170128-222403-cc5x8-00002.warc.os.cdx.gz 2058449 download
usa.streetsblog.org-inf-20170128-222403-cc5x8-00003.warc.gz 5370698311 download   job
usa.streetsblog.org-inf-20170128-222403-cc5x8-00003.warc.os.cdx.gz 2701499 download
usa.streetsblog.org-inf-20170128-222403-cc5x8-00004.warc.gz 5369015137 download   job
usa.streetsblog.org-inf-20170128-222403-cc5x8-00004.warc.os.cdx.gz 3688615 download
variety.com-shallow-20170129-180929-6v9md.json 311 download   job
voxday.blogspot.com-inf-20170119-055747-10vyz-00035.warc.gz 1258008772 download   job
voxday.blogspot.com-inf-20170119-055747-10vyz.json 247 download   job
whitehatmag.com-shallow-20170128-223527-311ef-00000.warc.gz 2883863 download   job
whitehatmag.com-shallow-20170128-223527-311ef-meta.warc.gz 10218 download   job
whitehatmag.com-shallow-20170128-223527-311ef.json 306 download   job
www.221b-baker-street.org-inf-20170130-085606-2dr85-00000.warc.gz 2525 download   job
www.221b-baker-street.org-inf-20170130-085606-2dr85-meta.warc.gz 3347 download   job
www.221b-baker-street.org-inf-20170130-085606-2dr85.json 253 download   job
www.aclu.org-shallow-20170128-221516-64rvj-00000.warc.gz 169671 download   job
www.aclu.org-shallow-20170128-221516-64rvj-meta.warc.gz 3200 download   job
www.aclu.org-shallow-20170128-221516-64rvj.json 296 download   job
www.albeshiloh.com-inf-20170130-085711-5xvb3.json 246 download   job
www.annebishop.com-inf-20170130-085931-b12ed.json 246 download   job
www.asheronsguide.com-inf-20170130-165448-aoiin.json 250 download   job
www.avoiceformen.com-inf-20170129-083814-exgze-00000.warc.gz 5368798177 download   job
www.avoiceformen.com-inf-20170129-083814-exgze-00001.warc.gz 5380334525 download   job
www.avoiceformen.com-inf-20170129-083814-exgze-00002.warc.gz 5368740415 download   job
www.avoiceformen.com-inf-20170129-083814-exgze-00003.warc.gz 1230659852 download   job
www.avoiceformen.com-inf-20170129-083814-exgze.json 251 download   job
www.avramdavidson.org-inf-20170130-090056-8bfy2.json 249 download   job
www.baltimoresun.com-shallow-20170129-002619-8la4z.json 322 download   job
www.barbarabovaliteraryagency.com-inf-20170130-090608-8yjdd.json 261 download   job
www.bbc.com-shallow-20170130-063743-1knkx.json 271 download   job
www.benbova.net-inf-20170130-090639-c9dzo.json 243 download   job
www.bloomberg.com-shallow-20170128-195011-60qpc-00000.warc.gz 5587537 download   job
www.bloomberg.com-shallow-20170128-195011-60qpc-meta.warc.gz 13733 download   job
www.bloomberg.com-shallow-20170128-195011-60qpc.json 337 download   job
www.bloomberg.com-shallow-20170130-190634-1axk7.json 325 download   job
www.blu-ray.com-shallow-20170128-204918-atbhq.json 261 download   job
www.brazenhussies.net-inf-20170130-090705-hazhs.json 249 download   job
www.brooklynwriter.com-inf-20170130-090129-9nkwy-00000.warc.gz 193340114 download   job
www.brooklynwriter.com-inf-20170130-090129-9nkwy-meta.warc.gz 246323 download   job
www.brooklynwriter.com-inf-20170130-090129-9nkwy.json 250 download   job
www.caerfyrddin.org-inf-20170130-091106-ci8t9.json 247 download   job
www.camillelaguire.com-inf-20170130-091547-4te1g.json 250 download   job
www.catherineshaffer.com-inf-20170130-091602-2yj8y.json 252 download   job
www.change.org-shallow-20170130-073357-134ld-00000.warc.gz 10550165 download   job
www.change.org-shallow-20170130-073357-134ld-meta.warc.gz 32585 download   job
www.change.org-shallow-20170130-073357-134ld.json 368 download   job
www.christopherreynaga.com-inf-20170130-091636-fqawt-aborted-00000.warc.gz 165817175 download   job
www.christopherreynaga.com-inf-20170130-091636-fqawt-aborted.json 253 download   job
www.christopherreynaga.com-inf-20170130-112440-fqawt.json 254 download   job
www.chuckfair.com-inf-20170130-091641-8dayg.json 245 download   job
www.dailystormer.com-inf-20170106-031934-ck6yf.json 248 download   job
www.davidbarrkirtley.com-inf-20170130-091722-56v9k.json 252 download   job
www.desperationmorale.com-inf-20170130-091919-1lt53.json 253 download   job
www.destanyblair.com-inf-20170130-092219-1jatu.json 248 download   job
www.dmv.ca.gov-shallow-20170129-020817-6myd7.json 332 download   job
www.douglastriggs.com-inf-20170130-092600-2g20i.json 249 download   job
www.electrictoolbox.com-inf-20170129-074030-8vr7p.json 254 download   job
www.elizabethmoon.com-inf-20170130-102810-16azi-00000.warc.gz 39744408 download   job
www.elizabethmoon.com-inf-20170130-102810-16azi-meta.warc.gz 46319 download   job
www.elizabethmoon.com-inf-20170130-102810-16azi.json 249 download   job
www.esperanto-sumoo.strefa.pl-inf-20170129-164230-iock3.json 259 download   job
www.ethshar.com-inf-20170130-102820-3caa9.json 243 download   job
www.facebook.com-inf-20170130-181152-9cglx.json 278 download   job
www.farrellworlds.com-inf-20170130-102840-ayu75.json 249 download   job
www.fightcyberstalking.org-inf-20170129-074532-d1ys0.json 257 download   job
www.foulpapers.com-inf-20170130-103426-f3joq-00000.warc.gz 22064875 download   job
www.foulpapers.com-inf-20170130-103426-f3joq-aborted-meta.warc.gz 4319 download   job
www.foulpapers.com-inf-20170130-103426-f3joq-aborted.json 245 download   job
www.foulpapers.com-inf-20170130-103426-f3joq-meta.warc.gz 268213 download   job
www.future-classics.org-inf-20170130-110534-32hok.json 251 download   job
www.gov.uk-shallow-20170129-195227-bw2wa.json 288 download   job
www.heisanevilgenius.com-inf-20170128-160824-djhwm.json 253 download   job
www.helixsf.com-inf-20170130-110909-1gvgh-00000.warc.gz 332772914 download   job
www.helixsf.com-inf-20170130-110909-1gvgh-meta.warc.gz 1058396 download   job
www.hitthosekeys.com-inf-20170130-110938-ccd7d.json 248 download   job
www.hmv.ca-inf-20170128-130840-4dkwc.json 241 download   job
www.independent-thinking.com-inf-20170129-080902-5hopm-00000.warc.gz 155737262 download   job
www.independent-thinking.com-inf-20170129-080902-5hopm-meta.warc.gz 163286 download   job
www.independent-thinking.com-inf-20170129-080902-5hopm.json 258 download   job
www.independent.co.uk-shallow-20170130-165929-cpjq8.json 396 download   job
www.independent.ie-shallow-20170128-215746-913dx-00000.warc.gz 2879992 download   job
www.independent.ie-shallow-20170128-215746-913dx-meta.warc.gz 11625 download   job
www.independent.ie-shallow-20170128-215746-913dx.json 351 download   job
www.jackchalker.com-inf-20170130-173944-7r0at.json 247 download   job
www.kmonos.net-inf-20170129-005610-6e57p-00000.warc.gz 5725590381 download   job
www.kmonos.net-inf-20170129-005610-6e57p-00001.warc.gz 5760307725 download   job
www.lgbtqnation.com-shallow-20170130-201319-c95vk.json 320 download   job
www.midnightspecial.net-inf-20170128-232411-1a43f-00000.warc.gz 101234977 download   job
www.midnightspecial.net-inf-20170128-232411-1a43f-meta.warc.gz 103276 download   job
www.midnightspecial.net-inf-20170128-232411-1a43f.json 251 download   job
www.mormonfileleaks.com-inf-20170129-073208-7ozty-00000.warc.gz 438494703 download   job
www.mormonfileleaks.com-inf-20170129-073208-7ozty-meta.warc.gz 176635 download   job
www.mormonfileleaks.com-inf-20170129-073208-7ozty.json 253 download   job
www.newyorker.com-shallow-20170129-125030-ckgoa.json 330 download   job
www.nlg.org-inf-20170128-232759-dhcjd.json 240 download   job
www.npr.org-shallow-20170129-132821-63c2r.json 332 download   job
www.reddit.com-shallow-20170128-192451-9d6mz-00000.warc.gz 2015808 download   job
www.reddit.com-shallow-20170128-192451-9d6mz-meta.warc.gz 8984 download   job
www.reddit.com-shallow-20170128-192451-9d6mz.json 330 download   job
www.reddit.com-shallow-20170128-192457-erokb-00000.warc.gz 2015225 download   job
www.reddit.com-shallow-20170128-192457-erokb-meta.warc.gz 8717 download   job
www.reddit.com-shallow-20170128-192457-erokb.json 306 download   job
www.retrolordi.com-shallow-20170129-174336-30jjx.json 292 download   job
www.sciencealert.com-inf-20170128-234944-4pp5i-00000.warc.gz 5368711447 download   job
www.sciencealert.com-inf-20170128-234944-4pp5i-00001.warc.gz 5371276797 download   job
www.sciencealert.com-inf-20170128-234944-4pp5i-00002.warc.gz 658293352 download   job
www.sciencealert.com-inf-20170128-234944-4pp5i.json 248 download   job
www.sff.net-inf-20170130-090344-2dmkp.json 244 download   job
www.shortshortshort.com-inf-20170130-090852-3fho7.json 251 download   job
www.sizecoding.org-inf-20170129-102041-9tj3n.json 245 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00022.warc.gz 5372476031 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00023.warc.gz 5458369485 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00024.warc.gz 5401097095 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00025.warc.gz 5395980780 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00026.warc.gz 5588041014 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00027.warc.gz 4652863470 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-meta.warc.gz 16089968 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze.json 250 download   job
www.technologyreview.com-shallow-20170130-183238-85j0o.json 307 download   job
www.theclarionfoundation.org-inf-20170130-091531-7788i.json 256 download   job
www.theguardian.com-shallow-20170129-184303-4fzxy.json 317 download   job
www.thelocal.at-shallow-20170129-054435-3w3uh.json 306 download   job
www.theregister.co.uk-shallow-20170130-200803-eyfxd.json 319 download   job
www.theverge.com-shallow-20170129-053452-bdmcw.json 309 download   job
www.vnend.net-inf-20170130-091817-6vrdp.json 241 download   job
www.vox.com-shallow-20170130-165309-6x48o.json 308 download   job
www.washingtonpost.com-shallow-20170130-175758-79yfi.json 373 download   job
www.whitehouserealitycheck.com-shallow-20170128-221247-9rt42.json 287 download   job
www.wsj.com-shallow-20170128-224826-4cw6d-00000.warc.gz 1198316 download   job
www.wsj.com-shallow-20170128-224826-4cw6d-meta.warc.gz 6646 download   job
www.wsj.com-shallow-20170128-224826-4cw6d.json 346 download   job
www.yokotayu.com-inf-20170129-115127-6o1q5.json 246 download   job
www.youtube.com-shallow-20170129-032513-dtqzg.json 264 download   job
www.youtube.com-shallow-20170129-182354-86mwb.json 267 download   job
www.youtube.com-shallow-20170129-232226-7cazh.json 267 download   job
www.zuggsoft.com-inf-20170130-201831-uc28k-00000.warc.gz 2279012 download   job
www.zuggsoft.com-inf-20170130-201831-uc28k.json 248 download   job