Filename Size (bytes) Links
aa-shallow-20170113-163143-71tbq-00000.warc.gz 2472 download   job
aa-shallow-20170113-163143-71tbq-00000.warc.os.cdx.gz 47 download
aa-shallow-20170113-163143-71tbq.json 314 download   job
abcnews.go.com-shallow-20170126-075652-acg79.json 325 download   job
afo.wtrpg.com-shallow-20170124-142727-873xh.json 245 download   job
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00000.warc.gz 5438529237 download   job
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00000.warc.os.cdx.gz 19264 download
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00001.warc.gz 6781580758 download   job
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00001.warc.os.cdx.gz 6307 download
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00002.warc.gz 9252172090 download   job
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00002.warc.os.cdx.gz 43968 download
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00003.warc.gz 5692947223 download   job
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00003.warc.os.cdx.gz 8702 download
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00004.warc.gz 1310577356 download   job
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-00004.warc.os.cdx.gz 1642 download
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-meta.warc.gz 49287 download   job
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7-meta.warc.os.cdx.gz 47 download
aftp.cmdl.noaa.gov-inf-20170125-175731-8fmz7.json 245 download   job
agatcomp.ru-inf-20170125-004656-4lslw.json 240 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00018.warc.gz 5370928001 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00018.warc.os.cdx.gz 1014387 download
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00019.warc.gz 5371296732 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00019.warc.os.cdx.gz 1199424 download
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00020.warc.gz 5371290180 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00020.warc.os.cdx.gz 1370945 download
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00021.warc.gz 5370533453 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00021.warc.os.cdx.gz 1301276 download
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00022.warc.gz 5370191780 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00022.warc.os.cdx.gz 1145304 download
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00023.warc.gz 5373261955 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00023.warc.os.cdx.gz 942220 download
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00024.warc.gz 5370198405 download   job
agenciabrasil.ebc.com.br-inf-20161227-164409-8jz5a-00024.warc.os.cdx.gz 1035192 download
ampd.epa.gov-inf-20170124-160937-bo0b0-aborted-00000.warc.gz 662810 download   job
ampd.epa.gov-inf-20170124-160937-bo0b0-aborted-00000.warc.os.cdx.gz 2911 download
ampd.epa.gov-inf-20170124-160937-bo0b0-aborted.json 242 download   job
ampd.epa.gov-inf-20170124-161720-bo0b0.json 243 download   job
annotatedtrump.com-inf-20170126-193430-b1gvd-00000.warc.gz 2884033123 download   job
annotatedtrump.com-inf-20170126-193430-b1gvd-00000.warc.os.cdx.gz 1288218 download
annotatedtrump.com-inf-20170126-193430-b1gvd-meta.warc.gz 799650 download   job
annotatedtrump.com-inf-20170126-193430-b1gvd-meta.warc.os.cdx.gz 47 download
annotatedtrump.com-inf-20170126-193430-b1gvd.json 246 download   job
antidotezine.com-shallow-20170126-024445-f39u1.json 278 download   job
apnews.com-shallow-20170126-171850-blvxr-00000.warc.gz 2168171 download   job
apnews.com-shallow-20170126-171850-blvxr-00000.warc.os.cdx.gz 10539 download
apnews.com-shallow-20170126-171850-blvxr-meta.warc.gz 9262 download   job
apnews.com-shallow-20170126-171850-blvxr-meta.warc.os.cdx.gz 47 download
apnews.com-shallow-20170126-171850-blvxr.json 275 download   job
archiveteam_archivebot_go_20170126210001.cdx.gz 62833955 download
archiveteam_archivebot_go_20170126210001.cdx.idx 65285 download
archiveteam_archivebot_go_20170126210001_archive.torrent 29194 download
archiveteam_archivebot_go_20170126210001_files.xml 0 download
archiveteam_archivebot_go_20170126210001_meta.sqlite 13312 download
archiveteam_archivebot_go_20170126210001_meta.xml 793 download
assets.documentcloud.org-shallow-20170125-230552-6gz72.json 325 download   job
attainsprod.epa.gov-shallow-20170125-010716-ewb3h.json 253 download   job
bermanphotos.wordpress.com-shallow-20170125-035312-96moi.json 305 download   job
blog.cletile.com-inf-20170123-193143-eai7p.json 252 download   job
blog.recurity-labs.com-shallow-20170126-142348-5scut-00000.warc.gz 14189 download   job
blog.recurity-labs.com-shallow-20170126-142348-5scut-00000.warc.os.cdx.gz 503 download
blog.recurity-labs.com-shallow-20170126-142348-5scut-meta.warc.gz 3676 download   job
blog.recurity-labs.com-shallow-20170126-142348-5scut-meta.warc.os.cdx.gz 47 download
blog.recurity-labs.com-shallow-20170126-142348-5scut.json 302 download   job
blogs.wsj.com-shallow-20170126-002250-2sry3.json 295 download   job
cdn0.vox-cdn.com-shallow-20170126-025244-50cks.json 356 download   job
cdn0.vox-cdn.com-shallow-20170126-025411-askrc.json 390 download   job
cdn2.vox-cdn.com-shallow-20170126-025328-cllye.json 334 download   job
cdn3.vox-cdn.com-shallow-20170126-025440-53ni9.json 395 download   job
cleanpowerplanmaps.epa.gov-inf-20170124-151001-9b8zv.json 255 download   job
cleanpowerplanmaps.epa.gov-inf-20170124-160413-2in33.json 270 download   job
climatetorrent.com-inf-20170126-070245-433l8.json 245 download   job
cnmnewz.com-inf-20170123-202621-6gfv2.json 239 download   job
contribute.globalchange.gov-inf-20170125-180833-d4bex.json 258 download   job
council.seattle.gov-inf-20170121-035439-6c3co.json 249 download   job
crooksandliars.com-shallow-20170126-025205-cyc0h.json 298 download   job
desu.sh-inf-20170124-093014-epivp.json 238 download   job
dl.dropboxusercontent.com-shallow-20170126-174141-f0yx8.json 291 download   job
enterthestream.wordpress.com-inf-20170126-040036-4g3pf.json 259 download   job
esforces.com-inf-20170124-195401-7fphn.json 236 download   job
example.com-shallow-20170124-085436-9tfs9.json 239 download   job
finance.yahoo.com-shallow-20170124-175920-56wp0.json 307 download   job
fn-55.blogspot.com-inf-20170125-163118-ck95e.json 246 download   job
frontnational81.over-blog.com-inf-20170125-154000-gdsf9.json 257 download   job
github.com-shallow-20170126-022803-2q8yc.json 276 download   job
github.com-shallow-20170126-022823-6ufa1.json 281 download   job
glenngreenwald.net-inf-20170125-172827-88lnm-00000.warc.gz 127512564 download   job
glenngreenwald.net-inf-20170125-172827-88lnm-00000.warc.os.cdx.gz 159439 download
glenngreenwald.net-inf-20170125-172827-88lnm-meta.warc.gz 98673 download   job
glenngreenwald.net-inf-20170125-172827-88lnm-meta.warc.os.cdx.gz 47 download
glenngreenwald.net-inf-20170125-172827-88lnm.json 248 download   job
ham-radio.com-inf-20170124-124846-6o62s.json 240 download   job
hollaforums.com-inf-20170112-143240-c2g3o-00033.warc.gz 799824627 download   job
hollaforums.com-inf-20170112-143240-c2g3o-00033.warc.os.cdx.gz 1396238 download
hollaforums.com-inf-20170112-143240-c2g3o.json 243 download   job
humanstxt.org-inf-20170124-073542-59bm8.json 243 download   job
huwieler.net-shallow-20170124-071220-37uvh.json 294 download   job
i.reddituploads.com-shallow-20170124-232503-8ac3e.json 343 download   job
imgur.com-shallow-20170126-051121-l6tsu.json 249 download   job
jenkins.cyanogenmod.org-shallow-20170126-175019-9hj5r-00000.warc.gz 635308 download   job
jenkins.cyanogenmod.org-shallow-20170126-175019-9hj5r-00000.warc.os.cdx.gz 6891 download
jenkins.cyanogenmod.org-shallow-20170126-175019-9hj5r-meta.warc.gz 7399 download   job
jenkins.cyanogenmod.org-shallow-20170126-175019-9hj5r-meta.warc.os.cdx.gz 47 download
jenkins.cyanogenmod.org-shallow-20170126-175019-9hj5r.json 258 download   job
jenkins.lineageos.org-shallow-20170126-174912-74f8o-00000.warc.gz 536729 download   job
jenkins.lineageos.org-shallow-20170126-174912-74f8o-00000.warc.os.cdx.gz 6541 download
jenkins.lineageos.org-shallow-20170126-174912-74f8o-meta.warc.gz 7143 download   job
jenkins.lineageos.org-shallow-20170126-174912-74f8o-meta.warc.os.cdx.gz 47 download
jenkins.lineageos.org-shallow-20170126-174912-74f8o.json 256 download   job
kern.punkto.info-shallow-20170126-002502-79e8f.json 282 download   job
lists.gnu.org-shallow-20170124-160008-2cxcb.json 294 download   job
m.huffpost.com-shallow-20170126-162122-ci70l.json 262 download   job
market-ticker.org-inf-20170126-060107-9l8oj-00000.warc.gz 1838756729 download   job
market-ticker.org-inf-20170126-060107-9l8oj-00000.warc.os.cdx.gz 1090804 download
market-ticker.org-inf-20170126-060107-9l8oj-meta.warc.gz 873122 download   job
market-ticker.org-inf-20170126-060107-9l8oj-meta.warc.os.cdx.gz 47 download
market-ticker.org-inf-20170126-060107-9l8oj.json 248 download   job
match.globalchange.gov-inf-20170125-180622-99acf.json 253 download   job
media.greenpeace.org-shallow-20170126-045906-16c3j.json 276 download   job
mediamatters.org-shallow-20170126-172055-22rje.json 350 download   job
medium.com-shallow-20170124-165230-49uj5.json 320 download   job
mpcdot.com-inf-20170124-173727-6olxb-aborted-00000.warc.gz 69997722 download   job
mpcdot.com-inf-20170124-173727-6olxb-aborted-00000.warc.os.cdx.gz 309046 download
mpcdot.com-inf-20170124-173727-6olxb-aborted.json 245 download   job
nao.usem.xyz-inf-20170126-002553-6epe7-00000.warc.gz 4103215 download   job
nao.usem.xyz-inf-20170126-002553-6epe7-00000.warc.os.cdx.gz 5454 download
nao.usem.xyz-inf-20170126-002553-6epe7-meta.warc.gz 6409 download   job
nao.usem.xyz-inf-20170126-002553-6epe7-meta.warc.os.cdx.gz 47 download
nao.usem.xyz-inf-20170126-002553-6epe7.json 242 download   job
networkagainstpsychiatricassault.org-inf-20170126-064457-42lf6.json 266 download   job
nighto.net-inf-20170124-094059-8i2s4.json 240 download   job
nodewebrss.epa.gov-inf-20170124-150921-3gzm3.json 247 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00006.warc.gz 5372220629 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00006.warc.os.cdx.gz 5511891 download
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00007.warc.gz 5368709247 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00007.warc.os.cdx.gz 3314617 download
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00008.warc.gz 5387638825 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00008.warc.os.cdx.gz 5148642 download
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00009.warc.gz 5372387404 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00009.warc.os.cdx.gz 4799815 download
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00010.warc.gz 5369736615 download   job
obamawhitehouse.archives.gov-inf-20170120-213554-2747t-00010.warc.os.cdx.gz 4221125 download
police.uw.edu-shallow-20170125-172537-39stt-00000.warc.gz 1665815 download   job
police.uw.edu-shallow-20170125-172537-39stt-00000.warc.os.cdx.gz 6725 download
police.uw.edu-shallow-20170125-172537-39stt-meta.warc.gz 7005 download   job
police.uw.edu-shallow-20170125-172537-39stt-meta.warc.os.cdx.gz 47 download
police.uw.edu-shallow-20170125-172537-39stt.json 280 download   job
purple.com-shallow-20170124-174446-1wfso.json 244 download   job
redstatewatcher.com-inf-20170123-031549-6mtun.json 249 download   job
research.trust.salesforce.com-shallow-20170126-045146-7m81b.json 335 download   job
research.trust.salesforce.com-shallow-20170126-045241-y04vt.json 335 download   job
review.globalchange.gov-inf-20170125-180759-2o3xo.json 254 download   job
rsc.walker.house.gov-shallow-20170124-074814-ea6lm.json 316 download   job
sezonoj.ru-inf-20170123-151758-dw7pw.json 240 download   job
shop.surfingmagazine.com-inf-20170126-181409-e3rze-00000.warc.gz 1459776 download   job
shop.surfingmagazine.com-inf-20170126-181409-e3rze-00000.warc.os.cdx.gz 4598 download
shop.surfingmagazine.com-inf-20170126-181409-e3rze-meta.warc.gz 6246 download   job
shop.surfingmagazine.com-inf-20170126-181409-e3rze-meta.warc.os.cdx.gz 47 download
shop.surfingmagazine.com-inf-20170126-181409-e3rze.json 255 download   job
stuff.milkywan.xyz-inf-20170125-193650-d34cn.json 263 download   job
t.co-shallow-20170126-164153-4aa7y-00000.warc.gz 4419741 download   job
t.co-shallow-20170126-164153-4aa7y-00000.warc.os.cdx.gz 338 download
t.co-shallow-20170126-164153-4aa7y-meta.warc.gz 3227 download   job
t.co-shallow-20170126-164153-4aa7y-meta.warc.os.cdx.gz 47 download
t.co-shallow-20170126-164153-4aa7y.json 243 download   job
t.co-shallow-20170126-164203-co3kd-00000.warc.gz 3896 download   job
t.co-shallow-20170126-164203-co3kd-00000.warc.os.cdx.gz 209 download
t.co-shallow-20170126-164203-co3kd-meta.warc.gz 3412 download   job
t.co-shallow-20170126-164203-co3kd-meta.warc.os.cdx.gz 47 download
t.co-shallow-20170126-164203-co3kd.json 243 download   job
talkingpointsmemo.com-shallow-20170125-203342-3v3yk.json 294 download   job
theheartysoul.com-shallow-20170126-165448-9tfwt.json 274 download   job
thephotobrigade.com-shallow-20170125-035223-958zb.json 357 download   job
time.com-shallow-20170124-135055-7ae0z.json 287 download   job
time.com-shallow-20170126-070306-66mh5.json 299 download   job
transition.fcc.gov-shallow-20170126-010131-citkv-00000.warc.gz 61519 download   job
transition.fcc.gov-shallow-20170126-010131-citkv-00000.warc.os.cdx.gz 273 download
transition.fcc.gov-shallow-20170126-010131-citkv-meta.warc.gz 3236 download   job
transition.fcc.gov-shallow-20170126-010131-citkv-meta.warc.os.cdx.gz 47 download
transition.fcc.gov-shallow-20170126-010131-citkv.json 307 download   job
trilhatranscarioca.com.br-inf-20170124-093102-56ggk.json 255 download   job
twitter.com-inf-20170122-204357-azr3x-00008.warc.gz 1561049888 download   job
twitter.com-inf-20170122-204357-azr3x-00008.warc.os.cdx.gz 1775037 download
twitter.com-inf-20170122-204357-azr3x.json 254 download   job
twitter.com-inf-20170124-153443-ajlq4.json 253 download   job
twitter.com-inf-20170125-205805-c0qtl.json 257 download   job
twitter.com-inf-20170126-033042-4rh3p.json 255 download   job
twitter.com-inf-20170126-035740-64wxs.json 257 download   job
twitter.com-inf-20170126-040701-29cy3.json 265 download   job
twitter.com-inf-20170126-040748-b2jiv.json 253 download   job
twitter.com-inf-20170126-144011-enir1-00000.warc.gz 255225488 download   job
twitter.com-inf-20170126-144011-enir1-00000.warc.os.cdx.gz 299311 download
twitter.com-inf-20170126-144011-enir1-meta.warc.gz 304961 download   job
twitter.com-inf-20170126-144011-enir1-meta.warc.os.cdx.gz 47 download
twitter.com-inf-20170126-144011-enir1.json 252 download   job
twitter.com-shallow-20170124-073817-4ci36.json 261 download   job
twitter.com-shallow-20170124-080722-10r49.json 257 download   job
twitter.com-shallow-20170124-084229-4l67w.json 285 download   job
twitter.com-shallow-20170124-173233-4psa1.json 260 download   job
twitter.com-shallow-20170124-174211-cw8h8.json 278 download   job
twitter.com-shallow-20170124-174530-28ry9.json 283 download   job
twitter.com-shallow-20170124-174852-e7q7x.json 262 download   job
twitter.com-shallow-20170124-185222-73506.json 283 download   job
twitter.com-shallow-20170124-193629-12nox.json 261 download   job
twitter.com-shallow-20170124-193952-60pv1.json 286 download   job
twitter.com-shallow-20170124-202324-1vr80.json 280 download   job
twitter.com-shallow-20170125-001212-ej4ue.json 279 download   job
twitter.com-shallow-20170125-001232-dbcqb.json 279 download   job
twitter.com-shallow-20170125-015841-a2dcd.json 278 download   job
twitter.com-shallow-20170125-024917-70421.json 260 download   job
twitter.com-shallow-20170125-035635-a643t.json 256 download   job
twitter.com-shallow-20170125-035753-660ft.json 254 download   job
twitter.com-shallow-20170125-041609-c6c15.json 260 download   job
twitter.com-shallow-20170125-062242-2gu6b.json 256 download   job
twitter.com-shallow-20170125-062401-4cry4.json 254 download   job
twitter.com-shallow-20170125-064840-8ywda.json 258 download   job
twitter.com-shallow-20170125-071926-4gfdt.json 258 download   job
twitter.com-shallow-20170125-073647-1orp2.json 257 download   job
twitter.com-shallow-20170125-074201-aflzx.json 260 download   job
twitter.com-shallow-20170125-230033-5qw37.json 249 download   job
twitter.com-shallow-20170125-230446-demo2-00000.warc.gz 39976 download   job
twitter.com-shallow-20170125-230446-demo2-00000.warc.os.cdx.gz 219 download
twitter.com-shallow-20170125-230446-demo2-meta.warc.gz 5297 download   job
twitter.com-shallow-20170125-230446-demo2-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230446-demo2.json 255 download   job
twitter.com-shallow-20170125-230536-astlw.json 253 download   job
twitter.com-shallow-20170125-230546-7iynw-00000.warc.gz 43537 download   job
twitter.com-shallow-20170125-230546-7iynw-00000.warc.os.cdx.gz 218 download
twitter.com-shallow-20170125-230546-7iynw-meta.warc.gz 5165 download   job
twitter.com-shallow-20170125-230546-7iynw-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230546-7iynw.json 254 download   job
twitter.com-shallow-20170125-230556-1rer6-00000.warc.gz 2642718 download   job
twitter.com-shallow-20170125-230556-1rer6-00000.warc.os.cdx.gz 5335 download
twitter.com-shallow-20170125-230556-1rer6-meta.warc.gz 6341 download   job
twitter.com-shallow-20170125-230556-1rer6-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230556-1rer6.json 256 download   job
twitter.com-shallow-20170125-230630-1n0f3-00000.warc.gz 35794 download   job
twitter.com-shallow-20170125-230630-1n0f3-00000.warc.os.cdx.gz 212 download
twitter.com-shallow-20170125-230630-1n0f3-meta.warc.gz 5044 download   job
twitter.com-shallow-20170125-230630-1n0f3-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230630-1n0f3.json 254 download   job
twitter.com-shallow-20170125-230639-6j9fr-00000.warc.gz 42640 download   job
twitter.com-shallow-20170125-230639-6j9fr-00000.warc.os.cdx.gz 219 download
twitter.com-shallow-20170125-230639-6j9fr-meta.warc.gz 5581 download   job
twitter.com-shallow-20170125-230639-6j9fr-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230639-6j9fr.json 255 download   job
twitter.com-shallow-20170125-230654-rjw7o-00000.warc.gz 39433 download   job
twitter.com-shallow-20170125-230654-rjw7o-00000.warc.os.cdx.gz 225 download
twitter.com-shallow-20170125-230654-rjw7o-meta.warc.gz 5053 download   job
twitter.com-shallow-20170125-230654-rjw7o-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230654-rjw7o.json 260 download   job
twitter.com-shallow-20170125-230755-706vo.json 255 download   job
twitter.com-shallow-20170125-230803-6pwk2-00000.warc.gz 48753 download   job
twitter.com-shallow-20170125-230803-6pwk2-00000.warc.os.cdx.gz 213 download
twitter.com-shallow-20170125-230803-6pwk2-meta.warc.gz 5199 download   job
twitter.com-shallow-20170125-230803-6pwk2-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230803-6pwk2.json 252 download   job
twitter.com-shallow-20170125-230830-6x9ul-00000.warc.gz 48945 download   job
twitter.com-shallow-20170125-230830-6x9ul-00000.warc.os.cdx.gz 217 download
twitter.com-shallow-20170125-230830-6x9ul-meta.warc.gz 5064 download   job
twitter.com-shallow-20170125-230830-6x9ul-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230830-6x9ul.json 256 download   job
twitter.com-shallow-20170125-230849-32mlh-00000.warc.gz 43731 download   job
twitter.com-shallow-20170125-230849-32mlh-00000.warc.os.cdx.gz 223 download
twitter.com-shallow-20170125-230849-32mlh-meta.warc.gz 5189 download   job
twitter.com-shallow-20170125-230849-32mlh-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230849-32mlh.json 258 download   job
twitter.com-shallow-20170125-230914-9j4wr.json 252 download   job
twitter.com-shallow-20170125-230925-13ivq-00000.warc.gz 50229 download   job
twitter.com-shallow-20170125-230925-13ivq-00000.warc.os.cdx.gz 217 download
twitter.com-shallow-20170125-230925-13ivq-meta.warc.gz 5048 download   job
twitter.com-shallow-20170125-230925-13ivq-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230925-13ivq.json 259 download   job
twitter.com-shallow-20170125-230935-4n740-00000.warc.gz 44056 download   job
twitter.com-shallow-20170125-230935-4n740-00000.warc.os.cdx.gz 216 download
twitter.com-shallow-20170125-230935-4n740-meta.warc.gz 5090 download   job
twitter.com-shallow-20170125-230935-4n740-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-230935-4n740.json 255 download   job
twitter.com-shallow-20170125-231006-7dqt2-00000.warc.gz 42004 download   job
twitter.com-shallow-20170125-231006-7dqt2-00000.warc.os.cdx.gz 224 download
twitter.com-shallow-20170125-231006-7dqt2-meta.warc.gz 5072 download   job
twitter.com-shallow-20170125-231006-7dqt2-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231006-7dqt2.json 261 download   job
twitter.com-shallow-20170125-231026-402jj-00000.warc.gz 60224 download   job
twitter.com-shallow-20170125-231026-402jj-00000.warc.os.cdx.gz 220 download
twitter.com-shallow-20170125-231026-402jj-meta.warc.gz 5171 download   job
twitter.com-shallow-20170125-231026-402jj-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231026-402jj.json 255 download   job
twitter.com-shallow-20170125-231109-6tl2v-00000.warc.gz 43828 download   job
twitter.com-shallow-20170125-231109-6tl2v-00000.warc.os.cdx.gz 226 download
twitter.com-shallow-20170125-231109-6tl2v-meta.warc.gz 5062 download   job
twitter.com-shallow-20170125-231109-6tl2v-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231109-6tl2v.json 261 download   job
twitter.com-shallow-20170125-231129-cbp3m-00000.warc.gz 42885 download   job
twitter.com-shallow-20170125-231129-cbp3m-00000.warc.os.cdx.gz 244 download
twitter.com-shallow-20170125-231129-cbp3m-meta.warc.gz 5109 download   job
twitter.com-shallow-20170125-231129-cbp3m-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231129-cbp3m.json 286 download   job
twitter.com-shallow-20170125-231139-4z2tp-00000.warc.gz 43489 download   job
twitter.com-shallow-20170125-231139-4z2tp-00000.warc.os.cdx.gz 245 download
twitter.com-shallow-20170125-231139-4z2tp-meta.warc.gz 5110 download   job
twitter.com-shallow-20170125-231139-4z2tp-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231139-4z2tp.json 286 download   job
twitter.com-shallow-20170125-231822-dst9q-00000.warc.gz 41171 download   job
twitter.com-shallow-20170125-231822-dst9q-00000.warc.os.cdx.gz 242 download
twitter.com-shallow-20170125-231822-dst9q-meta.warc.gz 5068 download   job
twitter.com-shallow-20170125-231822-dst9q-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231822-dst9q.json 286 download   job
twitter.com-shallow-20170125-231901-dos5w-00000.warc.gz 44019 download   job
twitter.com-shallow-20170125-231901-dos5w-00000.warc.os.cdx.gz 217 download
twitter.com-shallow-20170125-231901-dos5w-meta.warc.gz 5254 download   job
twitter.com-shallow-20170125-231901-dos5w-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231901-dos5w.json 253 download   job
twitter.com-shallow-20170125-231915-dep3z-00000.warc.gz 51350 download   job
twitter.com-shallow-20170125-231915-dep3z-00000.warc.os.cdx.gz 213 download
twitter.com-shallow-20170125-231915-dep3z-meta.warc.gz 5227 download   job
twitter.com-shallow-20170125-231915-dep3z-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231915-dep3z.json 251 download   job
twitter.com-shallow-20170125-231923-5c9na-00000.warc.gz 43577 download   job
twitter.com-shallow-20170125-231923-5c9na-00000.warc.os.cdx.gz 223 download
twitter.com-shallow-20170125-231923-5c9na-meta.warc.gz 5081 download   job
twitter.com-shallow-20170125-231923-5c9na-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231923-5c9na.json 260 download   job
twitter.com-shallow-20170125-231932-68eer-00000.warc.gz 61732 download   job
twitter.com-shallow-20170125-231932-68eer-00000.warc.os.cdx.gz 225 download
twitter.com-shallow-20170125-231932-68eer-meta.warc.gz 5085 download   job
twitter.com-shallow-20170125-231932-68eer-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-231932-68eer.json 262 download   job
twitter.com-shallow-20170125-232006-eo0in-00000.warc.gz 47055 download   job
twitter.com-shallow-20170125-232006-eo0in-00000.warc.os.cdx.gz 217 download
twitter.com-shallow-20170125-232006-eo0in-meta.warc.gz 4997 download   job
twitter.com-shallow-20170125-232006-eo0in-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-232006-eo0in.json 260 download   job
twitter.com-shallow-20170125-232055-cg6ml-00000.warc.gz 51037 download   job
twitter.com-shallow-20170125-232055-cg6ml-00000.warc.os.cdx.gz 220 download
twitter.com-shallow-20170125-232055-cg6ml-meta.warc.gz 5074 download   job
twitter.com-shallow-20170125-232055-cg6ml-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-232055-cg6ml.json 258 download   job
twitter.com-shallow-20170125-232121-2zefw-00000.warc.gz 63820 download   job
twitter.com-shallow-20170125-232121-2zefw-00000.warc.os.cdx.gz 214 download
twitter.com-shallow-20170125-232121-2zefw-meta.warc.gz 5197 download   job
twitter.com-shallow-20170125-232121-2zefw-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-232121-2zefw.json 252 download   job
twitter.com-shallow-20170125-232217-6ayn4-00000.warc.gz 40473 download   job
twitter.com-shallow-20170125-232217-6ayn4-00000.warc.os.cdx.gz 217 download
twitter.com-shallow-20170125-232217-6ayn4-meta.warc.gz 5054 download   job
twitter.com-shallow-20170125-232217-6ayn4-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-232217-6ayn4.json 255 download   job
twitter.com-shallow-20170125-232229-79fx0-00000.warc.gz 44209 download   job
twitter.com-shallow-20170125-232229-79fx0-00000.warc.os.cdx.gz 214 download
twitter.com-shallow-20170125-232229-79fx0-meta.warc.gz 5053 download   job
twitter.com-shallow-20170125-232229-79fx0-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-232229-79fx0.json 255 download   job
twitter.com-shallow-20170125-232252-4ptm4-00000.warc.gz 40378 download   job
twitter.com-shallow-20170125-232252-4ptm4-00000.warc.os.cdx.gz 218 download
twitter.com-shallow-20170125-232252-4ptm4-meta.warc.gz 5057 download   job
twitter.com-shallow-20170125-232252-4ptm4-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170125-232252-4ptm4.json 257 download   job
twitter.com-shallow-20170126-000030-860l7-00000.warc.gz 37194 download   job
twitter.com-shallow-20170126-000030-860l7-00000.warc.os.cdx.gz 226 download
twitter.com-shallow-20170126-000030-860l7-meta.warc.gz 5018 download   job
twitter.com-shallow-20170126-000030-860l7-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-000030-860l7.json 274 download   job
twitter.com-shallow-20170126-000249-as26w-00000.warc.gz 42923 download   job
twitter.com-shallow-20170126-000249-as26w-00000.warc.os.cdx.gz 209 download
twitter.com-shallow-20170126-000249-as26w-meta.warc.gz 5152 download   job
twitter.com-shallow-20170126-000249-as26w-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-000249-as26w.json 254 download   job
twitter.com-shallow-20170126-000259-d3wq5.json 255 download   job
twitter.com-shallow-20170126-000309-cwmg3-00000.warc.gz 40181 download   job
twitter.com-shallow-20170126-000309-cwmg3-00000.warc.os.cdx.gz 216 download
twitter.com-shallow-20170126-000309-cwmg3-meta.warc.gz 5061 download   job
twitter.com-shallow-20170126-000309-cwmg3-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-000309-cwmg3.json 253 download   job
twitter.com-shallow-20170126-000315-cee78.json 254 download   job
twitter.com-shallow-20170126-000338-bd3xi-00000.warc.gz 47976 download   job
twitter.com-shallow-20170126-000338-bd3xi-00000.warc.os.cdx.gz 218 download
twitter.com-shallow-20170126-000338-bd3xi-meta.warc.gz 5007 download   job
twitter.com-shallow-20170126-000338-bd3xi-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-000338-bd3xi.json 260 download   job
twitter.com-shallow-20170126-000350-bultm.json 254 download   job
twitter.com-shallow-20170126-000401-d056l-00000.warc.gz 44599 download   job
twitter.com-shallow-20170126-000401-d056l-00000.warc.os.cdx.gz 211 download
twitter.com-shallow-20170126-000401-d056l-meta.warc.gz 5103 download   job
twitter.com-shallow-20170126-000401-d056l-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-000401-d056l.json 254 download   job
twitter.com-shallow-20170126-000449-7mi8a-00000.warc.gz 54664 download   job
twitter.com-shallow-20170126-000449-7mi8a-00000.warc.os.cdx.gz 213 download
twitter.com-shallow-20170126-000449-7mi8a-meta.warc.gz 5221 download   job
twitter.com-shallow-20170126-000449-7mi8a-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-000449-7mi8a.json 252 download   job
twitter.com-shallow-20170126-001012-9xrpi.json 255 download   job
twitter.com-shallow-20170126-002129-5r32j.json 260 download   job
twitter.com-shallow-20170126-010341-5x76x.json 251 download   job
twitter.com-shallow-20170126-010413-75lhv.json 260 download   job
twitter.com-shallow-20170126-010447-4pk9i.json 262 download   job
twitter.com-shallow-20170126-010538-9sogo.json 251 download   job
twitter.com-shallow-20170126-010556-39j8k.json 260 download   job
twitter.com-shallow-20170126-011146-2rxea.json 266 download   job
twitter.com-shallow-20170126-013940-c1aqk.json 282 download   job
twitter.com-shallow-20170126-014920-ee30f.json 260 download   job
twitter.com-shallow-20170126-020019-c6yeq.json 262 download   job
twitter.com-shallow-20170126-020148-5jhqs.json 257 download   job
twitter.com-shallow-20170126-020252-cpqqu.json 260 download   job
twitter.com-shallow-20170126-020313-76f62.json 259 download   job
twitter.com-shallow-20170126-020519-73w5s.json 255 download   job
twitter.com-shallow-20170126-020539-5u0mx.json 262 download   job
twitter.com-shallow-20170126-020911-9xx63.json 261 download   job
twitter.com-shallow-20170126-021819-622db.json 261 download   job
twitter.com-shallow-20170126-021922-bv5fy.json 274 download   job
twitter.com-shallow-20170126-023621-cnydo.json 257 download   job
twitter.com-shallow-20170126-024141-emfrv.json 281 download   job
twitter.com-shallow-20170126-024201-993j3.json 272 download   job
twitter.com-shallow-20170126-024337-cgfrt.json 286 download   job
twitter.com-shallow-20170126-024829-ergs6.json 257 download   job
twitter.com-shallow-20170126-025556-cv97e.json 257 download   job
twitter.com-shallow-20170126-030125-b6syd.json 260 download   job
twitter.com-shallow-20170126-030147-c0eli.json 253 download   job
twitter.com-shallow-20170126-030709-n5v6t.json 258 download   job
twitter.com-shallow-20170126-031245-2fh2e.json 256 download   job
twitter.com-shallow-20170126-031311-dh3kp.json 260 download   job
twitter.com-shallow-20170126-031642-eo6tc.json 262 download   job
twitter.com-shallow-20170126-031848-3isfi.json 261 download   job
twitter.com-shallow-20170126-033627-4ci36.json 261 download   job
twitter.com-shallow-20170126-033853-3gzth.json 259 download   job
twitter.com-shallow-20170126-033914-7tedy.json 260 download   job
twitter.com-shallow-20170126-034113-btpiy.json 257 download   job
twitter.com-shallow-20170126-034128-dz0jt.json 259 download   job
twitter.com-shallow-20170126-034412-ar7c7.json 250 download   job
twitter.com-shallow-20170126-034525-dc9a6.json 282 download   job
twitter.com-shallow-20170126-034738-akkq7.json 258 download   job
twitter.com-shallow-20170126-034750-ai9te.json 257 download   job
twitter.com-shallow-20170126-035144-1h7to.json 281 download   job
twitter.com-shallow-20170126-040411-di41r.json 260 download   job
twitter.com-shallow-20170126-040559-6pkbh.json 282 download   job
twitter.com-shallow-20170126-042807-d8qzr.json 285 download   job
twitter.com-shallow-20170126-044028-37m5z.json 258 download   job
twitter.com-shallow-20170126-044300-b9799.json 258 download   job
twitter.com-shallow-20170126-044723-5zi4b.json 259 download   job
twitter.com-shallow-20170126-052503-49erj-00000.warc.gz 116702349 download   job
twitter.com-shallow-20170126-052503-49erj-00000.warc.os.cdx.gz 125098 download
twitter.com-shallow-20170126-052503-49erj-meta.warc.gz 78468 download   job
twitter.com-shallow-20170126-052503-49erj-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-052503-49erj.json 265 download   job
twitter.com-shallow-20170126-053014-54orj-00000.warc.gz 25667510 download   job
twitter.com-shallow-20170126-053014-54orj-00000.warc.os.cdx.gz 12124 download
twitter.com-shallow-20170126-053014-54orj-meta.warc.gz 12354 download   job
twitter.com-shallow-20170126-053014-54orj-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053014-54orj.json 268 download   job
twitter.com-shallow-20170126-053138-conus-00000.warc.gz 31372822 download   job
twitter.com-shallow-20170126-053138-conus-00000.warc.os.cdx.gz 20074 download
twitter.com-shallow-20170126-053138-conus-meta.warc.gz 17377 download   job
twitter.com-shallow-20170126-053138-conus-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053138-conus.json 269 download   job
twitter.com-shallow-20170126-053428-dab2p-00000.warc.gz 18070460 download   job
twitter.com-shallow-20170126-053428-dab2p-00000.warc.os.cdx.gz 10533 download
twitter.com-shallow-20170126-053428-dab2p-meta.warc.gz 11635 download   job
twitter.com-shallow-20170126-053428-dab2p-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053428-dab2p.json 269 download   job
twitter.com-shallow-20170126-053549-beoqv-00000.warc.gz 16711779 download   job
twitter.com-shallow-20170126-053549-beoqv-00000.warc.os.cdx.gz 9983 download
twitter.com-shallow-20170126-053549-beoqv-meta.warc.gz 11359 download   job
twitter.com-shallow-20170126-053549-beoqv-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053549-beoqv.json 269 download   job
twitter.com-shallow-20170126-053700-10sqg-00000.warc.gz 1885335 download   job
twitter.com-shallow-20170126-053700-10sqg-00000.warc.os.cdx.gz 3060 download
twitter.com-shallow-20170126-053700-10sqg-meta.warc.gz 6525 download   job
twitter.com-shallow-20170126-053700-10sqg-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053700-10sqg.json 269 download   job
twitter.com-shallow-20170126-053719-bnexf-00000.warc.gz 4182166 download   job
twitter.com-shallow-20170126-053719-bnexf-00000.warc.os.cdx.gz 4500 download
twitter.com-shallow-20170126-053719-bnexf-meta.warc.gz 7683 download   job
twitter.com-shallow-20170126-053719-bnexf-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053719-bnexf.json 269 download   job
twitter.com-shallow-20170126-053744-8r6oi-00000.warc.gz 13178807 download   job
twitter.com-shallow-20170126-053744-8r6oi-00000.warc.os.cdx.gz 7061 download
twitter.com-shallow-20170126-053744-8r6oi-meta.warc.gz 8917 download   job
twitter.com-shallow-20170126-053744-8r6oi-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053744-8r6oi.json 269 download   job
twitter.com-shallow-20170126-053834-byu61-00000.warc.gz 13567520 download   job
twitter.com-shallow-20170126-053834-byu61-00000.warc.os.cdx.gz 7395 download
twitter.com-shallow-20170126-053834-byu61-meta.warc.gz 9332 download   job
twitter.com-shallow-20170126-053834-byu61-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053834-byu61.json 269 download   job
twitter.com-shallow-20170126-053922-bmvzu-00000.warc.gz 21978036 download   job
twitter.com-shallow-20170126-053922-bmvzu-00000.warc.os.cdx.gz 10107 download
twitter.com-shallow-20170126-053922-bmvzu-meta.warc.gz 10932 download   job
twitter.com-shallow-20170126-053922-bmvzu-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-053922-bmvzu.json 269 download   job
twitter.com-shallow-20170126-054017-5z1go-00000.warc.gz 17394039 download   job
twitter.com-shallow-20170126-054017-5z1go-00000.warc.os.cdx.gz 6950 download
twitter.com-shallow-20170126-054017-5z1go-meta.warc.gz 9301 download   job
twitter.com-shallow-20170126-054017-5z1go-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-054017-5z1go.json 269 download   job
twitter.com-shallow-20170126-054025-lxqph-00000.warc.gz 6540983 download   job
twitter.com-shallow-20170126-054025-lxqph-00000.warc.os.cdx.gz 5168 download
twitter.com-shallow-20170126-054025-lxqph-meta.warc.gz 8073 download   job
twitter.com-shallow-20170126-054025-lxqph-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-054025-lxqph.json 269 download   job
twitter.com-shallow-20170126-054102-3rtse-00000.warc.gz 10119522 download   job
twitter.com-shallow-20170126-054102-3rtse-00000.warc.os.cdx.gz 6413 download
twitter.com-shallow-20170126-054102-3rtse-meta.warc.gz 8973 download   job
twitter.com-shallow-20170126-054102-3rtse-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-054102-3rtse.json 268 download   job
twitter.com-shallow-20170126-054106-1reao-00000.warc.gz 16109821 download   job
twitter.com-shallow-20170126-054106-1reao-00000.warc.os.cdx.gz 6373 download
twitter.com-shallow-20170126-054106-1reao-meta.warc.gz 8505 download   job
twitter.com-shallow-20170126-054106-1reao-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-054106-1reao.json 268 download   job
twitter.com-shallow-20170126-054143-5lw50-00000.warc.gz 57155718 download   job
twitter.com-shallow-20170126-054143-5lw50-00000.warc.os.cdx.gz 39952 download
twitter.com-shallow-20170126-054143-5lw50-meta.warc.gz 30659 download   job
twitter.com-shallow-20170126-054143-5lw50-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-054143-5lw50.json 262 download   job
twitter.com-shallow-20170126-054159-b0tjb-00000.warc.gz 76259335 download   job
twitter.com-shallow-20170126-054159-b0tjb-00000.warc.os.cdx.gz 53173 download
twitter.com-shallow-20170126-054159-b0tjb-meta.warc.gz 40933 download   job
twitter.com-shallow-20170126-054159-b0tjb-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-054159-b0tjb.json 262 download   job
twitter.com-shallow-20170126-055211-7azx7-00000.warc.gz 81743673 download   job
twitter.com-shallow-20170126-055211-7azx7-00000.warc.os.cdx.gz 43026 download
twitter.com-shallow-20170126-055211-7azx7-meta.warc.gz 31979 download   job
twitter.com-shallow-20170126-055211-7azx7-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20170126-055211-7azx7.json 254 download   job
twitter.com-shallow-20170126-062858-9mosu.json 254 download   job
twitter.com-shallow-20170126-063006-dusx0.json 253 download   job
twitter.com-shallow-20170126-075738-6exv9.json 275 download   job
twitter.com-shallow-20170126-080558-9674r.json 262 download   job
twitter.com-shallow-20170126-092147-efbmx.json 274 download   job
twitter.com-shallow-20170126-105252-aobha.json 274 download   job
twitter.com-shallow-20170126-201308-bh3sr.json 260 download   job
twitter.com-shallow-20170126-203317-e6gjd.json 258 download   job
twitter.com-shallow-20170126-210643-r5h4l.json 275 download   job
twitter.com-shallow-20170126-213320-76c3t.json 262 download   job
uk.linkedin.com-shallow-20170126-014727-bjicr.json 274 download   job
uk.linkedin.com-shallow-20170126-014754-2wer1.json 278 download   job
uk.linkedin.com-shallow-20170126-015101-7g3oa.json 272 download   job
uk.linkedin.com-shallow-20170126-025110-3g8ra.json 273 download   job
urls-fos.textfiles.com-EPA.gov-shallow-20170125-042241-baaoj-urls.txt 1201305 download
urls-fos.textfiles.com-EPA.gov-shallow-20170125-042241-baaoj.json 288 download   job
urls-gist.githubusercontent.com-gistfile1.txt-inf-20170125-193924-3gz64-urls.txt 111 download
urls-gist.githubusercontent.com-gistfile1.txt-inf-20170125-193924-3gz64.json 489 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20170124-184350-bosg4-urls.txt 53418 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20170124-184350-bosg4.json 498 download   job
urls-pastebin.com-0gPBdfb1-shallow-20170113-095743-47vzx-00006.warc.gz 1098706201 download   job
urls-pastebin.com-0gPBdfb1-shallow-20170113-095743-47vzx-00006.warc.os.cdx.gz 16815 download
urls-pastebin.com-0gPBdfb1-shallow-20170113-095743-47vzx-urls.txt 12778 download
urls-pastebin.com-0gPBdfb1-shallow-20170113-095743-47vzx.json 283 download   job
urls-pastebin.com-bDEeTxyQ-shallow-20170126-111843-6ggwr-urls.txt 17695 download
urls-pastebin.com-bDEeTxyQ-shallow-20170126-111843-6ggwr.json 287 download   job
usa.streetsblog.org-shallow-20170125-224813-69h8x-00000.warc.gz 745209 download   job
usa.streetsblog.org-shallow-20170125-224813-69h8x-00000.warc.os.cdx.gz 4640 download
usa.streetsblog.org-shallow-20170125-224813-69h8x-meta.warc.gz 6078 download   job
usa.streetsblog.org-shallow-20170125-224813-69h8x-meta.warc.os.cdx.gz 47 download
usa.streetsblog.org-shallow-20170125-224813-69h8x.json 327 download   job
variety.com-shallow-20170126-164900-b64ie-00000.warc.gz 2988602 download   job
variety.com-shallow-20170126-164900-b64ie-00000.warc.os.cdx.gz 19474 download
variety.com-shallow-20170126-164900-b64ie-meta.warc.gz 16337 download   job
variety.com-shallow-20170126-164900-b64ie-meta.warc.os.cdx.gz 47 download
variety.com-shallow-20170126-164900-b64ie.json 317 download   job
vine.co-shallow-20170126-031817-cw5ia.json 263 download   job
www.a2datarescue.com-inf-20170125-174741-bqiwo-00000.warc.gz 43467573 download   job
www.a2datarescue.com-inf-20170125-174741-bqiwo-00000.warc.os.cdx.gz 86231 download
www.a2datarescue.com-inf-20170125-174741-bqiwo-meta.warc.gz 57981 download   job
www.a2datarescue.com-inf-20170125-174741-bqiwo-meta.warc.os.cdx.gz 47 download
www.a2datarescue.com-inf-20170125-174741-bqiwo.json 251 download   job
www.alternativefacts.com-shallow-20170126-191246-4jjem-00000.warc.gz 2327119 download   job
www.alternativefacts.com-shallow-20170126-191246-4jjem-00000.warc.os.cdx.gz 12788 download
www.alternativefacts.com-shallow-20170126-191246-4jjem-meta.warc.gz 11919 download   job
www.alternativefacts.com-shallow-20170126-191246-4jjem-meta.warc.os.cdx.gz 47 download
www.alternativefacts.com-shallow-20170126-191246-4jjem.json 252 download   job
www.apnews.com-shallow-20170124-080104-q7667.json 278 download   job
www.archives.gov-shallow-20170125-224824-u9a88.json 291 download   job
www.bizapedia.com-shallow-20170126-071429-6cg2r.json 295 download   job
www.bloomberg.com-shallow-20170125-123908-c1unr-00000.warc.gz 4359813 download   job
www.bloomberg.com-shallow-20170125-123908-c1unr-00000.warc.os.cdx.gz 18062 download
www.bloomberg.com-shallow-20170125-123908-c1unr-meta.warc.gz 13435 download   job
www.bloomberg.com-shallow-20170125-123908-c1unr-meta.warc.os.cdx.gz 47 download
www.bloomberg.com-shallow-20170125-123908-c1unr.json 333 download   job
www.breitbart.com-shallow-20170125-060729-5nrco.json 334 download   job
www.burojansen.nl-shallow-20170125-174647-3rskv-00000.warc.gz 159016 download   job
www.burojansen.nl-shallow-20170125-174647-3rskv-00000.warc.os.cdx.gz 789 download
www.burojansen.nl-shallow-20170125-174647-3rskv-meta.warc.gz 3580 download   job
www.burojansen.nl-shallow-20170125-174647-3rskv-meta.warc.os.cdx.gz 47 download
www.burojansen.nl-shallow-20170125-174647-3rskv.json 307 download   job
www.burojansen.nl-shallow-20170125-175152-ajqko.json 287 download   job
www.burojansen.nl-shallow-20170125-185139-b2tgd.json 298 download   job
www.burojansen.nl-shallow-20170125-185158-2ietj.json 282 download   job
www.businessinsider.com-shallow-20170124-142300-eieqy.json 298 download   job
www.climate.gov-inf-20170125-055125-d91at-00000.warc.gz 5427133475 download   job
www.climate.gov-inf-20170125-055125-d91at-00000.warc.os.cdx.gz 2414867 download
www.climate.gov-inf-20170125-055125-d91at-00001.warc.gz 1808578710 download   job
www.climate.gov-inf-20170125-055125-d91at-00001.warc.os.cdx.gz 618631 download
www.climate.gov-inf-20170125-055125-d91at.json 242 download   job
www.clipshop.ca-inf-20170126-182455-8dsz8-00000.warc.gz 122308534 download   job
www.clipshop.ca-inf-20170126-182455-8dsz8-00000.warc.os.cdx.gz 6745 download
www.clipshop.ca-inf-20170126-182455-8dsz8-meta.warc.gz 7053 download   job
www.clipshop.ca-inf-20170126-182455-8dsz8-meta.warc.os.cdx.gz 47 download
www.clipshop.ca-inf-20170126-182455-8dsz8.json 250 download   job
www.clipshop.ca-inf-20170126-182539-albmc-00000.warc.gz 34363 download   job
www.clipshop.ca-inf-20170126-182539-albmc-00000.warc.os.cdx.gz 507 download
www.clipshop.ca-inf-20170126-182539-albmc-meta.warc.gz 3336 download   job
www.clipshop.ca-inf-20170126-182539-albmc-meta.warc.os.cdx.gz 47 download
www.clipshop.ca-inf-20170126-182539-albmc.json 244 download   job
www.cnn.com-shallow-20170125-162038-409ld-00000.warc.gz 6175 download   job
www.cnn.com-shallow-20170125-162038-409ld-00000.warc.os.cdx.gz 249 download
www.cnn.com-shallow-20170125-162038-409ld-meta.warc.gz 4332 download   job
www.cnn.com-shallow-20170125-162038-409ld-meta.warc.os.cdx.gz 47 download
www.cnn.com-shallow-20170125-162038-409ld.json 301 download   job
www.cnn.com-shallow-20170125-164236-409ld-aborted-00000.warc.gz 7368404 download   job
www.cnn.com-shallow-20170125-164236-409ld-aborted-00000.warc.os.cdx.gz 16582 download
www.cnn.com-shallow-20170125-164236-409ld-aborted.json 300 download   job
www.copacabanapalace.com.br-inf-20170124-093135-8jjfq.json 257 download   job
www.deseretnews.com-shallow-20170124-064559-axj1j.json 278 download   job
www.documentcloud.org-shallow-20170124-175154-6cjt6.json 322 download   job
www.eater.com-shallow-20170126-022529-b9lvd.json 302 download   job
www.engadget.com-shallow-20170126-002216-eooyf-00000.warc.gz 7706954 download   job
www.engadget.com-shallow-20170126-002216-eooyf-00000.warc.os.cdx.gz 24280 download
www.engadget.com-shallow-20170126-002216-eooyf-meta.warc.gz 23961 download   job
www.engadget.com-shallow-20170126-002216-eooyf-meta.warc.os.cdx.gz 47 download
www.engadget.com-shallow-20170126-002216-eooyf.json 311 download   job
www.epa.gov-inf-20170124-205827-4rno8.json 252 download   job
www.epa.gov-inf-20170125-050157-4rno8-00000.warc.gz 928009120 download   job
www.epa.gov-inf-20170125-050157-4rno8-00000.warc.os.cdx.gz 1155880 download
www.epa.gov-inf-20170125-050157-4rno8-meta.warc.gz 740715 download   job
www.epa.gov-inf-20170125-050157-4rno8-meta.warc.os.cdx.gz 47 download
www.epa.gov-inf-20170125-050157-4rno8.json 252 download   job
www.epa.gov-inf-20170125-193707-at9a6-aborted-00000.warc.gz 2119434 download   job
www.epa.gov-inf-20170125-193707-at9a6-aborted-00000.warc.os.cdx.gz 4534 download
www.epa.gov-inf-20170125-193707-at9a6-aborted.json 262 download   job
www.epa.gov-inf-20170125-193830-2fo04.json 264 download   job
www.epa.gov-inf-20170125-194155-14s09.json 254 download   job
www.epa.gov-inf-20170125-195231-14s09-00000.warc.gz 725611671 download   job
www.epa.gov-inf-20170125-195231-14s09-00000.warc.os.cdx.gz 351129 download
www.epa.gov-inf-20170125-195231-14s09-meta.warc.gz 222602 download   job
www.epa.gov-inf-20170125-195231-14s09-meta.warc.os.cdx.gz 47 download
www.epa.gov-inf-20170125-195231-14s09.json 254 download   job
www.epa.gov-inf-20170125-202055-2fo04-00000.warc.gz 333552239 download   job
www.epa.gov-inf-20170125-202055-2fo04-00000.warc.os.cdx.gz 318259 download
www.epa.gov-inf-20170125-202055-2fo04-meta.warc.gz 195844 download   job
www.epa.gov-inf-20170125-202055-2fo04-meta.warc.os.cdx.gz 47 download
www.epa.gov-inf-20170125-202055-2fo04.json 264 download   job
www.epa.gov-inf-20170125-204708-8iuc6-00000.warc.gz 858084499 download   job
www.epa.gov-inf-20170125-204708-8iuc6-00000.warc.os.cdx.gz 673094 download
www.epa.gov-inf-20170125-204708-8iuc6-meta.warc.gz 427432 download   job
www.epa.gov-inf-20170125-204708-8iuc6-meta.warc.os.cdx.gz 47 download
www.epa.gov-inf-20170125-204708-8iuc6.json 260 download   job
www.epa.gov-inf-20170125-214743-92iez-00000.warc.gz 166313816 download   job
www.epa.gov-inf-20170125-214743-92iez-00000.warc.os.cdx.gz 236521 download
www.epa.gov-inf-20170125-214743-92iez-meta.warc.gz 146462 download   job
www.epa.gov-inf-20170125-214743-92iez-meta.warc.os.cdx.gz 47 download
www.epa.gov-inf-20170125-214743-92iez.json 258 download   job
www.facebook.com-inf-20170125-224121-d5tr7-00000.warc.gz 33479 download   job
www.facebook.com-inf-20170125-224121-d5tr7-00000.warc.os.cdx.gz 338 download
www.facebook.com-inf-20170125-224121-d5tr7-meta.warc.gz 3297 download   job
www.facebook.com-inf-20170125-224121-d5tr7-meta.warc.os.cdx.gz 47 download
www.facebook.com-inf-20170125-224121-d5tr7.json 260 download   job
www.facebook.com-shallow-20170125-054509-du1um.json 289 download   job
www.facebook.com-shallow-20170126-062943-2ey19.json 258 download   job
www.facebook.com-shallow-20170126-063203-detye.json 285 download   job
www.facebook.com-shallow-20170126-080443-5upxe.json 270 download   job
www.fs.fed.us-inf-20170126-032306-2uvhl.json 244 download   job
www.fs.fed.us-inf-20170126-032442-2uvhl.json 244 download   job
www.gigamonkeys.com-inf-20170124-093919-42un8.json 250 download   job
www.glassdoor.com-shallow-20170126-015156-9z58r.json 289 download   job
www.gopcausedtrump.com-inf-20170125-094743-8bsab.json 252 download   job
www.greenpeace.org-shallow-20170126-035652-11ehp.json 332 download   job
www.hewillnotdivide.us-inf-20170124-155908-az7j4.json 249 download   job
www.huffingtonpost.com-shallow-20170125-024819-aj5ie.json 322 download   job
www.indivisibleguide.com-inf-20170126-184728-5aguk-00000.warc.gz 775284397 download   job
www.indivisibleguide.com-inf-20170126-184728-5aguk-00000.warc.os.cdx.gz 1005447 download
www.indivisibleguide.com-inf-20170126-184728-5aguk-meta.warc.gz 675104 download   job
www.indivisibleguide.com-inf-20170126-184728-5aguk-meta.warc.os.cdx.gz 47 download
www.indivisibleguide.com-inf-20170126-184728-5aguk.json 254 download   job
www.instagram.com-shallow-20170126-050030-3nj4j.json 269 download   job
www.kcna.kp-inf-20170124-105421-83268.json 297 download   job
www.lexisnexis.com-shallow-20170126-024225-c2vn9.json 435 download   job
www.lexisnexis.com-shallow-20170126-024349-bupi3.json 451 download   job
www.lexisnexis.com-shallow-20170126-024708-3gi4b.json 431 download   job
www.lightvortexastronomy.com-inf-20170124-092957-3rndo.json 254 download   job
www.linkedin.com-shallow-20170126-014652-8y6jf.json 278 download   job
www.linkedin.com-shallow-20170126-014711-6qe02.json 286 download   job
www.linkedin.com-shallow-20170126-014814-9xoyr.json 280 download   job
www.linkedin.com-shallow-20170126-014837-24bnn.json 281 download   job
www.linkedin.com-shallow-20170126-014900-e403c.json 280 download   job
www.linkedin.com-shallow-20170126-014942-c0t5g.json 276 download   job
www.linkedin.com-shallow-20170126-015005-6yqcf.json 265 download   job
www.linkedin.com-shallow-20170126-015032-6garn.json 278 download   job
www.linkedin.com-shallow-20170126-015127-1q8fk.json 266 download   job
www.linkedin.com-shallow-20170126-024826-5osht.json 261 download   job
www.linkedin.com-shallow-20170126-024853-b4u7r.json 288 download   job
www.linkedin.com-shallow-20170126-024921-31vjg.json 277 download   job
www.linkedin.com-shallow-20170126-024947-2lxsb.json 263 download   job
www.linkedin.com-shallow-20170126-025014-699nm.json 268 download   job
www.linkedin.com-shallow-20170126-025042-cydfa.json 276 download   job
www.linkedin.com-shallow-20170126-025138-cku36.json 275 download   job
www.marinij.com-shallow-20170126-045624-5f3td.json 298 download   job
www.marinij.com-shallow-20170126-045733-ctllp.json 298 download   job
www.maxkeiser.com-inf-20170110-100528-5vyzm-00011.warc.gz 4425429864 download   job
www.maxkeiser.com-inf-20170110-100528-5vyzm-00011.warc.os.cdx.gz 2294500 download
www.mcclatchydc.com-shallow-20170125-030049-2wwes.json 311 download   job
www.neafunded.us-shallow-20170125-074612-3m1yg.json 250 download   job
www.neh.gov-inf-20170126-055215-3uiww-00000.warc.gz 5452358779 download   job
www.neh.gov-inf-20170126-055215-3uiww-00000.warc.os.cdx.gz 4562529 download
www.neh.gov-inf-20170126-055215-3uiww-00001.warc.gz 5420856419 download   job
www.neh.gov-inf-20170126-055215-3uiww-00001.warc.os.cdx.gz 2868574 download
www.neh.gov-inf-20170126-055215-3uiww-00002.warc.gz 5368749381 download   job
www.neh.gov-inf-20170126-055215-3uiww-00002.warc.os.cdx.gz 695783 download
www.neh.gov-inf-20170126-055215-3uiww-00003.warc.gz 5369369828 download   job
www.neh.gov-inf-20170126-055215-3uiww-00003.warc.os.cdx.gz 1734357 download
www.neh.gov-inf-20170126-055215-3uiww-00004.warc.gz 5387635028 download   job
www.neh.gov-inf-20170126-055215-3uiww-00004.warc.os.cdx.gz 1730830 download
www.npr.org-shallow-20170124-125001-94ibb.json 346 download   job
www.nps.gov-shallow-20170125-032620-acu05.json 298 download   job
www.nps.gov-shallow-20170126-191731-b4y76-00000.warc.gz 2145430 download   job
www.nps.gov-shallow-20170126-191731-b4y76-00000.warc.os.cdx.gz 265 download
www.nps.gov-shallow-20170126-191731-b4y76-meta.warc.gz 3528 download   job
www.nps.gov-shallow-20170126-191731-b4y76-meta.warc.os.cdx.gz 47 download
www.nps.gov-shallow-20170126-191731-b4y76.json 307 download   job
www.nytimes.com-shallow-20170126-034955-42m3c.json 278 download   job
www.nytimes.com-shallow-20170126-035028-2qg94.json 250 download   job
www.pharmaskeletons.com-inf-20170124-085803-3o0kd.json 252 download   job
www.politico.com-shallow-20170126-211242-34tch.json 327 download   job
www.portlandhearingvoices.net-inf-20170126-061230-c2mkc.json 259 download   job
www.poynter.org-shallow-20170126-050500-dfh3n.json 346 download   job
www.rbleumarine.fr-inf-20170125-163133-8cgiz-aborted-00003.warc.gz 790373269 download   job
www.rbleumarine.fr-inf-20170125-163133-8cgiz-aborted-00003.warc.os.cdx.gz 1518592 download
www.rbleumarine.fr-inf-20170125-163133-8cgiz-aborted.json 245 download   job
www.recode.net-shallow-20170125-163916-2ldcw.json 312 download   job
www.reddit.com-inf-20170125-212443-ezji3.json 262 download   job
www.reddit.com-shallow-20170125-131947-12c9p.json 324 download   job
www.reddit.com-shallow-20170125-154339-cp7y0-00000.warc.gz 3038359 download   job
www.reddit.com-shallow-20170125-154339-cp7y0-00000.warc.os.cdx.gz 8868 download
www.reddit.com-shallow-20170125-154339-cp7y0-meta.warc.gz 9141 download   job
www.reddit.com-shallow-20170125-154339-cp7y0-meta.warc.os.cdx.gz 47 download
www.reddit.com-shallow-20170125-154339-cp7y0.json 288 download   job
www.reddit.com-shallow-20170126-041014-dcbh9.json 318 download   job
www.resistancemanual.org-shallow-20170126-200643-7ebit.json 256 download   job
www.reuters.com-shallow-20170125-031942-emgsd.json 305 download   job
www.richardsilverstein.com-shallow-20170124-204526-d6sj2.json 292 download   job
www.rollingstone.com-shallow-20170126-040430-elf10.json 315 download   job
www.scientistsmarchonwashington.com-inf-20170124-190024-ceqqn.json 265 download   job
www.seattletimes.com-shallow-20170125-004341-13jbv.json 277 download   job
www.slate.com-shallow-20170125-060015-bdsta.json 375 download   job
www.smirkingchimp.com-inf-20170125-075929-3976t-00002.warc.gz 1004681928 download   job
www.smirkingchimp.com-inf-20170125-075929-3976t-00002.warc.os.cdx.gz 1423850 download
www.smirkingchimp.com-inf-20170125-075929-3976t.json 249 download   job
www.snopes.com-shallow-20170124-225047-4fe6x.json 277 download   job
www.splcenter.org-shallow-20170125-064703-6ezp6.json 352 download   job
www.stashmedia.tv-inf-20170120-075803-4jtix.json 245 download   job
www.stephenspinola.com-inf-20170125-213048-2rfyp.json 252 download   job
www.supremecourt.uk-inf-20170124-160445-4ejgm.json 285 download   job
www.supremecourt.uk-shallow-20170124-160650-977qx.json 285 download   job
www.supremecourt.uk-shallow-20170125-010618-s054s.json 294 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00000.warc.gz 5385664447 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00000.warc.os.cdx.gz 2491952 download
www.surfingmagazine.com-inf-20170126-010245-eyjze-00001.warc.gz 5368718072 download   job
www.surfingmagazine.com-inf-20170126-010245-eyjze-00001.warc.os.cdx.gz 2927387 download
www.techdirt.com-shallow-20170125-212801-4tv4c.json 398 download   job
www.technologyreview.com-shallow-20170126-185059-8oh33-00000.warc.gz 2030251 download   job
www.technologyreview.com-shallow-20170126-185059-8oh33-00000.warc.os.cdx.gz 7036 download
www.technologyreview.com-shallow-20170126-185059-8oh33-meta.warc.gz 7712 download   job
www.technologyreview.com-shallow-20170126-185059-8oh33-meta.warc.os.cdx.gz 47 download
www.technologyreview.com-shallow-20170126-185059-8oh33.json 339 download   job
www.theblaze.com-shallow-20170124-080040-6qawy.json 348 download   job
www.theguardian.com-shallow-20170124-232619-65iua.json 329 download   job
www.theguardian.com-shallow-20170126-045601-e27pr.json 307 download   job
www.theguardian.com-shallow-20170126-170438-6dn0s-00000.warc.gz 1964311 download   job
www.theguardian.com-shallow-20170126-170438-6dn0s-00000.warc.os.cdx.gz 14985 download
www.theguardian.com-shallow-20170126-170438-6dn0s-meta.warc.gz 12549 download   job
www.theguardian.com-shallow-20170126-170438-6dn0s-meta.warc.os.cdx.gz 47 download
www.theguardian.com-shallow-20170126-170438-6dn0s.json 321 download   job
www.theknot.com-inf-20170125-035010-9m48t.json 288 download   job
www.thestranger.com-shallow-20170126-162047-1oiy1-00000.warc.gz 6128306 download   job
www.thestranger.com-shallow-20170126-162047-1oiy1-00000.warc.os.cdx.gz 5453 download
www.thestranger.com-shallow-20170126-162047-1oiy1-meta.warc.gz 6965 download   job
www.thestranger.com-shallow-20170126-162047-1oiy1-meta.warc.os.cdx.gz 47 download
www.thestranger.com-shallow-20170126-162047-1oiy1.json 351 download   job
www.thestranger.com-shallow-20170126-162104-76dnc-00000.warc.gz 5890198 download   job
www.thestranger.com-shallow-20170126-162104-76dnc-00000.warc.os.cdx.gz 5304 download
www.thestranger.com-shallow-20170126-162104-76dnc-meta.warc.gz 6928 download   job
www.thestranger.com-shallow-20170126-162104-76dnc-meta.warc.os.cdx.gz 47 download
www.thestranger.com-shallow-20170126-162104-76dnc.json 346 download   job
www.theverge.com-shallow-20170124-212542-1yhdh.json 336 download   job
www.tiffe.de-inf-20170123-191027-6geph.json 251 download   job
www.vicrailstations.com-inf-20170125-185606-df1ai.json 253 download   job
www.vox.com-shallow-20170126-025144-7u90w.json 330 download   job
www.washingtonpost.com-shallow-20170124-065843-er3jh.json 390 download   job
www.washingtonpost.com-shallow-20170124-135040-eosb8.json 371 download   job
www.washingtonpost.com-shallow-20170125-032933-rqid4.json 292 download   job
www.washingtonpost.com-shallow-20170125-042801-awpwr.json 440 download   job
www.washingtonpost.com-shallow-20170126-024036-5lsj8.json 268 download   job
www.washingtonpost.com-shallow-20170126-162500-evde8-00000.warc.gz 3679210 download   job
www.washingtonpost.com-shallow-20170126-162500-evde8-00000.warc.os.cdx.gz 7746 download
www.washingtonpost.com-shallow-20170126-162500-evde8-meta.warc.gz 8451 download   job
www.washingtonpost.com-shallow-20170126-162500-evde8-meta.warc.os.cdx.gz 47 download
www.washingtonpost.com-shallow-20170126-162500-evde8.json 351 download   job
www.wesearchr.com-inf-20170124-085445-3fvbj.json 248 download   job
www.youtube.com-shallow-20170125-060214-2z8cs.json 267 download   job
www.youtube.com-shallow-20170125-071831-6cny3.json 286 download   job
www.youtube.com-shallow-20170125-171149-9soko-00000.warc.gz 49729 download   job
www.youtube.com-shallow-20170125-171149-9soko-00000.warc.os.cdx.gz 237 download
www.youtube.com-shallow-20170125-171149-9soko-meta.warc.gz 4262 download   job
www.youtube.com-shallow-20170125-171149-9soko-meta.warc.os.cdx.gz 47 download
www.youtube.com-shallow-20170125-171149-9soko.json 267 download   job
www.youtube.com-shallow-20170125-171801-b9wgp.json 267 download   job
www.youtube.com-shallow-20170126-041153-bf5pk.json 266 download   job
www.youtube.com-shallow-20170126-202430-edatn.json 269 download   job