Item archiveteam_archivebot_go_20200712200005

View on Internet Archive

Filename Size
archive.max.fan-shallow-20200712-174408-c5tot-00000.warc.gz 95333 download   job
archive.max.fan-shallow-20200712-174408-c5tot-00000.warc.os.cdx.gz 253 download
archive.max.fan-shallow-20200712-174408-c5tot-meta.warc.gz 3523 download   job
archive.max.fan-shallow-20200712-174408-c5tot-meta.warc.os.cdx.gz 47 download
archiveteam_archivebot_go_20200712200005.cdx.gz 142964416 download
archiveteam_archivebot_go_20200712200005.cdx.idx 114274 download
archiveteam_archivebot_go_20200712200005_files.xml 0 download
archiveteam_archivebot_go_20200712200005_meta.sqlite 949248 download
archiveteam_archivebot_go_20200712200005_meta.xml 969 download
ektoplazm.com-inf-20200704-233408-66i1h-00028.warc.gz 5512281290 download   job
ektoplazm.com-inf-20200704-233408-66i1h-00028.warc.os.cdx.gz 14337 download
luc.devroye.org-inf-20200629-195003-6kmq5-00052.warc.gz 5369595902 download   job
luc.devroye.org-inf-20200629-195003-6kmq5-00052.warc.os.cdx.gz 3218863 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00137.warc.gz 5396430180 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00137.warc.os.cdx.gz 54274 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00139.warc.gz 4378158237 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00139.warc.os.cdx.gz 38127 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-meta.warc.gz 5327523 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-meta.warc.os.cdx.gz 47 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet.json 245 download   job
old.reddit.com-inf-20200712-011858-coxzo-00010.warc.gz 3043792320 download   job
old.reddit.com-inf-20200712-011858-coxzo-00010.warc.os.cdx.gz 1484446 download
old.reddit.com-inf-20200712-011858-coxzo.json 260 download   job
player.fm-inf-20200501-233943-6recr-00689.warc.gz 5430834618 download   job
player.fm-inf-20200501-233943-6recr-00689.warc.os.cdx.gz 213692 download
urls-archive.max.fan-twitter-@BRVB_NHS-filtered.txt-shallow-20200712-195426-3rwyr-00000.warc.gz 136459643 download   job
urls-archive.max.fan-twitter-@BRVB_NHS-filtered.txt-shallow-20200712-195426-3rwyr-00000.warc.os.cdx.gz 161555 download
urls-archive.max.fan-twitter-@BRVB_NHS-filtered.txt-shallow-20200712-195426-3rwyr-meta.warc.gz 89776 download   job
urls-archive.max.fan-twitter-@BRVB_NHS-filtered.txt-shallow-20200712-195426-3rwyr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BRVB_NHS-filtered.txt-shallow-20200712-195426-3rwyr-urls.txt 47321 download
urls-archive.max.fan-twitter-@BRVB_NHS-filtered.txt-shallow-20200712-195426-3rwyr.json 331 download   job
urls-archive.max.fan-twitter-@BulgariaUNHCR-filtered.txt-shallow-20200712-195424-62wpi-00000.warc.gz 39006724 download   job
urls-archive.max.fan-twitter-@BulgariaUNHCR-filtered.txt-shallow-20200712-195424-62wpi-00000.warc.os.cdx.gz 39111 download
urls-archive.max.fan-twitter-@BulgariaUNHCR-filtered.txt-shallow-20200712-195424-62wpi-meta.warc.gz 26005 download   job
urls-archive.max.fan-twitter-@BulgariaUNHCR-filtered.txt-shallow-20200712-195424-62wpi-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BulgariaUNHCR-filtered.txt-shallow-20200712-195424-62wpi-urls.txt 30300 download
urls-archive.max.fan-twitter-@BulgariaUNHCR-filtered.txt-shallow-20200712-195424-62wpi.json 341 download   job
urls-archive.max.fan-twitter-@BurundiForeign-filtered.txt-shallow-20200712-195421-1cxp1-00000.warc.gz 7197594 download   job
urls-archive.max.fan-twitter-@BurundiForeign-filtered.txt-shallow-20200712-195421-1cxp1-00000.warc.os.cdx.gz 10099 download
urls-archive.max.fan-twitter-@BurundiForeign-filtered.txt-shallow-20200712-195421-1cxp1-meta.warc.gz 9550 download   job
urls-archive.max.fan-twitter-@BurundiForeign-filtered.txt-shallow-20200712-195421-1cxp1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BurundiForeign-filtered.txt-shallow-20200712-195421-1cxp1-urls.txt 5795 download
urls-archive.max.fan-twitter-@CALFIRE_ButteCo-filtered.txt-shallow-20200712-194139-du46e-00000.warc.gz 1045500818 download   job
urls-archive.max.fan-twitter-@CALFIRE_ButteCo-filtered.txt-shallow-20200712-194139-du46e-00000.warc.os.cdx.gz 834490 download
urls-archive.max.fan-twitter-@CALFIRE_ButteCo-filtered.txt-shallow-20200712-194139-du46e-meta.warc.gz 438174 download   job
urls-archive.max.fan-twitter-@CALFIRE_ButteCo-filtered.txt-shallow-20200712-194139-du46e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CALFIRE_ButteCo-filtered.txt-shallow-20200712-194139-du46e-urls.txt 243889 download
urls-archive.max.fan-twitter-@CALFIRE_ButteCo-filtered.txt-shallow-20200712-194139-du46e.json 345 download   job
urls-archive.max.fan-twitter-@CARAS_LDN-filtered.txt-shallow-20200712-193609-1bv53-00000.warc.gz 315526051 download   job
urls-archive.max.fan-twitter-@CARAS_LDN-filtered.txt-shallow-20200712-193609-1bv53-00000.warc.os.cdx.gz 283205 download
urls-archive.max.fan-twitter-@CARAS_LDN-filtered.txt-shallow-20200712-193609-1bv53-meta.warc.gz 154407 download   job
urls-archive.max.fan-twitter-@CARAS_LDN-filtered.txt-shallow-20200712-193609-1bv53-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CARAS_LDN-filtered.txt-shallow-20200712-193609-1bv53-urls.txt 103473 download
urls-archive.max.fan-twitter-@CARAS_LDN-filtered.txt-shallow-20200712-193609-1bv53.json 333 download   job
urls-archive.max.fan-twitter-@CASenatorJim-filtered.txt-shallow-20200712-192742-3y7ri-00000.warc.gz 5598067 download   job
urls-archive.max.fan-twitter-@CASenatorJim-filtered.txt-shallow-20200712-192742-3y7ri-00000.warc.os.cdx.gz 14186 download
urls-archive.max.fan-twitter-@CASenatorJim-filtered.txt-shallow-20200712-192742-3y7ri-meta.warc.gz 11912 download   job
urls-archive.max.fan-twitter-@CASenatorJim-filtered.txt-shallow-20200712-192742-3y7ri-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CASenatorJim-filtered.txt-shallow-20200712-192742-3y7ri-urls.txt 1500 download
urls-archive.max.fan-twitter-@CASenatorJim-filtered.txt-shallow-20200712-192742-3y7ri.json 339 download   job
urls-archive.max.fan-twitter-@CAgovernor-filtered.txt-shallow-20200712-194143-52a4p-00000.warc.gz 499152766 download   job
urls-archive.max.fan-twitter-@CAgovernor-filtered.txt-shallow-20200712-194143-52a4p-00000.warc.os.cdx.gz 1308048 download
urls-archive.max.fan-twitter-@CAgovernor-filtered.txt-shallow-20200712-194143-52a4p-meta.warc.gz 684972 download   job
urls-archive.max.fan-twitter-@CAgovernor-filtered.txt-shallow-20200712-194143-52a4p-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CAgovernor-filtered.txt-shallow-20200712-194143-52a4p-urls.txt 136532 download
urls-archive.max.fan-twitter-@CAgovernor-filtered.txt-shallow-20200712-194143-52a4p.json 335 download   job
urls-archive.max.fan-twitter-@CBurtonAP-filtered.txt-shallow-20200712-192625-60dqi-00000.warc.gz 85199112 download   job
urls-archive.max.fan-twitter-@CBurtonAP-filtered.txt-shallow-20200712-192625-60dqi-00000.warc.os.cdx.gz 91333 download
urls-archive.max.fan-twitter-@CBurtonAP-filtered.txt-shallow-20200712-192625-60dqi-meta.warc.gz 54030 download   job
urls-archive.max.fan-twitter-@CBurtonAP-filtered.txt-shallow-20200712-192625-60dqi-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CBurtonAP-filtered.txt-shallow-20200712-192625-60dqi-urls.txt 64229 download
urls-archive.max.fan-twitter-@CBurtonAP-filtered.txt-shallow-20200712-192625-60dqi.json 333 download   job
urls-archive.max.fan-twitter-@CCDD_HSPH-filtered.txt-shallow-20200712-192115-cgro1-00000.warc.gz 158784330 download   job
urls-archive.max.fan-twitter-@CCDD_HSPH-filtered.txt-shallow-20200712-192115-cgro1-00000.warc.os.cdx.gz 246770 download
urls-archive.max.fan-twitter-@CCDD_HSPH-filtered.txt-shallow-20200712-192115-cgro1-meta.warc.gz 135519 download   job
urls-archive.max.fan-twitter-@CCDD_HSPH-filtered.txt-shallow-20200712-192115-cgro1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CCDD_HSPH-filtered.txt-shallow-20200712-192115-cgro1-urls.txt 107866 download
urls-archive.max.fan-twitter-@CCDD_HSPH-filtered.txt-shallow-20200712-192115-cgro1.json 333 download   job
urls-archive.max.fan-twitter-@CDCReady-filtered.txt-shallow-20200712-190909-1wzfn-00000.warc.gz 97995476 download   job
urls-archive.max.fan-twitter-@CDCReady-filtered.txt-shallow-20200712-190909-1wzfn-00000.warc.os.cdx.gz 302557 download
urls-archive.max.fan-twitter-@CDCReady-filtered.txt-shallow-20200712-190909-1wzfn-meta.warc.gz 169649 download   job
urls-archive.max.fan-twitter-@CDCReady-filtered.txt-shallow-20200712-190909-1wzfn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CDCReady-filtered.txt-shallow-20200712-190909-1wzfn-urls.txt 75075 download
urls-archive.max.fan-twitter-@CDCReady-filtered.txt-shallow-20200712-190909-1wzfn.json 331 download   job
urls-archive.max.fan-twitter-@CEPCDurango-filtered.txt-shallow-20200712-190837-9azt1-00000.warc.gz 1624801015 download   job
urls-archive.max.fan-twitter-@CEPCDurango-filtered.txt-shallow-20200712-190837-9azt1-00000.warc.os.cdx.gz 1113382 download
urls-archive.max.fan-twitter-@CEPCDurango-filtered.txt-shallow-20200712-190837-9azt1-meta.warc.gz 573829 download   job
urls-archive.max.fan-twitter-@CEPCDurango-filtered.txt-shallow-20200712-190837-9azt1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CEPCDurango-filtered.txt-shallow-20200712-190837-9azt1-urls.txt 294504 download
urls-archive.max.fan-twitter-@CEPCDurango-filtered.txt-shallow-20200712-190837-9azt1.json 337 download   job
urls-archive.max.fan-twitter-@CHISNPS-filtered.txt-shallow-20200712-190135-b7l90-00000.warc.gz 100254780 download   job
urls-archive.max.fan-twitter-@CHISNPS-filtered.txt-shallow-20200712-190135-b7l90-00000.warc.os.cdx.gz 158110 download
urls-archive.max.fan-twitter-@CHISNPS-filtered.txt-shallow-20200712-190135-b7l90-meta.warc.gz 89423 download   job
urls-archive.max.fan-twitter-@CHISNPS-filtered.txt-shallow-20200712-190135-b7l90-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CHISNPS-filtered.txt-shallow-20200712-190135-b7l90-urls.txt 70613 download
urls-archive.max.fan-twitter-@CHISNPS-filtered.txt-shallow-20200712-190135-b7l90.json 329 download   job
urls-archive.max.fan-twitter-@CHPWestValley-filtered.txt-shallow-20200712-183122-3z03d-00000.warc.gz 520915687 download   job
urls-archive.max.fan-twitter-@CHPWestValley-filtered.txt-shallow-20200712-183122-3z03d-00000.warc.os.cdx.gz 639550 download
urls-archive.max.fan-twitter-@CHPWestValley-filtered.txt-shallow-20200712-183122-3z03d-meta.warc.gz 345616 download   job
urls-archive.max.fan-twitter-@CHPWestValley-filtered.txt-shallow-20200712-183122-3z03d-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CHPWestValley-filtered.txt-shallow-20200712-183122-3z03d-urls.txt 243200 download
urls-archive.max.fan-twitter-@CHPWestValley-filtered.txt-shallow-20200712-183122-3z03d.json 341 download   job
urls-archive.max.fan-twitter-@CHP_SantaBarb-filtered.txt-shallow-20200712-185854-8va56-00000.warc.gz 89845348 download   job
urls-archive.max.fan-twitter-@CHP_SantaBarb-filtered.txt-shallow-20200712-185854-8va56-00000.warc.os.cdx.gz 132582 download
urls-archive.max.fan-twitter-@CHP_SantaBarb-filtered.txt-shallow-20200712-185854-8va56-meta.warc.gz 76386 download   job
urls-archive.max.fan-twitter-@CHP_SantaBarb-filtered.txt-shallow-20200712-185854-8va56-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CHP_SantaBarb-filtered.txt-shallow-20200712-185854-8va56-urls.txt 26003 download
urls-archive.max.fan-twitter-@CINU_Bolivia-filtered.txt-shallow-20200712-182325-8xztq-00000.warc.gz 59052863 download   job
urls-archive.max.fan-twitter-@CINU_Bolivia-filtered.txt-shallow-20200712-182325-8xztq-00000.warc.os.cdx.gz 60031 download
urls-archive.max.fan-twitter-@CINU_Bolivia-filtered.txt-shallow-20200712-182325-8xztq-meta.warc.gz 36174 download   job
urls-archive.max.fan-twitter-@CINU_Bolivia-filtered.txt-shallow-20200712-182325-8xztq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CINU_Bolivia-filtered.txt-shallow-20200712-182325-8xztq-urls.txt 30149 download
urls-archive.max.fan-twitter-@CINU_Bolivia-filtered.txt-shallow-20200712-182325-8xztq.json 339 download   job
urls-archive.max.fan-twitter-@CITES-filtered.txt-shallow-20200712-182237-3ek3g-00000.warc.gz 619233550 download   job
urls-archive.max.fan-twitter-@CITES-filtered.txt-shallow-20200712-182237-3ek3g-00000.warc.os.cdx.gz 1479730 download
urls-archive.max.fan-twitter-@CITES-filtered.txt-shallow-20200712-182237-3ek3g-meta.warc.gz 785596 download   job
urls-archive.max.fan-twitter-@CITES-filtered.txt-shallow-20200712-182237-3ek3g-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CITES-filtered.txt-shallow-20200712-182237-3ek3g-urls.txt 128434 download
urls-archive.max.fan-twitter-@CITES-filtered.txt-shallow-20200712-182237-3ek3g.json 325 download   job
urls-archive.max.fan-twitter-@CJEMetcalf-filtered.txt-shallow-20200712-182102-1biaq-00000.warc.gz 17664684 download   job
urls-archive.max.fan-twitter-@CJEMetcalf-filtered.txt-shallow-20200712-182102-1biaq-00000.warc.os.cdx.gz 40615 download
urls-archive.max.fan-twitter-@CJEMetcalf-filtered.txt-shallow-20200712-182102-1biaq-meta.warc.gz 26283 download   job
urls-archive.max.fan-twitter-@CJEMetcalf-filtered.txt-shallow-20200712-182102-1biaq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CJEMetcalf-filtered.txt-shallow-20200712-182102-1biaq-urls.txt 9038 download
urls-archive.max.fan-twitter-@CJEMetcalf-filtered.txt-shallow-20200712-182102-1biaq.json 335 download   job
urls-archive.max.fan-twitter-@CLTMayor-filtered.txt-shallow-20200712-181051-724at-00000.warc.gz 67446335 download   job
urls-archive.max.fan-twitter-@CLTMayor-filtered.txt-shallow-20200712-181051-724at-00000.warc.os.cdx.gz 180317 download
urls-archive.max.fan-twitter-@CLTMayor-filtered.txt-shallow-20200712-181051-724at-meta.warc.gz 101133 download   job
urls-archive.max.fan-twitter-@CLTMayor-filtered.txt-shallow-20200712-181051-724at-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CLTMayor-filtered.txt-shallow-20200712-181051-724at-urls.txt 16563 download
urls-archive.max.fan-twitter-@CLTMayor-filtered.txt-shallow-20200712-181051-724at.json 331 download   job
urls-archive.max.fan-twitter-@CMNUCC-filtered.txt-shallow-20200712-181030-ctvdf-00000.warc.gz 1670492078 download   job
urls-archive.max.fan-twitter-@CMNUCC-filtered.txt-shallow-20200712-181030-ctvdf-00000.warc.os.cdx.gz 2394699 download
urls-archive.max.fan-twitter-@CMNUCC-filtered.txt-shallow-20200712-181030-ctvdf-meta.warc.gz 1268816 download   job
urls-archive.max.fan-twitter-@CMNUCC-filtered.txt-shallow-20200712-181030-ctvdf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CMNUCC-filtered.txt-shallow-20200712-181030-ctvdf-urls.txt 423572 download
urls-archive.max.fan-twitter-@CMNUCC-filtered.txt-shallow-20200712-181030-ctvdf.json 327 download   job
urls-archive.max.fan-twitter-@CMOKerala-filtered.txt-shallow-20200712-181028-debr7-00000.warc.gz 554723483 download   job
urls-archive.max.fan-twitter-@CMOKerala-filtered.txt-shallow-20200712-181028-debr7-00000.warc.os.cdx.gz 1392417 download
urls-archive.max.fan-twitter-@CMOKerala-filtered.txt-shallow-20200712-181028-debr7-meta.warc.gz 727003 download   job
urls-archive.max.fan-twitter-@CMOKerala-filtered.txt-shallow-20200712-181028-debr7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CMOKerala-filtered.txt-shallow-20200712-181028-debr7-urls.txt 222663 download
urls-archive.max.fan-twitter-@CMOKerala-filtered.txt-shallow-20200712-181028-debr7.json 333 download   job
urls-archive.max.fan-twitter-@COStateFire-filtered.txt-shallow-20200712-174017-b80fq-00000.warc.gz 427622834 download   job
urls-archive.max.fan-twitter-@COStateFire-filtered.txt-shallow-20200712-174017-b80fq-00000.warc.os.cdx.gz 385008 download
urls-archive.max.fan-twitter-@COStateFire-filtered.txt-shallow-20200712-174017-b80fq-meta.warc.gz 208087 download   job
urls-archive.max.fan-twitter-@COStateFire-filtered.txt-shallow-20200712-174017-b80fq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@COStateFire-filtered.txt-shallow-20200712-174017-b80fq-urls.txt 148642 download
urls-archive.max.fan-twitter-@COStateFire-filtered.txt-shallow-20200712-174017-b80fq.json 337 download   job
urls-archive.max.fan-twitter-@COVID19Tracking-filtered.txt-shallow-20200712-173502-dylvu-00000.warc.gz 152517570 download   job
urls-archive.max.fan-twitter-@COVID19Tracking-filtered.txt-shallow-20200712-173502-dylvu-00000.warc.os.cdx.gz 452962 download
urls-archive.max.fan-twitter-@COVID19Tracking-filtered.txt-shallow-20200712-173502-dylvu-meta.warc.gz 244353 download   job
urls-archive.max.fan-twitter-@COVID19Tracking-filtered.txt-shallow-20200712-173502-dylvu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@COVID19Tracking-filtered.txt-shallow-20200712-173502-dylvu-urls.txt 40887 download
urls-archive.max.fan-twitter-@COVID19Tracking-filtered.txt-shallow-20200712-173502-dylvu.json 345 download   job
urls-archive.max.fan-twitter-@CPJMENA-filtered.txt-shallow-20200712-173014-3zicp-00000.warc.gz 465706926 download   job
urls-archive.max.fan-twitter-@CPJMENA-filtered.txt-shallow-20200712-173014-3zicp-00000.warc.os.cdx.gz 943028 download
urls-archive.max.fan-twitter-@CPJMENA-filtered.txt-shallow-20200712-173014-3zicp-meta.warc.gz 504237 download   job
urls-archive.max.fan-twitter-@CPJMENA-filtered.txt-shallow-20200712-173014-3zicp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CPJMENA-filtered.txt-shallow-20200712-173014-3zicp-urls.txt 336399 download
urls-archive.max.fan-twitter-@CPJMENA-filtered.txt-shallow-20200712-173014-3zicp.json 329 download   job
urls-archive.max.fan-twitter-@CQualtro-filtered.txt-shallow-20200712-173012-cn1gh-00000.warc.gz 1004775895 download   job
urls-archive.max.fan-twitter-@CQualtro-filtered.txt-shallow-20200712-173012-cn1gh-00000.warc.os.cdx.gz 1078240 download
urls-archive.max.fan-twitter-@CQualtro-filtered.txt-shallow-20200712-173012-cn1gh-meta.warc.gz 576828 download   job
urls-archive.max.fan-twitter-@CQualtro-filtered.txt-shallow-20200712-173012-cn1gh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CQualtro-filtered.txt-shallow-20200712-173012-cn1gh-urls.txt 225647 download
urls-archive.max.fan-twitter-@CQualtro-filtered.txt-shallow-20200712-173012-cn1gh.json 331 download   job
urls-archive.max.fan-twitter-@CRHFootball-filtered.txt-shallow-20200712-173006-2mkej-00000.warc.gz 1567165 download   job
urls-archive.max.fan-twitter-@CRHFootball-filtered.txt-shallow-20200712-173006-2mkej-00000.warc.os.cdx.gz 6557 download
urls-archive.max.fan-twitter-@CRHFootball-filtered.txt-shallow-20200712-173006-2mkej-meta.warc.gz 7649 download   job
urls-archive.max.fan-twitter-@CRHFootball-filtered.txt-shallow-20200712-173006-2mkej-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CRTejada-filtered.txt-shallow-20200712-173005-3zxjm-00000.warc.gz 661721759 download   job
urls-archive.max.fan-twitter-@CRTejada-filtered.txt-shallow-20200712-173005-3zxjm-00000.warc.os.cdx.gz 1043285 download
urls-archive.max.fan-twitter-@CRTejada-filtered.txt-shallow-20200712-173005-3zxjm-meta.warc.gz 559182 download   job
urls-archive.max.fan-twitter-@CRTejada-filtered.txt-shallow-20200712-173005-3zxjm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CRTejada-filtered.txt-shallow-20200712-173005-3zxjm-urls.txt 467083 download
urls-archive.max.fan-twitter-@CRTejada-filtered.txt-shallow-20200712-173005-3zxjm.json 331 download   job
urls-archive.max.fan-twitter-@CRefugeeCouncil-filtered.txt-shallow-20200712-173010-amfh0-00000.warc.gz 27711260 download   job
urls-archive.max.fan-twitter-@CRefugeeCouncil-filtered.txt-shallow-20200712-173010-amfh0-00000.warc.os.cdx.gz 35457 download
urls-archive.max.fan-twitter-@CRefugeeCouncil-filtered.txt-shallow-20200712-173010-amfh0-urls.txt 23591 download
urls-archive.max.fan-twitter-@CRefugeeCouncil-filtered.txt-shallow-20200712-173010-amfh0.json 345 download   job
urls-archive.max.fan-twitter-@CSheehanAP-filtered.txt-shallow-20200712-172439-4rd2v-00000.warc.gz 295272462 download   job
urls-archive.max.fan-twitter-@CSheehanAP-filtered.txt-shallow-20200712-172439-4rd2v-00000.warc.os.cdx.gz 323738 download
urls-archive.max.fan-twitter-@CSheehanAP-filtered.txt-shallow-20200712-172439-4rd2v-meta.warc.gz 175355 download   job
urls-archive.max.fan-twitter-@CSheehanAP-filtered.txt-shallow-20200712-172439-4rd2v-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CSheehanAP-filtered.txt-shallow-20200712-172439-4rd2v-urls.txt 194379 download
urls-archive.max.fan-twitter-@CSheehanAP-filtered.txt-shallow-20200712-172439-4rd2v.json 335 download   job
urls-archive.max.fan-twitter-@CT_Bergstrom-filtered.txt-shallow-20200712-172436-3b6n6-00000.warc.gz 2065072875 download   job
urls-archive.max.fan-twitter-@CT_Bergstrom-filtered.txt-shallow-20200712-172436-3b6n6-00000.warc.os.cdx.gz 3529222 download
urls-archive.max.fan-twitter-@CT_Bergstrom-filtered.txt-shallow-20200712-172436-3b6n6-urls.txt 822719 download
urls-archive.max.fan-twitter-@CT_Bergstrom-filtered.txt-shallow-20200712-172436-3b6n6.json 339 download   job
urls-archive.max.fan-twitter-@CVNPNPS-filtered.txt-shallow-20200712-171823-2xheh-00000.warc.gz 298003914 download   job
urls-archive.max.fan-twitter-@CVNPNPS-filtered.txt-shallow-20200712-171823-2xheh-00000.warc.os.cdx.gz 284257 download
urls-archive.max.fan-twitter-@CVNPNPS-filtered.txt-shallow-20200712-171823-2xheh-meta.warc.gz 155718 download   job
urls-archive.max.fan-twitter-@CVNPNPS-filtered.txt-shallow-20200712-171823-2xheh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CVNPNPS-filtered.txt-shallow-20200712-171823-2xheh-urls.txt 59273 download
urls-archive.max.fan-twitter-@CVNPNPS-filtered.txt-shallow-20200712-171823-2xheh.json 329 download   job
urls-archive.max.fan-twitter-@CVNPmtb-filtered.txt-shallow-20200712-171830-7704o-00000.warc.gz 69175977 download   job
urls-archive.max.fan-twitter-@CVNPmtb-filtered.txt-shallow-20200712-171830-7704o-00000.warc.os.cdx.gz 102622 download
urls-archive.max.fan-twitter-@Callahanwrick-filtered.txt-shallow-20200712-193930-4eo9x-00000.warc.gz 20506294 download   job
urls-archive.max.fan-twitter-@Callahanwrick-filtered.txt-shallow-20200712-193930-4eo9x-00000.warc.os.cdx.gz 24561 download
urls-archive.max.fan-twitter-@Callahanwrick-filtered.txt-shallow-20200712-193930-4eo9x-meta.warc.gz 17510 download   job
urls-archive.max.fan-twitter-@Callahanwrick-filtered.txt-shallow-20200712-193930-4eo9x-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Callahanwrick-filtered.txt-shallow-20200712-193930-4eo9x-urls.txt 19404 download
urls-archive.max.fan-twitter-@Callahanwrick-filtered.txt-shallow-20200712-193930-4eo9x.json 341 download   job
urls-archive.max.fan-twitter-@CaltransD5-filtered.txt-shallow-20200712-193703-7nvyv-00000.warc.gz 277066923 download   job
urls-archive.max.fan-twitter-@CaltransD5-filtered.txt-shallow-20200712-193703-7nvyv-00000.warc.os.cdx.gz 400134 download
urls-archive.max.fan-twitter-@CaltransD5-filtered.txt-shallow-20200712-193703-7nvyv-meta.warc.gz 216934 download   job
urls-archive.max.fan-twitter-@CaltransD5-filtered.txt-shallow-20200712-193703-7nvyv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CaltransD5-filtered.txt-shallow-20200712-193703-7nvyv-urls.txt 160668 download
urls-archive.max.fan-twitter-@CaltransD5-filtered.txt-shallow-20200712-193703-7nvyv.json 335 download   job
urls-archive.max.fan-twitter-@CanadaUN-filtered.txt-shallow-20200712-193650-1y116-00000.warc.gz 6246773 download   job
urls-archive.max.fan-twitter-@CanadaUN-filtered.txt-shallow-20200712-193650-1y116-00000.warc.os.cdx.gz 19808 download
urls-archive.max.fan-twitter-@CanadaUN-filtered.txt-shallow-20200712-193650-1y116-meta.warc.gz 14742 download   job
urls-archive.max.fan-twitter-@CanadaUN-filtered.txt-shallow-20200712-193650-1y116-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CanadaUN-filtered.txt-shallow-20200712-193650-1y116-urls.txt 1008 download
urls-archive.max.fan-twitter-@CanadaUN-filtered.txt-shallow-20200712-193650-1y116.json 331 download   job
urls-archive.max.fan-twitter-@Canope_55-filtered.txt-shallow-20200712-193636-33o85-00000.warc.gz 326389377 download   job
urls-archive.max.fan-twitter-@Canope_55-filtered.txt-shallow-20200712-193636-33o85-00000.warc.os.cdx.gz 294616 download
urls-archive.max.fan-twitter-@Canope_55-filtered.txt-shallow-20200712-193636-33o85-meta.warc.gz 158535 download   job
urls-archive.max.fan-twitter-@Canope_55-filtered.txt-shallow-20200712-193636-33o85-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Canope_55-filtered.txt-shallow-20200712-193636-33o85-urls.txt 104951 download
urls-archive.max.fan-twitter-@Canope_55-filtered.txt-shallow-20200712-193636-33o85.json 333 download   job
urls-archive.max.fan-twitter-@CaribbeanUN-filtered.txt-shallow-20200712-193606-b2ifq-00000.warc.gz 2470221 download   job
urls-archive.max.fan-twitter-@CaribbeanUN-filtered.txt-shallow-20200712-193606-b2ifq-00000.warc.os.cdx.gz 6925 download
urls-archive.max.fan-twitter-@CaribbeanUN-filtered.txt-shallow-20200712-193606-b2ifq-meta.warc.gz 7750 download   job
urls-archive.max.fan-twitter-@CaribbeanUN-filtered.txt-shallow-20200712-193606-b2ifq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CaribbeanUN-filtered.txt-shallow-20200712-193606-b2ifq-urls.txt 590 download
urls-archive.max.fan-twitter-@CaribbeanUN-filtered.txt-shallow-20200712-193606-b2ifq.json 337 download   job
urls-archive.max.fan-twitter-@CaringChoices-filtered.txt-shallow-20200712-193605-i14ct-00000.warc.gz 30096220 download   job
urls-archive.max.fan-twitter-@CaringChoices-filtered.txt-shallow-20200712-193605-i14ct-00000.warc.os.cdx.gz 36960 download
urls-archive.max.fan-twitter-@CaringChoices-filtered.txt-shallow-20200712-193605-i14ct-meta.warc.gz 24637 download   job
urls-archive.max.fan-twitter-@CaringChoices-filtered.txt-shallow-20200712-193605-i14ct-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CaringChoices-filtered.txt-shallow-20200712-193605-i14ct-urls.txt 16707 download
urls-archive.max.fan-twitter-@CaringChoices-filtered.txt-shallow-20200712-193605-i14ct.json 341 download   job
urls-archive.max.fan-twitter-@CarlosM_Urzua-filtered.txt-shallow-20200712-193051-b7r89-00000.warc.gz 29772127 download   job
urls-archive.max.fan-twitter-@CarlosM_Urzua-filtered.txt-shallow-20200712-193051-b7r89-00000.warc.os.cdx.gz 149815 download
urls-archive.max.fan-twitter-@CarlosM_Urzua-filtered.txt-shallow-20200712-193051-b7r89-meta.warc.gz 83313 download   job
urls-archive.max.fan-twitter-@CarlosM_Urzua-filtered.txt-shallow-20200712-193051-b7r89-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CarlosM_Urzua-filtered.txt-shallow-20200712-193051-b7r89-urls.txt 8052 download
urls-archive.max.fan-twitter-@CarlosM_Urzua-filtered.txt-shallow-20200712-193051-b7r89.json 341 download   job
urls-archive.max.fan-twitter-@CatherineForNV-filtered.txt-shallow-20200712-192632-9lsnl-00000.warc.gz 230485222 download   job
urls-archive.max.fan-twitter-@CatherineForNV-filtered.txt-shallow-20200712-192632-9lsnl-00000.warc.os.cdx.gz 493741 download
urls-archive.max.fan-twitter-@CatherineForNV-filtered.txt-shallow-20200712-192632-9lsnl-meta.warc.gz 265059 download   job
urls-archive.max.fan-twitter-@CatherineForNV-filtered.txt-shallow-20200712-192632-9lsnl-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CatherineForNV-filtered.txt-shallow-20200712-192632-9lsnl-urls.txt 100626 download
urls-archive.max.fan-twitter-@CatherineForNV-filtered.txt-shallow-20200712-192632-9lsnl.json 343 download   job
urls-archive.max.fan-twitter-@CdLavergne-filtered.txt-shallow-20200712-190907-d22yh-00000.warc.gz 313776677 download   job
urls-archive.max.fan-twitter-@CdLavergne-filtered.txt-shallow-20200712-190907-d22yh-00000.warc.os.cdx.gz 322734 download
urls-archive.max.fan-twitter-@CdLavergne-filtered.txt-shallow-20200712-190907-d22yh-meta.warc.gz 174484 download   job
urls-archive.max.fan-twitter-@CdLavergne-filtered.txt-shallow-20200712-190907-d22yh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CdLavergne-filtered.txt-shallow-20200712-190907-d22yh-urls.txt 68598 download
urls-archive.max.fan-twitter-@CdLavergne-filtered.txt-shallow-20200712-190907-d22yh.json 335 download   job
urls-archive.max.fan-twitter-@CelesteforAR-filtered.txt-shallow-20200712-190855-9nasj-00000.warc.gz 118556759 download   job
urls-archive.max.fan-twitter-@CelesteforAR-filtered.txt-shallow-20200712-190855-9nasj-00000.warc.os.cdx.gz 196605 download
urls-archive.max.fan-twitter-@CelesteforAR-filtered.txt-shallow-20200712-190855-9nasj-meta.warc.gz 109025 download   job
urls-archive.max.fan-twitter-@CelesteforAR-filtered.txt-shallow-20200712-190855-9nasj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CelesteforAR-filtered.txt-shallow-20200712-190855-9nasj-urls.txt 49006 download
urls-archive.max.fan-twitter-@CelesteforAR-filtered.txt-shallow-20200712-190855-9nasj.json 339 download   job
urls-archive.max.fan-twitter-@CenacedAC-filtered.txt-shallow-20200712-190847-629ei-00000.warc.gz 391787015 download   job
urls-archive.max.fan-twitter-@CenacedAC-filtered.txt-shallow-20200712-190847-629ei-00000.warc.os.cdx.gz 307489 download
urls-archive.max.fan-twitter-@CenacedAC-filtered.txt-shallow-20200712-190847-629ei-meta.warc.gz 164609 download   job
urls-archive.max.fan-twitter-@CenacedAC-filtered.txt-shallow-20200712-190847-629ei-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CenacedAC-filtered.txt-shallow-20200712-190847-629ei-urls.txt 160661 download
urls-archive.max.fan-twitter-@CenacedAC-filtered.txt-shallow-20200712-190847-629ei.json 333 download   job
urls-archive.max.fan-twitter-@ChantalBIYA_Cmr-filtered.txt-shallow-20200712-190746-3iyt2-00000.warc.gz 36883302 download   job
urls-archive.max.fan-twitter-@ChantalBIYA_Cmr-filtered.txt-shallow-20200712-190746-3iyt2-00000.warc.os.cdx.gz 85111 download
urls-archive.max.fan-twitter-@ChantalBIYA_Cmr-filtered.txt-shallow-20200712-190746-3iyt2-meta.warc.gz 48774 download   job
urls-archive.max.fan-twitter-@ChantalBIYA_Cmr-filtered.txt-shallow-20200712-190746-3iyt2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChantalBIYA_Cmr-filtered.txt-shallow-20200712-190746-3iyt2-urls.txt 4028 download
urls-archive.max.fan-twitter-@ChantalBIYA_Cmr-filtered.txt-shallow-20200712-190746-3iyt2.json 345 download   job
urls-archive.max.fan-twitter-@CharlestonPD-filtered.txt-shallow-20200712-190548-wflaa-00000.warc.gz 5987989 download   job
urls-archive.max.fan-twitter-@CharlestonPD-filtered.txt-shallow-20200712-190548-wflaa-00000.warc.os.cdx.gz 17100 download
urls-archive.max.fan-twitter-@CharlestonPD-filtered.txt-shallow-20200712-190548-wflaa-meta.warc.gz 13491 download   job
urls-archive.max.fan-twitter-@CharlestonPD-filtered.txt-shallow-20200712-190548-wflaa-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CharlestonPD-filtered.txt-shallow-20200712-190548-wflaa-urls.txt 1620 download
urls-archive.max.fan-twitter-@CharlestonPD-filtered.txt-shallow-20200712-190548-wflaa.json 339 download   job
urls-archive.max.fan-twitter-@Charleston_Fire-filtered.txt-shallow-20200712-190655-enkhg-00000.warc.gz 4035926 download   job
urls-archive.max.fan-twitter-@Charleston_Fire-filtered.txt-shallow-20200712-190655-enkhg-00000.warc.os.cdx.gz 10181 download
urls-archive.max.fan-twitter-@Charleston_Fire-filtered.txt-shallow-20200712-190655-enkhg-meta.warc.gz 9590 download   job
urls-archive.max.fan-twitter-@Charleston_Fire-filtered.txt-shallow-20200712-190655-enkhg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Charleston_Fire-filtered.txt-shallow-20200712-190655-enkhg-urls.txt 630 download
urls-archive.max.fan-twitter-@Charleston_Fire-filtered.txt-shallow-20200712-190655-enkhg.json 345 download   job
urls-archive.max.fan-twitter-@ChickamaugaNPS-filtered.txt-shallow-20200712-190229-aphmq-00000.warc.gz 93611047 download   job
urls-archive.max.fan-twitter-@ChickamaugaNPS-filtered.txt-shallow-20200712-190229-aphmq-00000.warc.os.cdx.gz 119035 download
urls-archive.max.fan-twitter-@ChickamaugaNPS-filtered.txt-shallow-20200712-190229-aphmq-meta.warc.gz 67756 download   job
urls-archive.max.fan-twitter-@ChickamaugaNPS-filtered.txt-shallow-20200712-190229-aphmq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChickamaugaNPS-filtered.txt-shallow-20200712-190229-aphmq-urls.txt 42267 download
urls-archive.max.fan-twitter-@ChickamaugaNPS-filtered.txt-shallow-20200712-190229-aphmq.json 343 download   job
urls-archive.max.fan-twitter-@ChicoChamber-filtered.txt-shallow-20200712-190229-aq424-00000.warc.gz 442537412 download   job
urls-archive.max.fan-twitter-@ChicoChamber-filtered.txt-shallow-20200712-190229-aq424-00000.warc.os.cdx.gz 372584 download
urls-archive.max.fan-twitter-@ChicoChamber-filtered.txt-shallow-20200712-190229-aq424-meta.warc.gz 201315 download   job
urls-archive.max.fan-twitter-@ChicoChamber-filtered.txt-shallow-20200712-190229-aq424-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChicoChamber-filtered.txt-shallow-20200712-190229-aq424-urls.txt 202556 download
urls-archive.max.fan-twitter-@ChicoChamber-filtered.txt-shallow-20200712-190229-aq424.json 339 download   job
urls-archive.max.fan-twitter-@ChicoFD-filtered.txt-shallow-20200712-190226-4iub8-00000.warc.gz 235484336 download   job
urls-archive.max.fan-twitter-@ChicoFD-filtered.txt-shallow-20200712-190226-4iub8-00000.warc.os.cdx.gz 208705 download
urls-archive.max.fan-twitter-@ChicoFD-filtered.txt-shallow-20200712-190226-4iub8-meta.warc.gz 115337 download   job
urls-archive.max.fan-twitter-@ChicoFD-filtered.txt-shallow-20200712-190226-4iub8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChicoFD-filtered.txt-shallow-20200712-190226-4iub8-urls.txt 68129 download
urls-archive.max.fan-twitter-@ChicoFD-filtered.txt-shallow-20200712-190226-4iub8.json 329 download   job
urls-archive.max.fan-twitter-@Chinamission2un-filtered.txt-shallow-20200712-190136-dfaw0-00000.warc.gz 64596862 download   job
urls-archive.max.fan-twitter-@Chinamission2un-filtered.txt-shallow-20200712-190136-dfaw0-00000.warc.os.cdx.gz 130418 download
urls-archive.max.fan-twitter-@Chinamission2un-filtered.txt-shallow-20200712-190136-dfaw0-meta.warc.gz 73256 download   job
urls-archive.max.fan-twitter-@Chinamission2un-filtered.txt-shallow-20200712-190136-dfaw0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Chinamission2un-filtered.txt-shallow-20200712-190136-dfaw0-urls.txt 17010 download
urls-archive.max.fan-twitter-@Chinamission2un-filtered.txt-shallow-20200712-190136-dfaw0.json 345 download   job
urls-archive.max.fan-twitter-@ChoateAdmission-filtered.txt-shallow-20200712-190129-4gel0-00000.warc.gz 16854341 download   job
urls-archive.max.fan-twitter-@ChoateAdmission-filtered.txt-shallow-20200712-190129-4gel0-00000.warc.os.cdx.gz 18816 download
urls-archive.max.fan-twitter-@ChoateAdmission-filtered.txt-shallow-20200712-190129-4gel0-meta.warc.gz 14406 download   job
urls-archive.max.fan-twitter-@ChoateAdmission-filtered.txt-shallow-20200712-190129-4gel0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChoateAdmission-filtered.txt-shallow-20200712-190129-4gel0-urls.txt 4908 download
urls-archive.max.fan-twitter-@ChoateAdmission-filtered.txt-shallow-20200712-190129-4gel0.json 345 download   job
urls-archive.max.fan-twitter-@ChoateAthletics-filtered.txt-shallow-20200712-190121-bbv4e-00000.warc.gz 1247550 download   job
urls-archive.max.fan-twitter-@ChoateAthletics-filtered.txt-shallow-20200712-190121-bbv4e-00000.warc.os.cdx.gz 4697 download
urls-archive.max.fan-twitter-@ChoateAthletics-filtered.txt-shallow-20200712-190121-bbv4e-meta.warc.gz 6521 download   job
urls-archive.max.fan-twitter-@ChoateAthletics-filtered.txt-shallow-20200712-190121-bbv4e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChoateAthletics-filtered.txt-shallow-20200712-190121-bbv4e-urls.txt 63 download
urls-archive.max.fan-twitter-@ChoateAthletics-filtered.txt-shallow-20200712-190121-bbv4e.json 345 download   job
urls-archive.max.fan-twitter-@ChoateGvh-filtered.txt-shallow-20200712-190010-cueiw-00000.warc.gz 19984706 download   job
urls-archive.max.fan-twitter-@ChoateGvh-filtered.txt-shallow-20200712-190010-cueiw-00000.warc.os.cdx.gz 27915 download
urls-archive.max.fan-twitter-@ChoateGvh-filtered.txt-shallow-20200712-190010-cueiw-meta.warc.gz 19478 download   job
urls-archive.max.fan-twitter-@ChoateGvh-filtered.txt-shallow-20200712-190010-cueiw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChoateGvh-filtered.txt-shallow-20200712-190010-cueiw-urls.txt 8835 download
urls-archive.max.fan-twitter-@ChoateGvh-filtered.txt-shallow-20200712-190010-cueiw.json 333 download   job
urls-archive.max.fan-twitter-@ChoateHoops-filtered.txt-shallow-20200712-185950-29xdk-00000.warc.gz 41219256 download   job
urls-archive.max.fan-twitter-@ChoateHoops-filtered.txt-shallow-20200712-185950-29xdk-00000.warc.os.cdx.gz 54714 download
urls-archive.max.fan-twitter-@ChoateHoops-filtered.txt-shallow-20200712-185950-29xdk-meta.warc.gz 33768 download   job
urls-archive.max.fan-twitter-@ChoateHoops-filtered.txt-shallow-20200712-185950-29xdk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChoateHoops-filtered.txt-shallow-20200712-185950-29xdk-urls.txt 17189 download
urls-archive.max.fan-twitter-@ChoateHoops-filtered.txt-shallow-20200712-185950-29xdk.json 337 download   job
urls-archive.max.fan-twitter-@ChrisChristie-filtered.txt-shallow-20200712-183121-94ezs-00000.warc.gz 283303623 download   job
urls-archive.max.fan-twitter-@ChrisChristie-filtered.txt-shallow-20200712-183121-94ezs-00000.warc.os.cdx.gz 831681 download
urls-archive.max.fan-twitter-@ChrisChristie-filtered.txt-shallow-20200712-183121-94ezs-meta.warc.gz 450799 download   job
urls-archive.max.fan-twitter-@ChrisChristie-filtered.txt-shallow-20200712-183121-94ezs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChrisChristie-filtered.txt-shallow-20200712-183121-94ezs-urls.txt 120960 download
urls-archive.max.fan-twitter-@ChrisChristie-filtered.txt-shallow-20200712-183121-94ezs.json 341 download   job
urls-archive.max.fan-twitter-@ChuHoiDick-filtered.txt-shallow-20200712-182704-4dhxm-00000.warc.gz 38347906 download   job
urls-archive.max.fan-twitter-@ChuHoiDick-filtered.txt-shallow-20200712-182704-4dhxm-00000.warc.os.cdx.gz 90804 download
urls-archive.max.fan-twitter-@ChuHoiDick-filtered.txt-shallow-20200712-182704-4dhxm-meta.warc.gz 52281 download   job
urls-archive.max.fan-twitter-@ChuHoiDick-filtered.txt-shallow-20200712-182704-4dhxm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChuHoiDick-filtered.txt-shallow-20200712-182704-4dhxm-urls.txt 8528 download
urls-archive.max.fan-twitter-@ChuHoiDick-filtered.txt-shallow-20200712-182704-4dhxm.json 335 download   job
urls-archive.max.fan-twitter-@Cim_Katja-filtered.txt-shallow-20200712-182704-5pb17-00000.warc.gz 253670153 download   job
urls-archive.max.fan-twitter-@Cim_Katja-filtered.txt-shallow-20200712-182704-5pb17-00000.warc.os.cdx.gz 273819 download
urls-archive.max.fan-twitter-@Cim_Katja-filtered.txt-shallow-20200712-182704-5pb17-meta.warc.gz 150270 download   job
urls-archive.max.fan-twitter-@Cim_Katja-filtered.txt-shallow-20200712-182704-5pb17-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Cim_Katja-filtered.txt-shallow-20200712-182704-5pb17-urls.txt 194850 download
urls-archive.max.fan-twitter-@Cim_Katja-filtered.txt-shallow-20200712-182704-5pb17.json 333 download   job
urls-archive.max.fan-twitter-@CindyPoloFL103-filtered.txt-shallow-20200712-182659-37qxf-00000.warc.gz 339366518 download   job
urls-archive.max.fan-twitter-@CindyPoloFL103-filtered.txt-shallow-20200712-182659-37qxf-00000.warc.os.cdx.gz 485456 download
urls-archive.max.fan-twitter-@CindyPoloFL103-filtered.txt-shallow-20200712-182659-37qxf-meta.warc.gz 262028 download   job
urls-archive.max.fan-twitter-@CindyPoloFL103-filtered.txt-shallow-20200712-182659-37qxf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CindyPoloFL103-filtered.txt-shallow-20200712-182659-37qxf-urls.txt 172811 download
urls-archive.max.fan-twitter-@CindyPoloFL103-filtered.txt-shallow-20200712-182659-37qxf.json 343 download   job
urls-archive.max.fan-twitter-@CityOfRocksNPS-filtered.txt-shallow-20200712-182103-92ltc-00000.warc.gz 21896733 download   job
urls-archive.max.fan-twitter-@CityOfRocksNPS-filtered.txt-shallow-20200712-182103-92ltc-00000.warc.os.cdx.gz 24897 download
urls-archive.max.fan-twitter-@CityOfRocksNPS-filtered.txt-shallow-20200712-182103-92ltc-meta.warc.gz 17776 download   job
urls-archive.max.fan-twitter-@CityOfRocksNPS-filtered.txt-shallow-20200712-182103-92ltc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CityOfRocksNPS-filtered.txt-shallow-20200712-182103-92ltc-urls.txt 11712 download
urls-archive.max.fan-twitter-@CityOfRocksNPS-filtered.txt-shallow-20200712-182103-92ltc.json 343 download   job
urls-archive.max.fan-twitter-@City_of_Mobile-filtered.txt-shallow-20200712-182104-604vf-00000.warc.gz 471660237 download   job
urls-archive.max.fan-twitter-@City_of_Mobile-filtered.txt-shallow-20200712-182104-604vf-00000.warc.os.cdx.gz 569793 download
urls-archive.max.fan-twitter-@City_of_Mobile-filtered.txt-shallow-20200712-182104-604vf-meta.warc.gz 306385 download   job
urls-archive.max.fan-twitter-@City_of_Mobile-filtered.txt-shallow-20200712-182104-604vf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@City_of_Mobile-filtered.txt-shallow-20200712-182104-604vf-urls.txt 213094 download
urls-archive.max.fan-twitter-@City_of_Mobile-filtered.txt-shallow-20200712-182104-604vf.json 343 download   job
urls-archive.max.fan-twitter-@CityofDetroit-filtered.txt-shallow-20200712-182235-c8xx6-00000.warc.gz 718290144 download   job
urls-archive.max.fan-twitter-@CityofDetroit-filtered.txt-shallow-20200712-182235-c8xx6-00000.warc.os.cdx.gz 1027356 download
urls-archive.max.fan-twitter-@CityofDetroit-filtered.txt-shallow-20200712-182235-c8xx6-meta.warc.gz 539783 download   job
urls-archive.max.fan-twitter-@CityofDetroit-filtered.txt-shallow-20200712-182235-c8xx6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CityofDetroit-filtered.txt-shallow-20200712-182235-c8xx6-urls.txt 283883 download
urls-archive.max.fan-twitter-@CityofDetroit-filtered.txt-shallow-20200712-182235-c8xx6.json 341 download   job
urls-archive.max.fan-twitter-@ClaudiaTorrens-filtered.txt-shallow-20200712-181705-6it34-00000.warc.gz 74220609 download   job
urls-archive.max.fan-twitter-@ClaudiaTorrens-filtered.txt-shallow-20200712-181705-6it34-00000.warc.os.cdx.gz 113297 download
urls-archive.max.fan-twitter-@ClaudiaTorrens-filtered.txt-shallow-20200712-181705-6it34-meta.warc.gz 64823 download   job
urls-archive.max.fan-twitter-@ClaudiaTorrens-filtered.txt-shallow-20200712-181705-6it34-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ClaudiaTorrens-filtered.txt-shallow-20200712-181705-6it34-urls.txt 59052 download
urls-archive.max.fan-twitter-@ClaudiaTorrens-filtered.txt-shallow-20200712-181705-6it34.json 343 download   job
urls-archive.max.fan-twitter-@ClimateNPS-filtered.txt-shallow-20200712-181705-dkj5m-00000.warc.gz 37846121 download   job
urls-archive.max.fan-twitter-@ClimateNPS-filtered.txt-shallow-20200712-181705-dkj5m-00000.warc.os.cdx.gz 81618 download
urls-archive.max.fan-twitter-@ClimateNPS-filtered.txt-shallow-20200712-181705-dkj5m-meta.warc.gz 47961 download   job
urls-archive.max.fan-twitter-@ClimateNPS-filtered.txt-shallow-20200712-181705-dkj5m-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ClimateNPS-filtered.txt-shallow-20200712-181705-dkj5m-urls.txt 10232 download
urls-archive.max.fan-twitter-@ClimateNPS-filtered.txt-shallow-20200712-181705-dkj5m.json 335 download   job
urls-archive.max.fan-twitter-@ColumbusGov-filtered.txt-shallow-20200712-174931-58aix-urls.txt 2466 download
urls-archive.max.fan-twitter-@ColumbusGov-filtered.txt-shallow-20200712-174931-58aix.json 337 download   job
urls-archive.max.fan-twitter-@Communications-filtered.txt-shallow-20200712-174922-2kibg-00000.warc.gz 2042180762 download   job
urls-archive.max.fan-twitter-@Communications-filtered.txt-shallow-20200712-174922-2kibg-00000.warc.os.cdx.gz 2055967 download
urls-archive.max.fan-twitter-@Communications-filtered.txt-shallow-20200712-174922-2kibg-meta.warc.gz 1050865 download   job
urls-archive.max.fan-twitter-@Communications-filtered.txt-shallow-20200712-174922-2kibg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Communications-filtered.txt-shallow-20200712-174922-2kibg-urls.txt 572466 download
urls-archive.max.fan-twitter-@Communications-filtered.txt-shallow-20200712-174922-2kibg.json 343 download   job
urls-archive.max.fan-twitter-@CongressmanHice-filtered.txt-shallow-20200712-174810-69zzt-meta.warc.gz 16531 download   job
urls-archive.max.fan-twitter-@CongressmanHice-filtered.txt-shallow-20200712-174810-69zzt-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CongressmanHice-filtered.txt-shallow-20200712-174810-69zzt-urls.txt 1197 download
urls-archive.max.fan-twitter-@CongressmanJVD-filtered.txt-shallow-20200712-174803-84njj-meta.warc.gz 9642 download   job
urls-archive.max.fan-twitter-@CongressmanJVD-filtered.txt-shallow-20200712-174803-84njj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CongressmanJVD-filtered.txt-shallow-20200712-174803-84njj.json 343 download   job
urls-archive.max.fan-twitter-@Conservatives-filtered.txt-shallow-20200712-174018-8vy1s-00000.warc.gz 829738855 download   job
urls-archive.max.fan-twitter-@Conservatives-filtered.txt-shallow-20200712-174018-8vy1s-00000.warc.os.cdx.gz 1694431 download
urls-archive.max.fan-twitter-@Conservatives-filtered.txt-shallow-20200712-174018-8vy1s-meta.warc.gz 903757 download   job
urls-archive.max.fan-twitter-@Conservatives-filtered.txt-shallow-20200712-174018-8vy1s-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Conservatives-filtered.txt-shallow-20200712-174018-8vy1s-urls.txt 126330 download
urls-archive.max.fan-twitter-@Conservatives-filtered.txt-shallow-20200712-174018-8vy1s.json 341 download   job
urls-archive.max.fan-twitter-@CotedivoireOnu-filtered.txt-shallow-20200712-174012-2tyzh-00000.warc.gz 136050848 download   job
urls-archive.max.fan-twitter-@CotedivoireOnu-filtered.txt-shallow-20200712-174012-2tyzh-00000.warc.os.cdx.gz 126572 download
urls-archive.max.fan-twitter-@CotedivoireOnu-filtered.txt-shallow-20200712-174012-2tyzh.json 343 download   job
urls-archive.max.fan-twitter-@CountyofButte-filtered.txt-shallow-20200712-173528-axiu4-meta.warc.gz 196314 download   job
urls-archive.max.fan-twitter-@CountyofButte-filtered.txt-shallow-20200712-173528-axiu4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CountyofButte-filtered.txt-shallow-20200712-173528-axiu4-urls.txt 103589 download
urls-archive.max.fan-twitter-@CountyofButte-filtered.txt-shallow-20200712-173528-axiu4.json 341 download   job
urls-archive.max.fan-twitter-@CruzRoja_MX-filtered.txt-shallow-20200712-172444-3f59m-00000.warc.gz 2641697298 download   job
urls-archive.max.fan-twitter-@CruzRoja_MX-filtered.txt-shallow-20200712-172444-3f59m-00000.warc.os.cdx.gz 3313880 download
urls-archive.max.fan-twitter-@CruzRoja_MX-filtered.txt-shallow-20200712-172444-3f59m-meta.warc.gz 1730516 download   job
urls-archive.max.fan-twitter-@CruzRoja_MX-filtered.txt-shallow-20200712-172444-3f59m-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CruzRoja_MX-filtered.txt-shallow-20200712-172444-3f59m-urls.txt 779994 download
urls-archive.max.fan-twitter-@CruzRoja_MX-filtered.txt-shallow-20200712-172444-3f59m.json 337 download   job
urls-archive.max.fan-twitter-@Crystalpearlco-filtered.txt-shallow-20200712-172441-5vb1v-00000.warc.gz 51552335 download   job
urls-archive.max.fan-twitter-@Crystalpearlco-filtered.txt-shallow-20200712-172441-5vb1v-00000.warc.os.cdx.gz 89671 download
urls-archive.max.fan-twitter-@Crystalpearlco-filtered.txt-shallow-20200712-172441-5vb1v-urls.txt 42159 download
urls-archive.max.fan-twitter-@CzechMFA-filtered.txt-shallow-20200712-171823-2fx71-00000.warc.gz 269967220 download   job
urls-archive.max.fan-twitter-@CzechMFA-filtered.txt-shallow-20200712-171823-2fx71-00000.warc.os.cdx.gz 372098 download
urls-archive.max.fan-twitter-@CzechMFA-filtered.txt-shallow-20200712-171823-2fx71-meta.warc.gz 199918 download   job
urls-archive.max.fan-twitter-@CzechMFA-filtered.txt-shallow-20200712-171823-2fx71-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CzechMFA-filtered.txt-shallow-20200712-171823-2fx71.json 331 download   job
urls-archive.max.fan-twitter-@DCLegalHackers-filtered.txt-shallow-20200712-170348-aszyz-00000.warc.gz 98502342 download   job
urls-archive.max.fan-twitter-@DCLegalHackers-filtered.txt-shallow-20200712-170348-aszyz-00000.warc.os.cdx.gz 143342 download
urls-archive.max.fan-twitter-@DEDeptState-filtered.txt-shallow-20200712-170328-f37vu-00000.warc.gz 1671936 download   job
urls-archive.max.fan-twitter-@DEDeptState-filtered.txt-shallow-20200712-170328-f37vu-00000.warc.os.cdx.gz 5698 download
urls-archive.max.fan-twitter-@DEDeptState-filtered.txt-shallow-20200712-170328-f37vu-meta.warc.gz 7056 download   job
urls-archive.max.fan-twitter-@DEDeptState-filtered.txt-shallow-20200712-170328-f37vu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DEDeptState-filtered.txt-shallow-20200712-170328-f37vu-urls.txt 59 download
urls-archive.max.fan-twitter-@DEDeptState-filtered.txt-shallow-20200712-170328-f37vu.json 337 download   job
urls-archive.max.fan-twitter-@DFATVic-filtered.txt-shallow-20200712-165359-30bxe-meta.warc.gz 69487 download   job
urls-archive.max.fan-twitter-@DFATVic-filtered.txt-shallow-20200712-165359-30bxe-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DFATVic-filtered.txt-shallow-20200712-165359-30bxe-urls.txt 19170 download
urls-archive.max.fan-twitter-@DFATVic-filtered.txt-shallow-20200712-165359-30bxe.json 329 download   job
urls-archive.max.fan-twitter-@DHS_Wolf-filtered.txt-shallow-20200712-165111-dtw1p-00000.warc.gz 136546011 download   job
urls-archive.max.fan-twitter-@DHS_Wolf-filtered.txt-shallow-20200712-165111-dtw1p-00000.warc.os.cdx.gz 396840 download
urls-archive.max.fan-twitter-@DHS_Wolf-filtered.txt-shallow-20200712-165111-dtw1p-meta.warc.gz 212894 download   job
urls-archive.max.fan-twitter-@DHS_Wolf-filtered.txt-shallow-20200712-165111-dtw1p-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DMercer00-filtered.txt-shallow-20200712-162927-9dild-00000.warc.gz 442158514 download   job
urls-archive.max.fan-twitter-@DMercer00-filtered.txt-shallow-20200712-162927-9dild-00000.warc.os.cdx.gz 440363 download
urls-archive.max.fan-twitter-@DMercer00-filtered.txt-shallow-20200712-162927-9dild-meta.warc.gz 237710 download   job
urls-archive.max.fan-twitter-@DMercer00-filtered.txt-shallow-20200712-162927-9dild-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DMercer00-filtered.txt-shallow-20200712-162927-9dild-urls.txt 308275 download
urls-archive.max.fan-twitter-@DailyDevelopmt-filtered.txt-shallow-20200712-171820-12lsj-meta.warc.gz 29626 download   job
urls-archive.max.fan-twitter-@DailyDevelopmt-filtered.txt-shallow-20200712-171820-12lsj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DanCardenMP-filtered.txt-shallow-20200712-171337-5homf-00000.warc.gz 42940535 download   job
urls-archive.max.fan-twitter-@DanCardenMP-filtered.txt-shallow-20200712-171337-5homf-00000.warc.os.cdx.gz 170118 download
urls-archive.max.fan-twitter-@DanCardenMP-filtered.txt-shallow-20200712-171337-5homf-meta.warc.gz 95116 download   job
urls-archive.max.fan-twitter-@DanCardenMP-filtered.txt-shallow-20200712-171337-5homf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DanCardenMP-filtered.txt-shallow-20200712-171337-5homf-urls.txt 12920 download
urls-archive.max.fan-twitter-@DanCardenMP-filtered.txt-shallow-20200712-171337-5homf.json 337 download   job
urls-archive.max.fan-twitter-@DanaiGurira-filtered.txt-shallow-20200712-171339-1x07t-00000.warc.gz 244691491 download   job
urls-archive.max.fan-twitter-@DanaiGurira-filtered.txt-shallow-20200712-171339-1x07t-00000.warc.os.cdx.gz 668087 download
urls-archive.max.fan-twitter-@DanaiGurira-filtered.txt-shallow-20200712-171339-1x07t-meta.warc.gz 355188 download   job
urls-archive.max.fan-twitter-@DanaiGurira-filtered.txt-shallow-20200712-171339-1x07t-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DanaiGurira-filtered.txt-shallow-20200712-171339-1x07t-urls.txt 44125 download
urls-archive.max.fan-twitter-@DanaiGurira-filtered.txt-shallow-20200712-171339-1x07t.json 337 download   job
urls-archive.max.fan-twitter-@Danielle_4PA-filtered.txt-shallow-20200712-171337-8it80-00000.warc.gz 124619565 download   job
urls-archive.max.fan-twitter-@Danielle_4PA-filtered.txt-shallow-20200712-171337-8it80-00000.warc.os.cdx.gz 193895 download
urls-archive.max.fan-twitter-@Danielle_4PA-filtered.txt-shallow-20200712-171337-8it80-meta.warc.gz 107732 download   job
urls-archive.max.fan-twitter-@Danielle_4PA-filtered.txt-shallow-20200712-171337-8it80-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Danielle_4PA-filtered.txt-shallow-20200712-171337-8it80.json 339 download   job
urls-archive.max.fan-twitter-@DannyFaure-filtered.txt-shallow-20200712-171335-asbot-00000.warc.gz 20089214 download   job
urls-archive.max.fan-twitter-@DannyFaure-filtered.txt-shallow-20200712-171335-asbot-00000.warc.os.cdx.gz 33998 download
urls-archive.max.fan-twitter-@DannyFaure-filtered.txt-shallow-20200712-171335-asbot-meta.warc.gz 22461 download   job
urls-archive.max.fan-twitter-@DannyFaure-filtered.txt-shallow-20200712-171335-asbot-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DannyFaure-filtered.txt-shallow-20200712-171335-asbot-urls.txt 7184 download
urls-archive.max.fan-twitter-@Data4SDGs-filtered.txt-shallow-20200712-170916-9i0bh.json 333 download   job
urls-archive.max.fan-twitter-@DataDhrumil-filtered.txt-shallow-20200712-170914-b0bb2-00000.warc.gz 53146182 download   job
urls-archive.max.fan-twitter-@DataDhrumil-filtered.txt-shallow-20200712-170914-b0bb2-00000.warc.os.cdx.gz 168758 download
urls-archive.max.fan-twitter-@DataDhrumil-filtered.txt-shallow-20200712-170914-b0bb2-meta.warc.gz 94243 download   job
urls-archive.max.fan-twitter-@DataDhrumil-filtered.txt-shallow-20200712-170914-b0bb2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DataDhrumil-filtered.txt-shallow-20200712-170914-b0bb2-urls.txt 32785 download
urls-archive.max.fan-twitter-@DataDhrumil-filtered.txt-shallow-20200712-170914-b0bb2.json 337 download   job
urls-archive.max.fan-twitter-@DavidALieb-filtered.txt-shallow-20200712-170515-alpzc-00000.warc.gz 172085302 download   job
urls-archive.max.fan-twitter-@DavidALieb-filtered.txt-shallow-20200712-170515-alpzc-00000.warc.os.cdx.gz 285535 download
urls-archive.max.fan-twitter-@DavidALieb-filtered.txt-shallow-20200712-170515-alpzc-meta.warc.gz 156454 download   job
urls-archive.max.fan-twitter-@DavidALieb-filtered.txt-shallow-20200712-170515-alpzc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DavidALieb-filtered.txt-shallow-20200712-170515-alpzc-urls.txt 143114 download
urls-archive.max.fan-twitter-@DavidALieb-filtered.txt-shallow-20200712-170515-alpzc.json 335 download   job
urls-archive.max.fan-twitter-@DavidFurstNYT-filtered.txt-shallow-20200712-170515-8se45-00000.warc.gz 262642741 download   job
urls-archive.max.fan-twitter-@DavidFurstNYT-filtered.txt-shallow-20200712-170515-8se45-00000.warc.os.cdx.gz 408145 download
urls-archive.max.fan-twitter-@DavidFurstNYT-filtered.txt-shallow-20200712-170515-8se45-meta.warc.gz 221968 download   job
urls-archive.max.fan-twitter-@DavidFurstNYT-filtered.txt-shallow-20200712-170515-8se45-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DavidFurstNYT-filtered.txt-shallow-20200712-170515-8se45.json 341 download   job
urls-archive.max.fan-twitter-@DavidPorter_AP-filtered.txt-shallow-20200712-170509-9dosg-00000.warc.gz 81493895 download   job
urls-archive.max.fan-twitter-@DavidPorter_AP-filtered.txt-shallow-20200712-170509-9dosg-00000.warc.os.cdx.gz 105777 download
urls-archive.max.fan-twitter-@DavidPorter_AP-filtered.txt-shallow-20200712-170509-9dosg-meta.warc.gz 61071 download   job
urls-archive.max.fan-twitter-@DavidPorter_AP-filtered.txt-shallow-20200712-170509-9dosg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DavidYankovich-filtered.txt-shallow-20200712-170509-6u90q-00000.warc.gz 2061675355 download   job
urls-archive.max.fan-twitter-@DavidYankovich-filtered.txt-shallow-20200712-170509-6u90q-00000.warc.os.cdx.gz 5631330 download
urls-archive.max.fan-twitter-@DavidYankovich-filtered.txt-shallow-20200712-170509-6u90q-meta.warc.gz 2955447 download   job
urls-archive.max.fan-twitter-@DavidYankovich-filtered.txt-shallow-20200712-170509-6u90q-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DavidYankovich-filtered.txt-shallow-20200712-170509-6u90q-urls.txt 1192356 download
urls-archive.max.fan-twitter-@DavidYankovich-filtered.txt-shallow-20200712-170509-6u90q.json 343 download   job
urls-archive.max.fan-twitter-@DayemiFdn-filtered.txt-shallow-20200712-170355-2n6k6-00000.warc.gz 17255531 download   job
urls-archive.max.fan-twitter-@DayemiFdn-filtered.txt-shallow-20200712-170355-2n6k6-00000.warc.os.cdx.gz 18068 download
urls-archive.max.fan-twitter-@DayemiFdn-filtered.txt-shallow-20200712-170355-2n6k6-meta.warc.gz 13915 download   job
urls-archive.max.fan-twitter-@DayemiFdn-filtered.txt-shallow-20200712-170355-2n6k6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DayemiFdn-filtered.txt-shallow-20200712-170355-2n6k6.json 333 download   job
urls-archive.max.fan-twitter-@DeWineHustedOH-filtered.txt-shallow-20200712-165403-2jp00-00000.warc.gz 237406064 download   job
urls-archive.max.fan-twitter-@DeWineHustedOH-filtered.txt-shallow-20200712-165403-2jp00-00000.warc.os.cdx.gz 205049 download
urls-archive.max.fan-twitter-@DeWineHustedOH-filtered.txt-shallow-20200712-165403-2jp00-meta.warc.gz 112294 download   job
urls-archive.max.fan-twitter-@DeWineHustedOH-filtered.txt-shallow-20200712-165403-2jp00-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DefenceMinIndia-filtered.txt-shallow-20200712-170326-f2k8c-00000.warc.gz 947480023 download   job
urls-archive.max.fan-twitter-@DefenceMinIndia-filtered.txt-shallow-20200712-170326-f2k8c-00000.warc.os.cdx.gz 1894723 download
urls-archive.max.fan-twitter-@DefenceMinIndia-filtered.txt-shallow-20200712-170326-f2k8c-meta.warc.gz 993335 download   job
urls-archive.max.fan-twitter-@DefenceMinIndia-filtered.txt-shallow-20200712-170326-f2k8c-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DefenceMinIndia-filtered.txt-shallow-20200712-170326-f2k8c-urls.txt 195524 download
urls-archive.max.fan-twitter-@DefenceMinIndia-filtered.txt-shallow-20200712-170326-f2k8c.json 345 download   job
urls-archive.max.fan-twitter-@DelaCruzMusalem-filtered.txt-shallow-20200712-165839-bceef-urls.txt 19848 download
urls-archive.max.fan-twitter-@DelaCruzMusalem-filtered.txt-shallow-20200712-165839-bceef.json 345 download   job
urls-archive.max.fan-twitter-@DelponMichelEM-filtered.txt-shallow-20200712-165836-aaxsj-00000.warc.gz 103754013 download   job
urls-archive.max.fan-twitter-@DelponMichelEM-filtered.txt-shallow-20200712-165836-aaxsj-00000.warc.os.cdx.gz 132144 download
urls-archive.max.fan-twitter-@DelponMichelEM-filtered.txt-shallow-20200712-165836-aaxsj-meta.warc.gz 74598 download   job
urls-archive.max.fan-twitter-@DelponMichelEM-filtered.txt-shallow-20200712-165836-aaxsj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DelponMichelEM-filtered.txt-shallow-20200712-165836-aaxsj-urls.txt 95907 download
urls-archive.max.fan-twitter-@DelponMichelEM-filtered.txt-shallow-20200712-165836-aaxsj.json 343 download   job
urls-archive.max.fan-twitter-@DennisPassa-filtered.txt-shallow-20200712-165833-6jelj-00000.warc.gz 496008917 download   job
urls-archive.max.fan-twitter-@DennisPassa-filtered.txt-shallow-20200712-165833-6jelj-00000.warc.os.cdx.gz 514024 download
urls-archive.max.fan-twitter-@DennisPassa-filtered.txt-shallow-20200712-165833-6jelj-meta.warc.gz 276191 download   job
urls-archive.max.fan-twitter-@DennisPassa-filtered.txt-shallow-20200712-165833-6jelj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DennisPassa-filtered.txt-shallow-20200712-165833-6jelj-urls.txt 316823 download
urls-archive.max.fan-twitter-@DennisPassa-filtered.txt-shallow-20200712-165833-6jelj.json 337 download   job
urls-archive.max.fan-twitter-@DiLebouthillier-filtered.txt-shallow-20200712-164411-aro57-meta.warc.gz 298187 download   job
urls-archive.max.fan-twitter-@DiLebouthillier-filtered.txt-shallow-20200712-164411-aro57-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DiLebouthillier-filtered.txt-shallow-20200712-164411-aro57-urls.txt 159743 download
urls-archive.max.fan-twitter-@DiLebouthillier-filtered.txt-shallow-20200712-164411-aro57.json 345 download   job
urls-archive.max.fan-twitter-@DiazCanelB-filtered.txt-shallow-20200712-165110-cebwu-00000.warc.gz 170139929 download   job
urls-archive.max.fan-twitter-@DiazCanelB-filtered.txt-shallow-20200712-165110-cebwu-00000.warc.os.cdx.gz 300019 download
urls-archive.max.fan-twitter-@DiazCanelB-filtered.txt-shallow-20200712-165110-cebwu-meta.warc.gz 160053 download   job
urls-archive.max.fan-twitter-@DiazCanelB-filtered.txt-shallow-20200712-165110-cebwu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DiazCanelB-filtered.txt-shallow-20200712-165110-cebwu.json 335 download   job
urls-archive.max.fan-twitter-@DigiDiplomats-filtered.txt-shallow-20200712-164933-b0c2r-00000.warc.gz 163865880 download   job
urls-archive.max.fan-twitter-@DigiDiplomats-filtered.txt-shallow-20200712-164933-b0c2r-00000.warc.os.cdx.gz 344164 download
urls-archive.max.fan-twitter-@DigiDiplomats-filtered.txt-shallow-20200712-164933-b0c2r-meta.warc.gz 186826 download   job
urls-archive.max.fan-twitter-@DigiDiplomats-filtered.txt-shallow-20200712-164933-b0c2r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DigiDiplomats-filtered.txt-shallow-20200712-164933-b0c2r.json 341 download   job
urls-archive.max.fan-twitter-@DouthatNYT-filtered.txt-shallow-20200712-162814-eiwoc-00000.warc.gz 2408209964 download   job
urls-archive.max.fan-twitter-@DouthatNYT-filtered.txt-shallow-20200712-162814-eiwoc-00000.warc.os.cdx.gz 6143701 download
urls-archive.max.fan-twitter-@DouthatNYT-filtered.txt-shallow-20200712-162814-eiwoc-meta.warc.gz 3246508 download   job
urls-archive.max.fan-twitter-@DouthatNYT-filtered.txt-shallow-20200712-162814-eiwoc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DouthatNYT-filtered.txt-shallow-20200712-162814-eiwoc-urls.txt 1696072 download
urls-archive.max.fan-twitter-@DouthatNYT-filtered.txt-shallow-20200712-162814-eiwoc.json 335 download   job
urls-archive.max.fan-twitter-@EricLiptonNYT-filtered.txt-shallow-20200712-153627-ejw05-00000.warc.gz 1538725497 download   job
urls-archive.max.fan-twitter-@EricLiptonNYT-filtered.txt-shallow-20200712-153627-ejw05-00000.warc.os.cdx.gz 4728368 download
urls-archive.max.fan-twitter-@EricLiptonNYT-filtered.txt-shallow-20200712-153627-ejw05-meta.warc.gz 2493427 download   job
urls-archive.max.fan-twitter-@EricLiptonNYT-filtered.txt-shallow-20200712-153627-ejw05-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@EricLiptonNYT-filtered.txt-shallow-20200712-153627-ejw05-urls.txt 647132 download
urls-archive.max.fan-twitter-@EricLiptonNYT-filtered.txt-shallow-20200712-153627-ejw05.json 341 download   job
urls-archive.max.fan-twitter-@FAOKnowledge-filtered.txt-shallow-20200712-150320-4ppzs-00000.warc.gz 3507660457 download   job
urls-archive.max.fan-twitter-@FAOKnowledge-filtered.txt-shallow-20200712-150320-4ppzs-00000.warc.os.cdx.gz 5483625 download
urls-archive.max.fan-twitter-@FAOKnowledge-filtered.txt-shallow-20200712-150320-4ppzs-meta.warc.gz 2879739 download   job
urls-archive.max.fan-twitter-@FAOKnowledge-filtered.txt-shallow-20200712-150320-4ppzs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@FAOKnowledge-filtered.txt-shallow-20200712-150320-4ppzs-urls.txt 1535061 download
urls-archive.max.fan-twitter-@FAOKnowledge-filtered.txt-shallow-20200712-150320-4ppzs.json 339 download   job
urls-archive.max.fan-twitter-@FactTank-filtered.txt-shallow-20200712-150323-eprv1-00000.warc.gz 3090476268 download   job
urls-archive.max.fan-twitter-@FactTank-filtered.txt-shallow-20200712-150323-eprv1-00000.warc.os.cdx.gz 6363919 download
urls-archive.max.fan-twitter-@FactTank-filtered.txt-shallow-20200712-150323-eprv1-meta.warc.gz 3368985 download   job
urls-archive.max.fan-twitter-@FactTank-filtered.txt-shallow-20200712-150323-eprv1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@FactTank-filtered.txt-shallow-20200712-150323-eprv1.json 331 download   job
urls-archive.max.fan-twitter-@HayesBrown-filtered.txt-shallow-20200712-120237-5xr1i-00001.warc.gz 3145611024 download   job
urls-archive.max.fan-twitter-@HayesBrown-filtered.txt-shallow-20200712-120237-5xr1i-00001.warc.os.cdx.gz 8814579 download
urls-archive.max.fan-twitter-@HayesBrown-filtered.txt-shallow-20200712-120237-5xr1i-meta.warc.gz 7021592 download   job
urls-archive.max.fan-twitter-@HayesBrown-filtered.txt-shallow-20200712-120237-5xr1i-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HayesBrown-filtered.txt-shallow-20200712-120237-5xr1i-urls.txt 5715336 download
urls-archive.max.fan-twitter-@HayesBrown-filtered.txt-shallow-20200712-120237-5xr1i.json 335 download   job
urls-archive.max.fan-twitter-@Herring_NBA-filtered.txt-shallow-20200712-115000-9dhnu-meta.warc.gz 5482724 download   job
urls-archive.max.fan-twitter-@Herring_NBA-filtered.txt-shallow-20200712-115000-9dhnu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Herring_NBA-filtered.txt-shallow-20200712-115000-9dhnu-urls.txt 5861124 download
urls-archive.max.fan-twitter-@Herring_NBA-filtered.txt-shallow-20200712-115000-9dhnu.json 337 download   job
urls-archive.max.fan-twitter-@campbellnyt-filtered.txt-shallow-20200712-193653-29z4r-00000.warc.gz 281549188 download   job
urls-archive.max.fan-twitter-@campbellnyt-filtered.txt-shallow-20200712-193653-29z4r-00000.warc.os.cdx.gz 584411 download
urls-archive.max.fan-twitter-@campbellnyt-filtered.txt-shallow-20200712-193653-29z4r-meta.warc.gz 310539 download   job
urls-archive.max.fan-twitter-@campbellnyt-filtered.txt-shallow-20200712-193653-29z4r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@campbellnyt-filtered.txt-shallow-20200712-193653-29z4r-urls.txt 165839 download
urls-archive.max.fan-twitter-@campbellnyt-filtered.txt-shallow-20200712-193653-29z4r.json 337 download   job
urls-archive.max.fan-twitter-@carynrousseau-filtered.txt-shallow-20200712-192804-bmgt0-00000.warc.gz 154879851 download   job
urls-archive.max.fan-twitter-@carynrousseau-filtered.txt-shallow-20200712-192804-bmgt0-00000.warc.os.cdx.gz 174716 download
urls-archive.max.fan-twitter-@carynrousseau-filtered.txt-shallow-20200712-192804-bmgt0-meta.warc.gz 97232 download   job
urls-archive.max.fan-twitter-@carynrousseau-filtered.txt-shallow-20200712-192804-bmgt0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@carynrousseau-filtered.txt-shallow-20200712-192804-bmgt0.json 341 download   job
urls-archive.max.fan-twitter-@caseysjournal-filtered.txt-shallow-20200712-192633-ec3v1-00000.warc.gz 191118425 download   job
urls-archive.max.fan-twitter-@caseysjournal-filtered.txt-shallow-20200712-192633-ec3v1-00000.warc.os.cdx.gz 672726 download
urls-archive.max.fan-twitter-@caseysjournal-filtered.txt-shallow-20200712-192633-ec3v1-meta.warc.gz 361894 download   job
urls-archive.max.fan-twitter-@caseysjournal-filtered.txt-shallow-20200712-192633-ec3v1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@caseysjournal-filtered.txt-shallow-20200712-192633-ec3v1-urls.txt 104924 download
urls-archive.max.fan-twitter-@caseysjournal-filtered.txt-shallow-20200712-192633-ec3v1.json 341 download   job
urls-archive.max.fan-twitter-@catherine_lucey-filtered.txt-shallow-20200712-192632-7qgif-00000.warc.gz 276741234 download   job
urls-archive.max.fan-twitter-@catherine_lucey-filtered.txt-shallow-20200712-192632-7qgif-00000.warc.os.cdx.gz 604218 download
urls-archive.max.fan-twitter-@catherine_lucey-filtered.txt-shallow-20200712-192632-7qgif-meta.warc.gz 322078 download   job
urls-archive.max.fan-twitter-@catherine_lucey-filtered.txt-shallow-20200712-192632-7qgif-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@catherine_lucey-filtered.txt-shallow-20200712-192632-7qgif-urls.txt 223802 download
urls-archive.max.fan-twitter-@catherine_lucey-filtered.txt-shallow-20200712-192632-7qgif.json 345 download   job
urls-archive.max.fan-twitter-@ccni-filtered.txt-shallow-20200712-191938-5dpwj-00000.warc.gz 211042205 download   job
urls-archive.max.fan-twitter-@ccni-filtered.txt-shallow-20200712-191938-5dpwj-00000.warc.os.cdx.gz 488375 download
urls-archive.max.fan-twitter-@ccni-filtered.txt-shallow-20200712-191938-5dpwj-meta.warc.gz 258481 download   job
urls-archive.max.fan-twitter-@ccni-filtered.txt-shallow-20200712-191938-5dpwj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ccni-filtered.txt-shallow-20200712-191938-5dpwj-urls.txt 106803 download
urls-archive.max.fan-twitter-@ccni-filtered.txt-shallow-20200712-191938-5dpwj.json 323 download   job
urls-archive.max.fan-twitter-@cearmario-filtered.txt-shallow-20200712-190856-8prue-00000.warc.gz 256742666 download   job
urls-archive.max.fan-twitter-@cearmario-filtered.txt-shallow-20200712-190856-8prue-00000.warc.os.cdx.gz 377407 download
urls-archive.max.fan-twitter-@cearmario-filtered.txt-shallow-20200712-190856-8prue-meta.warc.gz 203745 download   job
urls-archive.max.fan-twitter-@cearmario-filtered.txt-shallow-20200712-190856-8prue-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@cearmario-filtered.txt-shallow-20200712-190856-8prue-urls.txt 187693 download
urls-archive.max.fan-twitter-@cearmario-filtered.txt-shallow-20200712-190856-8prue.json 333 download   job
urls-archive.max.fan-twitter-@charleslongphd-filtered.txt-shallow-20200712-190656-440bc-00000.warc.gz 11293053 download   job
urls-archive.max.fan-twitter-@charleslongphd-filtered.txt-shallow-20200712-190656-440bc-00000.warc.os.cdx.gz 17663 download
urls-archive.max.fan-twitter-@charleslongphd-filtered.txt-shallow-20200712-190656-440bc-meta.warc.gz 13917 download   job
urls-archive.max.fan-twitter-@charleslongphd-filtered.txt-shallow-20200712-190656-440bc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@charleslongphd-filtered.txt-shallow-20200712-190656-440bc-urls.txt 9577 download
urls-archive.max.fan-twitter-@charleslongphd-filtered.txt-shallow-20200712-190656-440bc.json 343 download   job
urls-archive.max.fan-twitter-@charscaddan-filtered.txt-shallow-20200712-190544-2bunq-00000.warc.gz 483798399 download   job
urls-archive.max.fan-twitter-@charscaddan-filtered.txt-shallow-20200712-190544-2bunq-00000.warc.os.cdx.gz 458344 download
urls-archive.max.fan-twitter-@charscaddan-filtered.txt-shallow-20200712-190544-2bunq-meta.warc.gz 245781 download   job
urls-archive.max.fan-twitter-@charscaddan-filtered.txt-shallow-20200712-190544-2bunq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@charscaddan-filtered.txt-shallow-20200712-190544-2bunq-urls.txt 201423 download
urls-archive.max.fan-twitter-@charscaddan-filtered.txt-shallow-20200712-190544-2bunq.json 337 download   job
urls-archive.max.fan-twitter-@chicago-filtered.txt-shallow-20200712-190345-69yzm-00000.warc.gz 14296676 download   job
urls-archive.max.fan-twitter-@chicago-filtered.txt-shallow-20200712-190345-69yzm-00000.warc.os.cdx.gz 38919 download
urls-archive.max.fan-twitter-@chicago-filtered.txt-shallow-20200712-190345-69yzm-meta.warc.gz 24890 download   job
urls-archive.max.fan-twitter-@chicago-filtered.txt-shallow-20200712-190345-69yzm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chicago-filtered.txt-shallow-20200712-190345-69yzm-urls.txt 1760 download
urls-archive.max.fan-twitter-@chicago-filtered.txt-shallow-20200712-190345-69yzm.json 329 download   job
urls-archive.max.fan-twitter-@chicagosmayor-filtered.txt-shallow-20200712-190347-bkcdc-00000.warc.gz 570071099 download   job
urls-archive.max.fan-twitter-@chicagosmayor-filtered.txt-shallow-20200712-190347-bkcdc-00000.warc.os.cdx.gz 1417480 download
urls-archive.max.fan-twitter-@chicagosmayor-filtered.txt-shallow-20200712-190347-bkcdc-meta.warc.gz 750533 download   job
urls-archive.max.fan-twitter-@chicagosmayor-filtered.txt-shallow-20200712-190347-bkcdc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chicagosmayor-filtered.txt-shallow-20200712-190347-bkcdc-urls.txt 149999 download
urls-archive.max.fan-twitter-@chicagosmayor-filtered.txt-shallow-20200712-190347-bkcdc.json 341 download   job
urls-archive.max.fan-twitter-@choate_robotics-filtered.txt-shallow-20200712-185928-211vu-00000.warc.gz 38222110 download   job
urls-archive.max.fan-twitter-@choate_robotics-filtered.txt-shallow-20200712-185928-211vu-00000.warc.os.cdx.gz 30038 download
urls-archive.max.fan-twitter-@choate_robotics-filtered.txt-shallow-20200712-185928-211vu-meta.warc.gz 20523 download   job
urls-archive.max.fan-twitter-@choate_robotics-filtered.txt-shallow-20200712-185928-211vu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@choate_robotics-filtered.txt-shallow-20200712-185928-211vu-urls.txt 12958 download
urls-archive.max.fan-twitter-@choate_robotics-filtered.txt-shallow-20200712-185928-211vu.json 345 download   job
urls-archive.max.fan-twitter-@choatenews-filtered.txt-shallow-20200712-185930-f58nv-00000.warc.gz 17602013 download   job
urls-archive.max.fan-twitter-@choatenews-filtered.txt-shallow-20200712-185930-f58nv-00000.warc.os.cdx.gz 21380 download
urls-archive.max.fan-twitter-@choatenews-filtered.txt-shallow-20200712-185930-f58nv-meta.warc.gz 15951 download   job
urls-archive.max.fan-twitter-@choatenews-filtered.txt-shallow-20200712-185930-f58nv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@choatenews-filtered.txt-shallow-20200712-185930-f58nv-urls.txt 10287 download
urls-archive.max.fan-twitter-@choatenews-filtered.txt-shallow-20200712-185930-f58nv.json 335 download   job
urls-archive.max.fan-twitter-@chowtingagnes-filtered.txt-shallow-20200712-185924-1jij6-00000.warc.gz 243332932 download   job
urls-archive.max.fan-twitter-@chowtingagnes-filtered.txt-shallow-20200712-185924-1jij6-00000.warc.os.cdx.gz 535050 download
urls-archive.max.fan-twitter-@chowtingagnes-filtered.txt-shallow-20200712-185924-1jij6-meta.warc.gz 288821 download   job
urls-archive.max.fan-twitter-@chowtingagnes-filtered.txt-shallow-20200712-185924-1jij6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chowtingagnes-filtered.txt-shallow-20200712-185924-1jij6-urls.txt 46045 download
urls-archive.max.fan-twitter-@chowtingagnes-filtered.txt-shallow-20200712-185924-1jij6.json 341 download   job
urls-archive.max.fan-twitter-@chrisshermanAP-filtered.txt-shallow-20200712-183121-624uj-00000.warc.gz 394791045 download   job
urls-archive.max.fan-twitter-@chrisshermanAP-filtered.txt-shallow-20200712-183121-624uj-00000.warc.os.cdx.gz 538031 download
urls-archive.max.fan-twitter-@chrisshermanAP-filtered.txt-shallow-20200712-183121-624uj-meta.warc.gz 286186 download   job
urls-archive.max.fan-twitter-@chrisshermanAP-filtered.txt-shallow-20200712-183121-624uj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chrisshermanAP-filtered.txt-shallow-20200712-183121-624uj-urls.txt 370419 download
urls-archive.max.fan-twitter-@chrisshermanAP-filtered.txt-shallow-20200712-183121-624uj.json 343 download   job
urls-archive.max.fan-twitter-@ckrausss-filtered.txt-shallow-20200712-181708-9tupv-00000.warc.gz 123288546 download   job
urls-archive.max.fan-twitter-@ckrausss-filtered.txt-shallow-20200712-181708-9tupv-00000.warc.os.cdx.gz 279827 download
urls-archive.max.fan-twitter-@ckrausss-filtered.txt-shallow-20200712-181708-9tupv-meta.warc.gz 154427 download   job
urls-archive.max.fan-twitter-@ckrausss-filtered.txt-shallow-20200712-181708-9tupv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ckrausss-filtered.txt-shallow-20200712-181708-9tupv-urls.txt 85225 download
urls-archive.max.fan-twitter-@ckrausss-filtered.txt-shallow-20200712-181708-9tupv.json 331 download   job
urls-archive.max.fan-twitter-@cmyeaton-filtered.txt-shallow-20200712-174958-chk04-00000.warc.gz 97341151 download   job
urls-archive.max.fan-twitter-@cmyeaton-filtered.txt-shallow-20200712-174958-chk04-00000.warc.os.cdx.gz 425332 download
urls-archive.max.fan-twitter-@cmyeaton-filtered.txt-shallow-20200712-174958-chk04-meta.warc.gz 230889 download   job
urls-archive.max.fan-twitter-@cmyeaton-filtered.txt-shallow-20200712-174958-chk04-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@cmyeaton-filtered.txt-shallow-20200712-174958-chk04-urls.txt 37800 download
urls-archive.max.fan-twitter-@cmyeaton-filtered.txt-shallow-20200712-174958-chk04.json 331 download   job
urls-archive.max.fan-twitter-@cologop-filtered.txt-shallow-20200712-174941-7vllr-00000.warc.gz 631754354 download   job
urls-archive.max.fan-twitter-@cologop-filtered.txt-shallow-20200712-174941-7vllr-00000.warc.os.cdx.gz 943072 download
urls-archive.max.fan-twitter-@cologop-filtered.txt-shallow-20200712-174941-7vllr-meta.warc.gz 510483 download   job
urls-archive.max.fan-twitter-@cologop-filtered.txt-shallow-20200712-174941-7vllr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@cologop-filtered.txt-shallow-20200712-174941-7vllr-urls.txt 334602 download
urls-archive.max.fan-twitter-@cologop-filtered.txt-shallow-20200712-174941-7vllr.json 329 download   job
urls-archive.max.fan-twitter-@colvinj-filtered.txt-shallow-20200712-174930-9a15c-00000.warc.gz 1645632274 download   job
urls-archive.max.fan-twitter-@colvinj-filtered.txt-shallow-20200712-174930-9a15c-00000.warc.os.cdx.gz 3106412 download
urls-archive.max.fan-twitter-@colvinj-filtered.txt-shallow-20200712-174930-9a15c-meta.warc.gz 1633423 download   job
urls-archive.max.fan-twitter-@colvinj-filtered.txt-shallow-20200712-174930-9a15c-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@colvinj-filtered.txt-shallow-20200712-174930-9a15c-urls.txt 1107161 download
urls-archive.max.fan-twitter-@colvinj-filtered.txt-shallow-20200712-174930-9a15c.json 329 download   job
urls-archive.max.fan-twitter-@conniebritton-filtered.txt-shallow-20200712-174026-15l3b-00000.warc.gz 175630416 download   job
urls-archive.max.fan-twitter-@conniebritton-filtered.txt-shallow-20200712-174026-15l3b-00000.warc.os.cdx.gz 763974 download
urls-archive.max.fan-twitter-@conniebritton-filtered.txt-shallow-20200712-174026-15l3b-meta.warc.gz 412097 download   job
urls-archive.max.fan-twitter-@conniebritton-filtered.txt-shallow-20200712-174026-15l3b-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@conniebritton-filtered.txt-shallow-20200712-174026-15l3b-urls.txt 78124 download
urls-archive.max.fan-twitter-@conniebritton-filtered.txt-shallow-20200712-174026-15l3b.json 341 download   job
urls-archive.max.fan-twitter-@countyofsb-filtered.txt-shallow-20200712-173503-8qd78-00000.warc.gz 401816984 download   job
urls-archive.max.fan-twitter-@countyofsb-filtered.txt-shallow-20200712-173503-8qd78-00000.warc.os.cdx.gz 622961 download
urls-archive.max.fan-twitter-@countyofsb-filtered.txt-shallow-20200712-173503-8qd78-meta.warc.gz 335791 download   job
urls-archive.max.fan-twitter-@countyofsb-filtered.txt-shallow-20200712-173503-8qd78-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@countyofsb-filtered.txt-shallow-20200712-173503-8qd78-urls.txt 218701 download
urls-archive.max.fan-twitter-@countyofsb-filtered.txt-shallow-20200712-173503-8qd78.json 335 download   job
urls-archive.max.fan-twitter-@ctlong1-filtered.txt-shallow-20200712-171830-6rb0r-00000.warc.gz 103117494 download   job
urls-archive.max.fan-twitter-@ctlong1-filtered.txt-shallow-20200712-171830-6rb0r-00000.warc.os.cdx.gz 174984 download
urls-archive.max.fan-twitter-@ctlong1-filtered.txt-shallow-20200712-171830-6rb0r-meta.warc.gz 98365 download   job
urls-archive.max.fan-twitter-@ctlong1-filtered.txt-shallow-20200712-171830-6rb0r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ctlong1-filtered.txt-shallow-20200712-171830-6rb0r-urls.txt 71507 download
urls-archive.max.fan-twitter-@ctlong1-filtered.txt-shallow-20200712-171830-6rb0r.json 329 download   job
urls-archive.max.fan-twitter-@daldenAP-filtered.txt-shallow-20200712-171402-bm044-urls.txt 2821 download
urls-archive.max.fan-twitter-@daldenAP-filtered.txt-shallow-20200712-171402-bm044.json 331 download   job
urls-archive.max.fan-twitter-@danablanton_nyc-filtered.txt-shallow-20200712-171350-dihmz-00000.warc.gz 44963142 download   job
urls-archive.max.fan-twitter-@danablanton_nyc-filtered.txt-shallow-20200712-171350-dihmz-00000.warc.os.cdx.gz 89587 download
urls-archive.max.fan-twitter-@danablanton_nyc-filtered.txt-shallow-20200712-171350-dihmz-meta.warc.gz 52968 download   job
urls-archive.max.fan-twitter-@danablanton_nyc-filtered.txt-shallow-20200712-171350-dihmz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@danablanton_nyc-filtered.txt-shallow-20200712-171350-dihmz.json 345 download   job
urls-archive.max.fan-twitter-@dannyhakim-filtered.txt-shallow-20200712-171332-av04k-00000.warc.gz 338982184 download   job
urls-archive.max.fan-twitter-@dannyhakim-filtered.txt-shallow-20200712-171332-av04k-00000.warc.os.cdx.gz 720242 download
urls-archive.max.fan-twitter-@dannyhakim-filtered.txt-shallow-20200712-171332-av04k-urls.txt 263252 download
urls-archive.max.fan-twitter-@dannyhakim-filtered.txt-shallow-20200712-171332-av04k.json 335 download   job
urls-archive.max.fan-twitter-@davidzucchino-filtered.txt-shallow-20200712-170359-3oh7h-00000.warc.gz 41602267 download   job
urls-archive.max.fan-twitter-@davidzucchino-filtered.txt-shallow-20200712-170359-3oh7h-00000.warc.os.cdx.gz 77990 download
urls-archive.max.fan-twitter-@davidzucchino-filtered.txt-shallow-20200712-170359-3oh7h-meta.warc.gz 46375 download   job
urls-archive.max.fan-twitter-@davidzucchino-filtered.txt-shallow-20200712-170359-3oh7h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@davidzucchino-filtered.txt-shallow-20200712-170359-3oh7h.json 341 download   job
urls-archive.max.fan-twitter-@dclark5-filtered.txt-shallow-20200712-170352-gv72j-00000.warc.gz 29097659 download   job
urls-archive.max.fan-twitter-@dclark5-filtered.txt-shallow-20200712-170352-gv72j-00000.warc.os.cdx.gz 35815 download
urls-archive.max.fan-twitter-@dclark5-filtered.txt-shallow-20200712-170352-gv72j-meta.warc.gz 24018 download   job
urls-archive.max.fan-twitter-@dclark5-filtered.txt-shallow-20200712-170352-gv72j-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@dclark5-filtered.txt-shallow-20200712-170352-gv72j.json 329 download   job
urls-archive.max.fan-twitter-@deepakm71-filtered.txt-shallow-20200712-170326-8im46-00000.warc.gz 56665562 download   job
urls-archive.max.fan-twitter-@deepakm71-filtered.txt-shallow-20200712-170326-8im46-00000.warc.os.cdx.gz 54325 download
urls-archive.max.fan-twitter-@deepakm71-filtered.txt-shallow-20200712-170326-8im46-meta.warc.gz 33590 download   job
urls-archive.max.fan-twitter-@deepakm71-filtered.txt-shallow-20200712-170326-8im46-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@deepakm71-filtered.txt-shallow-20200712-170326-8im46.json 333 download   job
urls-archive.max.fan-twitter-@demosisto-filtered.txt-shallow-20200712-165836-8945h-meta.warc.gz 502885 download   job
urls-archive.max.fan-twitter-@demosisto-filtered.txt-shallow-20200712-165836-8945h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@demosisto-filtered.txt-shallow-20200712-165836-8945h-urls.txt 74898 download
urls-archive.max.fan-twitter-@demosisto-filtered.txt-shallow-20200712-165836-8945h.json 333 download   job
urls-archive.max.fan-twitter-@dgblankinship-filtered.txt-shallow-20200712-165357-cwz2b-00000.warc.gz 446176388 download   job
urls-archive.max.fan-twitter-@dgblankinship-filtered.txt-shallow-20200712-165357-cwz2b-00000.warc.os.cdx.gz 553311 download
urls-archive.max.fan-twitter-@dgblankinship-filtered.txt-shallow-20200712-165357-cwz2b-meta.warc.gz 295742 download   job
urls-archive.max.fan-twitter-@dgblankinship-filtered.txt-shallow-20200712-165357-cwz2b-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@dgblankinship-filtered.txt-shallow-20200712-165357-cwz2b-urls.txt 377875 download
urls-archive.max.fan-twitter-@dgblankinship-filtered.txt-shallow-20200712-165357-cwz2b.json 341 download   job
urls-archive.max.fan-twitter-@dhopkins1776-filtered.txt-shallow-20200712-165351-gio9w-00000.warc.gz 103319750 download   job
urls-archive.max.fan-twitter-@dhopkins1776-filtered.txt-shallow-20200712-165351-gio9w-00000.warc.os.cdx.gz 216469 download
urls-archive.max.fan-twitter-@dhopkins1776-filtered.txt-shallow-20200712-165351-gio9w.json 339 download   job
urls-archive.max.fan-twitter-@dionnesearcey-filtered.txt-shallow-20200712-164404-by5xx-00000.warc.gz 246073530 download   job
urls-archive.max.fan-twitter-@dionnesearcey-filtered.txt-shallow-20200712-164404-by5xx-00000.warc.os.cdx.gz 614352 download
urls-archive.max.fan-twitter-@dionnesearcey-filtered.txt-shallow-20200712-164404-by5xx.json 341 download   job
urls-archive.max.fan-twitter-@elizaorlins-filtered.txt-shallow-20200712-160409-a8hxc-00000.warc.gz 3709510376 download   job
urls-archive.max.fan-twitter-@elizaorlins-filtered.txt-shallow-20200712-160409-a8hxc-00000.warc.os.cdx.gz 6172803 download
urls-archive.max.fan-twitter-@elizaorlins-filtered.txt-shallow-20200712-160409-a8hxc-meta.warc.gz 3225804 download   job
urls-archive.max.fan-twitter-@elizaorlins-filtered.txt-shallow-20200712-160409-a8hxc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@elizaorlins-filtered.txt-shallow-20200712-160409-a8hxc-urls.txt 2628542 download
urls-archive.max.fan-twitter-@emarvelous-filtered.txt-shallow-20200712-154748-e1td3-00000.warc.gz 2600138843 download   job
urls-archive.max.fan-twitter-@emarvelous-filtered.txt-shallow-20200712-154748-e1td3-00000.warc.os.cdx.gz 4041167 download
urls-archive.max.fan-twitter-@emarvelous-filtered.txt-shallow-20200712-154748-e1td3-meta.warc.gz 2103743 download   job
urls-archive.max.fan-twitter-@emarvelous-filtered.txt-shallow-20200712-154748-e1td3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@emarvelous-filtered.txt-shallow-20200712-154748-e1td3-urls.txt 1758382 download
urls-archive.max.fan-twitter-@emarvelous-filtered.txt-shallow-20200712-154748-e1td3.json 335 download   job
urls-archive.max.fan-twitter-@evoespueblo-filtered.txt-shallow-20200712-151610-457o0-00000.warc.gz 3808647447 download   job
urls-archive.max.fan-twitter-@evoespueblo-filtered.txt-shallow-20200712-151610-457o0-00000.warc.os.cdx.gz 5989530 download
urls-archive.max.fan-twitter-@evoespueblo-filtered.txt-shallow-20200712-151610-457o0-urls.txt 814527 download
urls-archive.max.fan-twitter-@evoespueblo-filtered.txt-shallow-20200712-151610-457o0.json 337 download   job
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9-00000.warc.gz 5368737159 download   job
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9-00000.warc.os.cdx.gz 6069883 download
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9-00001.warc.gz 305306766 download   job
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9-00001.warc.os.cdx.gz 513031 download
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9-meta.warc.gz 3490389 download   job
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9-urls.txt 3481819 download
urls-archive.max.fan-twitter-@friscojosh-filtered.txt-shallow-20200712-141447-d7ex9.json 335 download   job
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg-00000.warc.gz 5368712088 download   job
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg-00000.warc.os.cdx.gz 6882341 download
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg-00001.warc.gz 374482606 download   job
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg-00001.warc.os.cdx.gz 913905 download
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg-meta.warc.gz 4092846 download   job
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg-urls.txt 4175783 download
urls-archive.max.fan-twitter-@gbellseattle-filtered.txt-shallow-20200712-140422-5dyqg.json 339 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00211.warc.gz 5381920372 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00211.warc.os.cdx.gz 1086917 download
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00130.warc.gz 5369320381 download   job
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00130.warc.os.cdx.gz 1798528 download
urls-transfer.notkiska.pw-twitter-%23qanon-shallow-20200531-053932-8yw79-00078.warc.gz 5397135175 download   job
urls-transfer.notkiska.pw-twitter-%23qanon-shallow-20200531-053932-8yw79-00078.warc.os.cdx.gz 2468708 download
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00002.warc.gz 5371593740 download   job
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00002.warc.os.cdx.gz 1137401 download
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00003.warc.gz 5370385294 download   job
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00003.warc.os.cdx.gz 911301 download
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00004.warc.gz 5368759405 download   job
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00004.warc.os.cdx.gz 481130 download
urls-transfer.notkiska.pw-vote-usa_org-twitter-accounts-outlinks.1.txt-shallow-20200609-230435-7k4tj-00083.warc.gz 5488998641 download   job
urls-transfer.notkiska.pw-vote-usa_org-twitter-accounts-outlinks.1.txt-shallow-20200609-230435-7k4tj-00083.warc.os.cdx.gz 491908 download
urls-transfer.notkiska.pw-www.cfr.org_rerun_403s.txt-shallow-20200712-171252-9migq-urls.txt 1680968 download
urls-transfer.notkiska.pw-www.cfr.org_rerun_403s.txt-shallow-20200712-180528-9migq-aborted-00000.warc.gz 2003862 download   job
urls-transfer.notkiska.pw-www.cfr.org_rerun_403s.txt-shallow-20200712-180528-9migq-aborted-00000.warc.os.cdx.gz 31860 download
urls-transfer.notkiska.pw-www.cfr.org_rerun_403s.txt-shallow-20200712-180528-9migq-aborted-wpull.log.gz 17360 download
urls-transfer.notkiska.pw-www.cfr.org_rerun_403s.txt-shallow-20200712-180528-9migq-aborted.json 344 download   job
urls-transfer.notkiska.pw-www.cfr.org_rerun_403s.txt-shallow-20200712-180528-9migq-urls.txt 1680968 download
urls-transfer.notkiska.pw-www.cfr403s.txt-shallow-20200712-161440-2cotg-urls.txt 1680968 download
www.bigrigs.com.au-inf-20200528-061953-52odw-00067.warc.gz 5418949662 download   job
www.bigrigs.com.au-inf-20200528-061953-52odw-00067.warc.os.cdx.gz 3752738 download
www.bukarest.balassiintezet.hu-inf-20200707-090517-doafk-00003.warc.gz 5372331859 download   job
www.bukarest.balassiintezet.hu-inf-20200707-090517-doafk-00003.warc.os.cdx.gz 1377357 download
www.cfr.org-shallow-20200712-175046-f2oeg.json 329 download   job
www.chinadaily.com.cn-inf-20190927-102302-505np-00465.warc.gz 1073963211 download   job
www.chinadaily.com.cn-inf-20190927-102302-505np-00465.warc.os.cdx.gz 1117134 download
www.mydiplomaticlife.com-inf-20200712-183619-96vsq-00000.warc.gz 493560382 download   job
www.mydiplomaticlife.com-inf-20200712-183619-96vsq-00000.warc.os.cdx.gz 461799 download
www.mydiplomaticlife.com-inf-20200712-183619-96vsq-meta.warc.gz 278420 download   job
www.mydiplomaticlife.com-inf-20200712-183619-96vsq-meta.warc.os.cdx.gz 47 download
www.mydiplomaticlife.com-inf-20200712-183619-96vsq.json 249 download   job
www.qiagen.com-inf-20200621-061202-1wax4-00028.warc.gz 5369633737 download   job
www.qiagen.com-inf-20200621-061202-1wax4-00028.warc.os.cdx.gz 3590544 download