Item archiveteam_archivebot_go_20200711210003

View on Internet Archive

Filename Size
ablogfullofdemons.blogspot.com-inf-20200711-184230-dfv3d-00000.warc.gz 129726917 download   job
ablogfullofdemons.blogspot.com-inf-20200711-184230-dfv3d-00000.warc.os.cdx.gz 197964 download
ablogfullofdemons.blogspot.com-inf-20200711-184230-dfv3d-meta.warc.gz 122649 download   job
ablogfullofdemons.blogspot.com-inf-20200711-184230-dfv3d-meta.warc.os.cdx.gz 47 download
ablogfullofdemons.blogspot.com-inf-20200711-184230-dfv3d.json 255 download   job
againstthepagans.blogspot.com-inf-20200711-184230-99m4z-meta.warc.gz 124608 download   job
againstthepagans.blogspot.com-inf-20200711-184230-99m4z-meta.warc.os.cdx.gz 47 download
againstthepagans.blogspot.com-inf-20200711-184230-99m4z.json 254 download   job
ajmanzanedo.blogspot.com-inf-20200711-184232-4j23l-meta.warc.gz 179597 download   job
ajmanzanedo.blogspot.com-inf-20200711-184232-4j23l-meta.warc.os.cdx.gz 47 download
ajmanzanedo.blogspot.com-inf-20200711-184232-4j23l.json 249 download   job
alles-ist-zahl.blogspot.com-inf-20200711-184243-11927-00000.warc.gz 5373529644 download   job
alles-ist-zahl.blogspot.com-inf-20200711-184243-11927-00000.warc.os.cdx.gz 1254057 download
alles-ist-zahl.blogspot.com-inf-20200711-184243-11927-meta.warc.gz 826997 download   job
alles-ist-zahl.blogspot.com-inf-20200711-184243-11927-meta.warc.os.cdx.gz 47 download
animatekratocracy.blogspot.com-inf-20200711-184244-dqv8h-00000.warc.gz 120043628 download   job
animatekratocracy.blogspot.com-inf-20200711-184244-dqv8h-00000.warc.os.cdx.gz 157231 download
animatekratocracy.blogspot.com-inf-20200711-184244-dqv8h-meta.warc.gz 107142 download   job
animatekratocracy.blogspot.com-inf-20200711-184244-dqv8h-meta.warc.os.cdx.gz 47 download
animatekratocracy.blogspot.com-inf-20200711-184244-dqv8h.json 255 download   job
appendixm.blogspot.com-inf-20200711-184913-exmlr-00000.warc.gz 596129280 download   job
appendixm.blogspot.com-inf-20200711-184913-exmlr-00000.warc.os.cdx.gz 632537 download
appendixm.blogspot.com-inf-20200711-184913-exmlr-meta.warc.gz 425072 download   job
appendixm.blogspot.com-inf-20200711-184913-exmlr-meta.warc.os.cdx.gz 47 download
appendixm.blogspot.com-inf-20200711-184913-exmlr.json 247 download   job
archiveteam_archivebot_go_20200711210003.cdx.gz 95869357 download
archiveteam_archivebot_go_20200711210003.cdx.idx 87211 download
archiveteam_archivebot_go_20200711210003_files.xml 0 download
archiveteam_archivebot_go_20200711210003_meta.sqlite 1323008 download
archiveteam_archivebot_go_20200711210003_meta.xml 969 download
aswampinspace.blogspot.com-inf-20200711-184924-5we38-00000.warc.gz 773006395 download   job
aswampinspace.blogspot.com-inf-20200711-184924-5we38-00000.warc.os.cdx.gz 859976 download
aswampinspace.blogspot.com-inf-20200711-184924-5we38-meta.warc.gz 558434 download   job
aswampinspace.blogspot.com-inf-20200711-184924-5we38-meta.warc.os.cdx.gz 47 download
aswampinspace.blogspot.com-inf-20200711-184924-5we38.json 251 download   job
barbariansofprovo.blogspot.com-inf-20200711-184925-2sm63-00000.warc.gz 1893363622 download   job
barbariansofprovo.blogspot.com-inf-20200711-184925-2sm63-00000.warc.os.cdx.gz 356442 download
barbariansofprovo.blogspot.com-inf-20200711-184925-2sm63-meta.warc.gz 246386 download   job
barbariansofprovo.blogspot.com-inf-20200711-184925-2sm63-meta.warc.os.cdx.gz 47 download
barbariansofprovo.blogspot.com-inf-20200711-184925-2sm63.json 255 download   job
bravehalflingpublishing.blogspot.com-inf-20200711-184932-bz7mh-meta.warc.gz 68640 download   job
bravehalflingpublishing.blogspot.com-inf-20200711-184932-bz7mh-meta.warc.os.cdx.gz 47 download
cliqz.com-inf-20200501-194732-82yzf-00247.warc.gz 5378489283 download   job
cliqz.com-inf-20200501-194732-82yzf-00247.warc.os.cdx.gz 2536798 download
history/files/www.qiagen.com-inf-20200621-061202-1wax4-00023.warc.gz.~1~ 5368851996 download
jackstoolbox.wordpress.com-inf-20200711-171853-bc1xq.json 251 download   job
jaspersrantings.blogspot.com-inf-20200711-171912-7ko53-00000.warc.gz 932040162 download   job
jaspersrantings.blogspot.com-inf-20200711-171912-7ko53-00000.warc.os.cdx.gz 1130925 download
jaspersrantings.blogspot.com-inf-20200711-171912-7ko53-meta.warc.gz 714202 download   job
jaspersrantings.blogspot.com-inf-20200711-171912-7ko53-meta.warc.os.cdx.gz 47 download
jaspersrantings.wordpress.com-inf-20200711-171906-9nf4l-00000.warc.gz 2423738325 download   job
jaspersrantings.wordpress.com-inf-20200711-171906-9nf4l-00000.warc.os.cdx.gz 756491 download
jaspersrantings.wordpress.com-inf-20200711-171906-9nf4l-meta.warc.gz 513234 download   job
jaspersrantings.wordpress.com-inf-20200711-171906-9nf4l-meta.warc.os.cdx.gz 47 download
jaspersrantings.wordpress.com-inf-20200711-171906-9nf4l.json 254 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00107.warc.gz 6032279465 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00107.warc.os.cdx.gz 262361 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00108.warc.gz 5562910263 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00108.warc.os.cdx.gz 3565 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00109.warc.gz 5515052429 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00109.warc.os.cdx.gz 15415 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00110.warc.gz 5623382271 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00110.warc.os.cdx.gz 1302 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00111.warc.gz 5821912818 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00111.warc.os.cdx.gz 2465 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00112.warc.gz 5582966392 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00112.warc.os.cdx.gz 9172 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00114.warc.gz 5913364862 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00114.warc.os.cdx.gz 3261 download
news.cision.com-inf-20191109-005415-egdys-00423.warc.gz 5368863598 download   job
news.cision.com-inf-20191109-005415-egdys-00423.warc.os.cdx.gz 6377338 download
olctsd.wordpress.com-inf-20200711-172712-b6x8a-00000.warc.gz 310215705 download   job
olctsd.wordpress.com-inf-20200711-172712-b6x8a-00000.warc.os.cdx.gz 676780 download
olctsd.wordpress.com-inf-20200711-172712-b6x8a.json 245 download   job
player.fm-inf-20200501-233943-6recr-00683.warc.gz 5449299638 download   job
player.fm-inf-20200501-233943-6recr-00683.warc.os.cdx.gz 385817 download
qzlx.12371.cn-inf-20200711-143323-2u1o0-00000.warc.gz 1353366550 download   job
qzlx.12371.cn-inf-20200711-143323-2u1o0-00000.warc.os.cdx.gz 3227188 download
qzlx.12371.cn-inf-20200711-143323-2u1o0-meta.warc.gz 2007054 download   job
qzlx.12371.cn-inf-20200711-143323-2u1o0-meta.warc.os.cdx.gz 47 download
qzlx.12371.cn-inf-20200711-143323-2u1o0.json 242 download   job
redboxvancouver.wordpress.com-inf-20200711-172720-336cg-meta.warc.gz 664298 download   job
redboxvancouver.wordpress.com-inf-20200711-172720-336cg-meta.warc.os.cdx.gz 47 download
redboxvancouver.wordpress.com-inf-20200711-172720-336cg.json 254 download   job
seevaa.tistory.com-inf-20200711-054757-2ry21-00006.warc.gz 4552290 download   job
seevaa.tistory.com-inf-20200711-054757-2ry21-00006.warc.os.cdx.gz 17974 download
seevaa.tistory.com-inf-20200711-054757-2ry21-meta.warc.gz 2855532 download   job
seevaa.tistory.com-inf-20200711-054757-2ry21-meta.warc.os.cdx.gz 47 download
seevaa.tistory.com-inf-20200711-054757-2ry21.json 243 download   job
skarlocs.wordpress.com-inf-20200711-172727-7ktvu-meta.warc.gz 656016 download   job
skarlocs.wordpress.com-inf-20200711-172727-7ktvu-meta.warc.os.cdx.gz 47 download
skarlocs.wordpress.com-inf-20200711-172727-7ktvu.json 247 download   job
stepintorpgs.wordpress.com-inf-20200711-172737-9vwgc-00000.warc.gz 4453817591 download   job
stepintorpgs.wordpress.com-inf-20200711-172737-9vwgc-00000.warc.os.cdx.gz 1908353 download
stepintorpgs.wordpress.com-inf-20200711-172737-9vwgc-meta.warc.gz 1346087 download   job
stepintorpgs.wordpress.com-inf-20200711-172737-9vwgc-meta.warc.os.cdx.gz 47 download
stepintorpgs.wordpress.com-inf-20200711-172737-9vwgc.json 251 download   job
twistedcities.wordpress.com-inf-20200711-173546-4w2z8-00000.warc.gz 1577555832 download   job
twistedcities.wordpress.com-inf-20200711-173546-4w2z8-00000.warc.os.cdx.gz 847936 download
twistedcities.wordpress.com-inf-20200711-173546-4w2z8-meta.warc.gz 572905 download   job
twistedcities.wordpress.com-inf-20200711-173546-4w2z8-meta.warc.os.cdx.gz 47 download
twistedcities.wordpress.com-inf-20200711-173546-4w2z8.json 252 download   job
urls-archive.max.fan-jobs.txt-shallow-20200711-185911-78icp-00000.warc.gz 80689635 download   job
urls-archive.max.fan-jobs.txt-shallow-20200711-185911-78icp-00000.warc.os.cdx.gz 92990 download
urls-archive.max.fan-jobs.txt-shallow-20200711-185911-78icp-meta.warc.gz 49893 download   job
urls-archive.max.fan-jobs.txt-shallow-20200711-185911-78icp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-jobs.txt-shallow-20200711-185911-78icp-urls.txt 101615 download
urls-archive.max.fan-jobs.txt-shallow-20200711-185911-78icp.json 286 download   job
urls-archive.max.fan-police.txt-shallow-20200711-185905-ahfjq-00000.warc.gz 5214571 download   job
urls-archive.max.fan-police.txt-shallow-20200711-185905-ahfjq-00000.warc.os.cdx.gz 13660 download
urls-archive.max.fan-police.txt-shallow-20200711-185905-ahfjq-meta.warc.gz 10623 download   job
urls-archive.max.fan-police.txt-shallow-20200711-185905-ahfjq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-police.txt-shallow-20200711-185905-ahfjq-urls.txt 15685 download
urls-archive.max.fan-police.txt-shallow-20200711-185905-ahfjq.json 290 download   job
urls-archive.max.fan-twitter-@AbingtonPolice-filtered.txt-shallow-20200711-194040-cq9yz-00000.warc.gz 80961457 download   job
urls-archive.max.fan-twitter-@AbingtonPolice-filtered.txt-shallow-20200711-194040-cq9yz-00000.warc.os.cdx.gz 99164 download
urls-archive.max.fan-twitter-@AbingtonPolice-filtered.txt-shallow-20200711-194040-cq9yz-meta.warc.gz 57260 download   job
urls-archive.max.fan-twitter-@AbingtonPolice-filtered.txt-shallow-20200711-194040-cq9yz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AbingtonPolice-filtered.txt-shallow-20200711-194040-cq9yz-urls.txt 32386 download
urls-archive.max.fan-twitter-@AbingtonPolice-filtered.txt-shallow-20200711-194040-cq9yz.json 343 download   job
urls-archive.max.fan-twitter-@AcushnetPolice-filtered.txt-shallow-20200711-193950-cuodn-00000.warc.gz 50203374 download   job
urls-archive.max.fan-twitter-@AcushnetPolice-filtered.txt-shallow-20200711-193950-cuodn-00000.warc.os.cdx.gz 61004 download
urls-archive.max.fan-twitter-@AcushnetPolice-filtered.txt-shallow-20200711-193950-cuodn-meta.warc.gz 36445 download   job
urls-archive.max.fan-twitter-@AcushnetPolice-filtered.txt-shallow-20200711-193950-cuodn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AcushnetPolice-filtered.txt-shallow-20200711-193950-cuodn-urls.txt 49322 download
urls-archive.max.fan-twitter-@AcushnetPolice-filtered.txt-shallow-20200711-193950-cuodn.json 343 download   job
urls-archive.max.fan-twitter-@AgawamPD-filtered.txt-shallow-20200711-193950-9b7ig-00000.warc.gz 1564987 download   job
urls-archive.max.fan-twitter-@AgawamPD-filtered.txt-shallow-20200711-193950-9b7ig-00000.warc.os.cdx.gz 4939 download
urls-archive.max.fan-twitter-@AgawamPD-filtered.txt-shallow-20200711-193950-9b7ig-meta.warc.gz 6633 download   job
urls-archive.max.fan-twitter-@AgawamPD-filtered.txt-shallow-20200711-193950-9b7ig-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AgawamPD-filtered.txt-shallow-20200711-193950-9b7ig-urls.txt 553 download
urls-archive.max.fan-twitter-@AgawamPD-filtered.txt-shallow-20200711-193950-9b7ig.json 331 download   job
urls-archive.max.fan-twitter-@AllenstownPD-filtered.txt-shallow-20200711-193548-5irtw-00000.warc.gz 2393378 download   job
urls-archive.max.fan-twitter-@AllenstownPD-filtered.txt-shallow-20200711-193548-5irtw-00000.warc.os.cdx.gz 5769 download
urls-archive.max.fan-twitter-@AllenstownPD-filtered.txt-shallow-20200711-193548-5irtw-meta.warc.gz 7177 download   job
urls-archive.max.fan-twitter-@AllenstownPD-filtered.txt-shallow-20200711-193548-5irtw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AllenstownPD-filtered.txt-shallow-20200711-193548-5irtw-urls.txt 1525 download
urls-archive.max.fan-twitter-@AllenstownPD-filtered.txt-shallow-20200711-193548-5irtw.json 339 download   job
urls-archive.max.fan-twitter-@AmesburyPD-filtered.txt-shallow-20200711-193543-1lpr0-00000.warc.gz 66237053 download   job
urls-archive.max.fan-twitter-@AmesburyPD-filtered.txt-shallow-20200711-193543-1lpr0-00000.warc.os.cdx.gz 97738 download
urls-archive.max.fan-twitter-@AmesburyPD-filtered.txt-shallow-20200711-193543-1lpr0-meta.warc.gz 56800 download   job
urls-archive.max.fan-twitter-@AmesburyPD-filtered.txt-shallow-20200711-193543-1lpr0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AmesburyPD-filtered.txt-shallow-20200711-193543-1lpr0-urls.txt 27377 download
urls-archive.max.fan-twitter-@AmesburyPD-filtered.txt-shallow-20200711-193543-1lpr0.json 335 download   job
urls-archive.max.fan-twitter-@AmherstMApolice-filtered.txt-shallow-20200711-193543-8v1m3-00000.warc.gz 80137808 download   job
urls-archive.max.fan-twitter-@AmherstMApolice-filtered.txt-shallow-20200711-193543-8v1m3-00000.warc.os.cdx.gz 82610 download
urls-archive.max.fan-twitter-@AmherstMApolice-filtered.txt-shallow-20200711-193543-8v1m3-meta.warc.gz 48716 download   job
urls-archive.max.fan-twitter-@AmherstMApolice-filtered.txt-shallow-20200711-193543-8v1m3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AmherstMApolice-filtered.txt-shallow-20200711-193543-8v1m3-urls.txt 38690 download
urls-archive.max.fan-twitter-@AmherstMApolice-filtered.txt-shallow-20200711-193543-8v1m3.json 345 download   job
urls-archive.max.fan-twitter-@AndoverDPW-filtered.txt-shallow-20200711-193134-32wb7-00000.warc.gz 174140552 download   job
urls-archive.max.fan-twitter-@AndoverDPW-filtered.txt-shallow-20200711-193134-32wb7-00000.warc.os.cdx.gz 169979 download
urls-archive.max.fan-twitter-@AndoverDPW-filtered.txt-shallow-20200711-193134-32wb7-meta.warc.gz 94855 download   job
urls-archive.max.fan-twitter-@AndoverDPW-filtered.txt-shallow-20200711-193134-32wb7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AndoverDPW-filtered.txt-shallow-20200711-193134-32wb7-urls.txt 103476 download
urls-archive.max.fan-twitter-@AndoverDPW-filtered.txt-shallow-20200711-193134-32wb7.json 335 download   job
urls-archive.max.fan-twitter-@ArlingtonMAPD-filtered.txt-shallow-20200711-193134-9wod0-00000.warc.gz 672686247 download   job
urls-archive.max.fan-twitter-@ArlingtonMAPD-filtered.txt-shallow-20200711-193134-9wod0-00000.warc.os.cdx.gz 705321 download
urls-archive.max.fan-twitter-@ArlingtonMAPD-filtered.txt-shallow-20200711-193134-9wod0-urls.txt 256184 download
urls-archive.max.fan-twitter-@AshlandMAFire-filtered.txt-shallow-20200711-192909-6o0vu-00000.warc.gz 253802538 download   job
urls-archive.max.fan-twitter-@AshlandMAFire-filtered.txt-shallow-20200711-192909-6o0vu-00000.warc.os.cdx.gz 213251 download
urls-archive.max.fan-twitter-@AshlandMAFire-filtered.txt-shallow-20200711-192909-6o0vu-meta.warc.gz 116904 download   job
urls-archive.max.fan-twitter-@AshlandMAFire-filtered.txt-shallow-20200711-192909-6o0vu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AshlandMAFire-filtered.txt-shallow-20200711-192909-6o0vu-urls.txt 68067 download
urls-archive.max.fan-twitter-@AshlandMAFire-filtered.txt-shallow-20200711-192909-6o0vu.json 341 download   job
urls-archive.max.fan-twitter-@AsstDeputyChief-filtered.txt-shallow-20200711-192818-aa42z-00000.warc.gz 4511857 download   job
urls-archive.max.fan-twitter-@AsstDeputyChief-filtered.txt-shallow-20200711-192818-aa42z-00000.warc.os.cdx.gz 12384 download
urls-archive.max.fan-twitter-@AsstDeputyChief-filtered.txt-shallow-20200711-192818-aa42z-meta.warc.gz 10759 download   job
urls-archive.max.fan-twitter-@AsstDeputyChief-filtered.txt-shallow-20200711-192818-aa42z-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsstDeputyChief-filtered.txt-shallow-20200711-192818-aa42z-urls.txt 1323 download
urls-archive.max.fan-twitter-@AsstDeputyChief-filtered.txt-shallow-20200711-192818-aa42z.json 345 download   job
urls-archive.max.fan-twitter-@AuburnMAPolice-filtered.txt-shallow-20200711-192815-7r2ho-00000.warc.gz 407440379 download   job
urls-archive.max.fan-twitter-@AuburnMAPolice-filtered.txt-shallow-20200711-192815-7r2ho-00000.warc.os.cdx.gz 472206 download
urls-archive.max.fan-twitter-@AuburnMAPolice-filtered.txt-shallow-20200711-192815-7r2ho-meta.warc.gz 254515 download   job
urls-archive.max.fan-twitter-@AuburnMAPolice-filtered.txt-shallow-20200711-192815-7r2ho-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AuburnMAPolice-filtered.txt-shallow-20200711-192815-7r2ho-urls.txt 163242 download
urls-archive.max.fan-twitter-@AuburnMAPolice-filtered.txt-shallow-20200711-192815-7r2ho.json 343 download   job
urls-archive.max.fan-twitter-@BCPoliceDept-filtered.txt-shallow-20200711-192726-dtl98-00000.warc.gz 168414182 download   job
urls-archive.max.fan-twitter-@BCPoliceDept-filtered.txt-shallow-20200711-192726-dtl98-00000.warc.os.cdx.gz 220363 download
urls-archive.max.fan-twitter-@BCPoliceDept-filtered.txt-shallow-20200711-192726-dtl98-meta.warc.gz 122751 download   job
urls-archive.max.fan-twitter-@BCPoliceDept-filtered.txt-shallow-20200711-192726-dtl98-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BCPoliceDept-filtered.txt-shallow-20200711-192726-dtl98-urls.txt 85065 download
urls-archive.max.fan-twitter-@BCPoliceDept-filtered.txt-shallow-20200711-192726-dtl98.json 339 download   job
urls-archive.max.fan-twitter-@BOSTON_EMS-filtered.txt-shallow-20200711-191639-590u0-00000.warc.gz 437503176 download   job
urls-archive.max.fan-twitter-@BOSTON_EMS-filtered.txt-shallow-20200711-191639-590u0-00000.warc.os.cdx.gz 695830 download
urls-archive.max.fan-twitter-@BOSTON_EMS-filtered.txt-shallow-20200711-191639-590u0-meta.warc.gz 375816 download   job
urls-archive.max.fan-twitter-@BOSTON_EMS-filtered.txt-shallow-20200711-191639-590u0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BOSTON_EMS-filtered.txt-shallow-20200711-191639-590u0-urls.txt 252519 download
urls-archive.max.fan-twitter-@BOSTON_EMS-filtered.txt-shallow-20200711-191639-590u0.json 335 download   job
urls-archive.max.fan-twitter-@BPDACO-filtered.txt-shallow-20200711-191051-b7f0d-00000.warc.gz 141467694 download   job
urls-archive.max.fan-twitter-@BPDACO-filtered.txt-shallow-20200711-191051-b7f0d-00000.warc.os.cdx.gz 120405 download
urls-archive.max.fan-twitter-@BPDACO-filtered.txt-shallow-20200711-191051-b7f0d-meta.warc.gz 65063 download   job
urls-archive.max.fan-twitter-@BPDACO-filtered.txt-shallow-20200711-191051-b7f0d-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BPDACO-filtered.txt-shallow-20200711-191051-b7f0d-urls.txt 95052 download
urls-archive.max.fan-twitter-@BPDACO-filtered.txt-shallow-20200711-191051-b7f0d.json 327 download   job
urls-archive.max.fan-twitter-@BPDK9Bushido-filtered.txt-shallow-20200711-190717-b1r7r-00000.warc.gz 35278853 download   job
urls-archive.max.fan-twitter-@BPDK9Bushido-filtered.txt-shallow-20200711-190717-b1r7r-00000.warc.os.cdx.gz 89277 download
urls-archive.max.fan-twitter-@BPDK9Bushido-filtered.txt-shallow-20200711-190717-b1r7r-meta.warc.gz 52877 download   job
urls-archive.max.fan-twitter-@BPDK9Bushido-filtered.txt-shallow-20200711-190717-b1r7r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BPDK9Bushido-filtered.txt-shallow-20200711-190717-b1r7r-urls.txt 13465 download
urls-archive.max.fan-twitter-@BPDK9Bushido-filtered.txt-shallow-20200711-190717-b1r7r.json 339 download   job
urls-archive.max.fan-twitter-@BPDTraffic-filtered.txt-shallow-20200711-190458-a6383-00000.warc.gz 31538816 download   job
urls-archive.max.fan-twitter-@BPDTraffic-filtered.txt-shallow-20200711-190458-a6383-00000.warc.os.cdx.gz 47784 download
urls-archive.max.fan-twitter-@BPDTraffic-filtered.txt-shallow-20200711-190458-a6383-meta.warc.gz 30307 download   job
urls-archive.max.fan-twitter-@BPDTraffic-filtered.txt-shallow-20200711-190458-a6383-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BPDTraffic-filtered.txt-shallow-20200711-190458-a6383-urls.txt 12398 download
urls-archive.max.fan-twitter-@BPDTraffic-filtered.txt-shallow-20200711-190458-a6383.json 335 download   job
urls-archive.max.fan-twitter-@BabsonPolice-filtered.txt-shallow-20200711-192728-69h71-00000.warc.gz 2540 download   job
urls-archive.max.fan-twitter-@BabsonPolice-filtered.txt-shallow-20200711-192728-69h71-00000.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BabsonPolice-filtered.txt-shallow-20200711-192728-69h71-meta.warc.gz 3408 download   job
urls-archive.max.fan-twitter-@BabsonPolice-filtered.txt-shallow-20200711-192728-69h71-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BabsonPolice-filtered.txt-shallow-20200711-192728-69h71-urls.txt 0 download
urls-archive.max.fan-twitter-@BabsonPolice-filtered.txt-shallow-20200711-192728-69h71.json 339 download   job
urls-archive.max.fan-twitter-@BedfordNHPolice-filtered.txt-shallow-20200711-192404-81lcs-00000.warc.gz 177650191 download   job
urls-archive.max.fan-twitter-@BedfordNHPolice-filtered.txt-shallow-20200711-192404-81lcs-00000.warc.os.cdx.gz 164093 download
urls-archive.max.fan-twitter-@BedfordNHPolice-filtered.txt-shallow-20200711-192404-81lcs-meta.warc.gz 91185 download   job
urls-archive.max.fan-twitter-@BedfordNHPolice-filtered.txt-shallow-20200711-192404-81lcs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BedfordNHPolice-filtered.txt-shallow-20200711-192404-81lcs-urls.txt 116042 download
urls-archive.max.fan-twitter-@BedfordNHPolice-filtered.txt-shallow-20200711-192404-81lcs.json 345 download   job
urls-archive.max.fan-twitter-@BelchertownPD-filtered.txt-shallow-20200711-192355-cbr1k-00000.warc.gz 3108140 download   job
urls-archive.max.fan-twitter-@BelchertownPD-filtered.txt-shallow-20200711-192355-cbr1k-00000.warc.os.cdx.gz 7930 download
urls-archive.max.fan-twitter-@BelchertownPD-filtered.txt-shallow-20200711-192355-cbr1k-meta.warc.gz 8422 download   job
urls-archive.max.fan-twitter-@BelchertownPD-filtered.txt-shallow-20200711-192355-cbr1k-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BelchertownPD-filtered.txt-shallow-20200711-192355-cbr1k-urls.txt 1320 download
urls-archive.max.fan-twitter-@BelchertownPD-filtered.txt-shallow-20200711-192355-cbr1k.json 341 download   job
urls-archive.max.fan-twitter-@BelmontPD-filtered.txt-shallow-20200711-192353-4vexe-00000.warc.gz 270820281 download   job
urls-archive.max.fan-twitter-@BelmontPD-filtered.txt-shallow-20200711-192353-4vexe-00000.warc.os.cdx.gz 300646 download
urls-archive.max.fan-twitter-@BelmontPD-filtered.txt-shallow-20200711-192353-4vexe-meta.warc.gz 165385 download   job
urls-archive.max.fan-twitter-@BelmontPD-filtered.txt-shallow-20200711-192353-4vexe-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BelmontPD-filtered.txt-shallow-20200711-192353-4vexe-urls.txt 127453 download
urls-archive.max.fan-twitter-@BelmontPD-filtered.txt-shallow-20200711-192353-4vexe.json 333 download   job
urls-archive.max.fan-twitter-@Bentley_Police-filtered.txt-shallow-20200711-192351-7sxcp-00000.warc.gz 71527407 download   job
urls-archive.max.fan-twitter-@Bentley_Police-filtered.txt-shallow-20200711-192351-7sxcp-00000.warc.os.cdx.gz 69518 download
urls-archive.max.fan-twitter-@Bentley_Police-filtered.txt-shallow-20200711-192351-7sxcp-meta.warc.gz 41550 download   job
urls-archive.max.fan-twitter-@Bentley_Police-filtered.txt-shallow-20200711-192351-7sxcp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Bentley_Police-filtered.txt-shallow-20200711-192351-7sxcp-urls.txt 19582 download
urls-archive.max.fan-twitter-@Bentley_Police-filtered.txt-shallow-20200711-192351-7sxcp.json 343 download   job
urls-archive.max.fan-twitter-@BerlinMAPolice-filtered.txt-shallow-20200711-192108-95gbe-00000.warc.gz 30191301 download   job
urls-archive.max.fan-twitter-@BerlinMAPolice-filtered.txt-shallow-20200711-192108-95gbe-00000.warc.os.cdx.gz 40323 download
urls-archive.max.fan-twitter-@BerlinMAPolice-filtered.txt-shallow-20200711-192108-95gbe-meta.warc.gz 26111 download   job
urls-archive.max.fan-twitter-@BerlinMAPolice-filtered.txt-shallow-20200711-192108-95gbe-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BerlinMAPolice-filtered.txt-shallow-20200711-192108-95gbe-urls.txt 28445 download
urls-archive.max.fan-twitter-@BerlinMAPolice-filtered.txt-shallow-20200711-192108-95gbe.json 343 download   job
urls-archive.max.fan-twitter-@BeverlyPD-filtered.txt-shallow-20200711-192106-9lrfo-00000.warc.gz 36294730 download   job
urls-archive.max.fan-twitter-@BeverlyPD-filtered.txt-shallow-20200711-192106-9lrfo-00000.warc.os.cdx.gz 66926 download
urls-archive.max.fan-twitter-@BeverlyPD-filtered.txt-shallow-20200711-192106-9lrfo-meta.warc.gz 40962 download   job
urls-archive.max.fan-twitter-@BeverlyPD-filtered.txt-shallow-20200711-192106-9lrfo-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BeverlyPD-filtered.txt-shallow-20200711-192106-9lrfo-urls.txt 22724 download
urls-archive.max.fan-twitter-@BeverlyPD-filtered.txt-shallow-20200711-192106-9lrfo.json 333 download   job
urls-archive.max.fan-twitter-@BillericaEMS-filtered.txt-shallow-20200711-192105-2keio-00000.warc.gz 28499152 download   job
urls-archive.max.fan-twitter-@BillericaEMS-filtered.txt-shallow-20200711-192105-2keio-00000.warc.os.cdx.gz 43277 download
urls-archive.max.fan-twitter-@BillericaEMS-filtered.txt-shallow-20200711-192105-2keio-meta.warc.gz 27738 download   job
urls-archive.max.fan-twitter-@BillericaEMS-filtered.txt-shallow-20200711-192105-2keio-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BillericaEMS-filtered.txt-shallow-20200711-192105-2keio-urls.txt 10258 download
urls-archive.max.fan-twitter-@BillericaEMS-filtered.txt-shallow-20200711-192105-2keio.json 339 download   job
urls-archive.max.fan-twitter-@BillericaFD-filtered.txt-shallow-20200711-192011-aabn7-00000.warc.gz 134028962 download   job
urls-archive.max.fan-twitter-@BillericaFD-filtered.txt-shallow-20200711-192011-aabn7-00000.warc.os.cdx.gz 201674 download
urls-archive.max.fan-twitter-@BillericaFD-filtered.txt-shallow-20200711-192011-aabn7-meta.warc.gz 112775 download   job
urls-archive.max.fan-twitter-@BillericaFD-filtered.txt-shallow-20200711-192011-aabn7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BillericaFD-filtered.txt-shallow-20200711-192011-aabn7-urls.txt 63868 download
urls-archive.max.fan-twitter-@BillericaFD-filtered.txt-shallow-20200711-192011-aabn7.json 337 download   job
urls-archive.max.fan-twitter-@BillericaPD-filtered.txt-shallow-20200711-192010-b0jag-urls.txt 1319192 download
urls-archive.max.fan-twitter-@BillericaPD-filtered.txt-shallow-20200711-192010-b0jag.json 337 download   job
urls-archive.max.fan-twitter-@BostonDotCom-filtered.txt-shallow-20200711-192010-8v7ex-00000.warc.gz 1274647 download   job
urls-archive.max.fan-twitter-@BostonDotCom-filtered.txt-shallow-20200711-192010-8v7ex-00000.warc.os.cdx.gz 4126 download
urls-archive.max.fan-twitter-@BostonDotCom-filtered.txt-shallow-20200711-192010-8v7ex-meta.warc.gz 6169 download   job
urls-archive.max.fan-twitter-@BostonDotCom-filtered.txt-shallow-20200711-192010-8v7ex-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BostonDotCom-filtered.txt-shallow-20200711-192010-8v7ex-urls.txt 51 download
urls-archive.max.fan-twitter-@BostonDotCom-filtered.txt-shallow-20200711-192010-8v7ex.json 339 download   job
urls-archive.max.fan-twitter-@Boston_PFD-filtered.txt-shallow-20200711-191637-ada3b-00000.warc.gz 58876693 download   job
urls-archive.max.fan-twitter-@Boston_PFD-filtered.txt-shallow-20200711-191637-ada3b-00000.warc.os.cdx.gz 75115 download
urls-archive.max.fan-twitter-@Boston_PFD-filtered.txt-shallow-20200711-191637-ada3b-meta.warc.gz 44667 download   job
urls-archive.max.fan-twitter-@Boston_PFD-filtered.txt-shallow-20200711-191637-ada3b-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Boston_PFD-filtered.txt-shallow-20200711-191637-ada3b-urls.txt 14122 download
urls-archive.max.fan-twitter-@Boston_PFD-filtered.txt-shallow-20200711-191637-ada3b.json 335 download   job
urls-archive.max.fan-twitter-@BournePD-filtered.txt-shallow-20200711-191116-1cn4l-00000.warc.gz 160573711 download   job
urls-archive.max.fan-twitter-@BournePD-filtered.txt-shallow-20200711-191116-1cn4l-00000.warc.os.cdx.gz 213499 download
urls-archive.max.fan-twitter-@BournePD-filtered.txt-shallow-20200711-191116-1cn4l-meta.warc.gz 118009 download   job
urls-archive.max.fan-twitter-@BournePD-filtered.txt-shallow-20200711-191116-1cn4l-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BournePD-filtered.txt-shallow-20200711-191116-1cn4l-urls.txt 131277 download
urls-archive.max.fan-twitter-@BournePD-filtered.txt-shallow-20200711-191116-1cn4l.json 331 download   job
urls-archive.max.fan-twitter-@BoxboroughPD-filtered.txt-shallow-20200711-191114-1pe34-00000.warc.gz 242077415 download   job
urls-archive.max.fan-twitter-@BoxboroughPD-filtered.txt-shallow-20200711-191114-1pe34-00000.warc.os.cdx.gz 257394 download
urls-archive.max.fan-twitter-@BoxboroughPD-filtered.txt-shallow-20200711-191114-1pe34-meta.warc.gz 141517 download   job
urls-archive.max.fan-twitter-@BoxboroughPD-filtered.txt-shallow-20200711-191114-1pe34-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BoxboroughPD-filtered.txt-shallow-20200711-191114-1pe34-urls.txt 110282 download
urls-archive.max.fan-twitter-@BoxboroughPD-filtered.txt-shallow-20200711-191114-1pe34.json 339 download   job
urls-archive.max.fan-twitter-@BrewsterMAPD-filtered.txt-shallow-20200711-190457-979uj-00000.warc.gz 18497902 download   job
urls-archive.max.fan-twitter-@BrewsterMAPD-filtered.txt-shallow-20200711-190457-979uj-00000.warc.os.cdx.gz 24512 download
urls-archive.max.fan-twitter-@BrewsterMAPD-filtered.txt-shallow-20200711-190457-979uj-meta.warc.gz 17598 download   job
urls-archive.max.fan-twitter-@BrewsterMAPD-filtered.txt-shallow-20200711-190457-979uj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrewsterMAPD-filtered.txt-shallow-20200711-190457-979uj-urls.txt 17324 download
urls-archive.max.fan-twitter-@BrewsterMAPD-filtered.txt-shallow-20200711-190457-979uj.json 339 download   job
urls-archive.max.fan-twitter-@BristolSheriff-filtered.txt-shallow-20200711-190457-6qnz0-00000.warc.gz 430357650 download   job
urls-archive.max.fan-twitter-@BristolSheriff-filtered.txt-shallow-20200711-190457-6qnz0-00000.warc.os.cdx.gz 338385 download
urls-archive.max.fan-twitter-@BristolSheriff-filtered.txt-shallow-20200711-190457-6qnz0-meta.warc.gz 181480 download   job
urls-archive.max.fan-twitter-@BristolSheriff-filtered.txt-shallow-20200711-190457-6qnz0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BristolSheriff-filtered.txt-shallow-20200711-190457-6qnz0-urls.txt 122428 download
urls-archive.max.fan-twitter-@BristolSheriff-filtered.txt-shallow-20200711-190457-6qnz0.json 343 download   job
urls-archive.max.fan-twitter-@BrocktonPolice-filtered.txt-shallow-20200711-190309-2wbve-00000.warc.gz 134040688 download   job
urls-archive.max.fan-twitter-@BrocktonPolice-filtered.txt-shallow-20200711-190309-2wbve-00000.warc.os.cdx.gz 169826 download
urls-archive.max.fan-twitter-@BrocktonPolice-filtered.txt-shallow-20200711-190309-2wbve-meta.warc.gz 94422 download   job
urls-archive.max.fan-twitter-@BrocktonPolice-filtered.txt-shallow-20200711-190309-2wbve-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrocktonPolice-filtered.txt-shallow-20200711-190309-2wbve-urls.txt 62663 download
urls-archive.max.fan-twitter-@BrocktonPolice-filtered.txt-shallow-20200711-190309-2wbve.json 343 download   job
urls-archive.max.fan-twitter-@BrooklineFD-filtered.txt-shallow-20200711-190308-bpwe4-00000.warc.gz 201335245 download   job
urls-archive.max.fan-twitter-@BrooklineFD-filtered.txt-shallow-20200711-190308-bpwe4-00000.warc.os.cdx.gz 233142 download
urls-archive.max.fan-twitter-@BrooklineFD-filtered.txt-shallow-20200711-190308-bpwe4-meta.warc.gz 129008 download   job
urls-archive.max.fan-twitter-@BrooklineFD-filtered.txt-shallow-20200711-190308-bpwe4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrooklineFD-filtered.txt-shallow-20200711-190308-bpwe4-urls.txt 74592 download
urls-archive.max.fan-twitter-@BrooklineFD-filtered.txt-shallow-20200711-190308-bpwe4.json 337 download   job
urls-archive.max.fan-twitter-@BrooklineMAPD-filtered.txt-shallow-20200711-190306-10fem-00000.warc.gz 645171553 download   job
urls-archive.max.fan-twitter-@BrooklineMAPD-filtered.txt-shallow-20200711-190306-10fem-00000.warc.os.cdx.gz 720216 download
urls-archive.max.fan-twitter-@BrooklineMAPD-filtered.txt-shallow-20200711-190306-10fem-meta.warc.gz 386497 download   job
urls-archive.max.fan-twitter-@BrooklineMAPD-filtered.txt-shallow-20200711-190306-10fem-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrooklineMAPD-filtered.txt-shallow-20200711-190306-10fem-urls.txt 306246 download
urls-archive.max.fan-twitter-@BrooklineMAPD-filtered.txt-shallow-20200711-190306-10fem.json 341 download   job
urls-archive.max.fan-twitter-@BurlingtonMAFD-filtered.txt-shallow-20200711-190240-2rmrc-00000.warc.gz 295731050 download   job
urls-archive.max.fan-twitter-@BurlingtonMAFD-filtered.txt-shallow-20200711-190240-2rmrc-00000.warc.os.cdx.gz 260711 download
urls-archive.max.fan-twitter-@BurlingtonMAFD-filtered.txt-shallow-20200711-190240-2rmrc-meta.warc.gz 141072 download   job
urls-archive.max.fan-twitter-@BurlingtonMAFD-filtered.txt-shallow-20200711-190240-2rmrc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BurlingtonMAFD-filtered.txt-shallow-20200711-190240-2rmrc-urls.txt 110467 download
urls-archive.max.fan-twitter-@BurlingtonMAFD-filtered.txt-shallow-20200711-190240-2rmrc.json 343 download   job
urls-archive.max.fan-twitter-@BurlingtonPDK9-filtered.txt-shallow-20200711-184909-86gqu-00000.warc.gz 185029938 download   job
urls-archive.max.fan-twitter-@BurlingtonPDK9-filtered.txt-shallow-20200711-184909-86gqu-00000.warc.os.cdx.gz 277564 download
urls-archive.max.fan-twitter-@BurlingtonPDK9-filtered.txt-shallow-20200711-184909-86gqu-meta.warc.gz 152602 download   job
urls-archive.max.fan-twitter-@BurlingtonPDK9-filtered.txt-shallow-20200711-184909-86gqu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BurlingtonPDK9-filtered.txt-shallow-20200711-184909-86gqu-urls.txt 100625 download
urls-archive.max.fan-twitter-@BurlingtonPDK9-filtered.txt-shallow-20200711-184909-86gqu.json 343 download   job
urls-archive.max.fan-twitter-@CT_STATE_POLICE-filtered.txt-shallow-20200711-183338-9llr0-00000.warc.gz 1058967505 download   job
urls-archive.max.fan-twitter-@CT_STATE_POLICE-filtered.txt-shallow-20200711-183338-9llr0-00000.warc.os.cdx.gz 1335305 download
urls-archive.max.fan-twitter-@CT_STATE_POLICE-filtered.txt-shallow-20200711-183338-9llr0-meta.warc.gz 714714 download   job
urls-archive.max.fan-twitter-@CT_STATE_POLICE-filtered.txt-shallow-20200711-183338-9llr0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CT_STATE_POLICE-filtered.txt-shallow-20200711-183338-9llr0-urls.txt 373619 download
urls-archive.max.fan-twitter-@CT_STATE_POLICE-filtered.txt-shallow-20200711-183338-9llr0.json 345 download   job
urls-archive.max.fan-twitter-@CarlisleMAPD-filtered.txt-shallow-20200711-184837-eapaa-00000.warc.gz 2844787 download   job
urls-archive.max.fan-twitter-@CarlisleMAPD-filtered.txt-shallow-20200711-184837-eapaa-00000.warc.os.cdx.gz 5864 download
urls-archive.max.fan-twitter-@CarlisleMAPD-filtered.txt-shallow-20200711-184837-eapaa-meta.warc.gz 7234 download   job
urls-archive.max.fan-twitter-@CarlisleMAPD-filtered.txt-shallow-20200711-184837-eapaa-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CarlisleMAPD-filtered.txt-shallow-20200711-184837-eapaa-urls.txt 1299 download
urls-archive.max.fan-twitter-@CarlisleMAPD-filtered.txt-shallow-20200711-184837-eapaa.json 339 download   job
urls-archive.max.fan-twitter-@CarverPolice-filtered.txt-shallow-20200711-184655-3udi8-00000.warc.gz 53272941 download   job
urls-archive.max.fan-twitter-@CarverPolice-filtered.txt-shallow-20200711-184655-3udi8-00000.warc.os.cdx.gz 72246 download
urls-archive.max.fan-twitter-@CarverPolice-filtered.txt-shallow-20200711-184655-3udi8-meta.warc.gz 42560 download   job
urls-archive.max.fan-twitter-@CarverPolice-filtered.txt-shallow-20200711-184655-3udi8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CarverPolice-filtered.txt-shallow-20200711-184655-3udi8.json 339 download   job
urls-archive.max.fan-twitter-@ChelmsfordPD-filtered.txt-shallow-20200711-184654-2ww4y-00000.warc.gz 106721159 download   job
urls-archive.max.fan-twitter-@ChelmsfordPD-filtered.txt-shallow-20200711-184654-2ww4y-00000.warc.os.cdx.gz 187509 download
urls-archive.max.fan-twitter-@ChelmsfordPD-filtered.txt-shallow-20200711-184654-2ww4y-urls.txt 83748 download
urls-archive.max.fan-twitter-@ChiefBartlett-filtered.txt-shallow-20200711-184608-d1wqa-00000.warc.gz 61706931 download   job
urls-archive.max.fan-twitter-@ChiefBartlett-filtered.txt-shallow-20200711-184608-d1wqa-00000.warc.os.cdx.gz 76539 download
urls-archive.max.fan-twitter-@ChiefBartlett-filtered.txt-shallow-20200711-184608-d1wqa-meta.warc.gz 45581 download   job
urls-archive.max.fan-twitter-@ChiefBartlett-filtered.txt-shallow-20200711-184608-d1wqa-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChiefBartlett-filtered.txt-shallow-20200711-184608-d1wqa-urls.txt 21236 download
urls-archive.max.fan-twitter-@ChiefBartlett-filtered.txt-shallow-20200711-184608-d1wqa.json 341 download   job
urls-archive.max.fan-twitter-@ChiefCantu111-filtered.txt-shallow-20200711-184515-ch2h9-meta.warc.gz 119197 download   job
urls-archive.max.fan-twitter-@ChiefCantu111-filtered.txt-shallow-20200711-184515-ch2h9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChiefCantu111-filtered.txt-shallow-20200711-184515-ch2h9-urls.txt 50646 download
urls-archive.max.fan-twitter-@ChiefCantu111-filtered.txt-shallow-20200711-184515-ch2h9.json 341 download   job
urls-archive.max.fan-twitter-@ChiefGalea-filtered.txt-shallow-20200711-184453-55xv3-00000.warc.gz 11732139 download   job
urls-archive.max.fan-twitter-@ChiefGalea-filtered.txt-shallow-20200711-184453-55xv3-00000.warc.os.cdx.gz 18443 download
urls-archive.max.fan-twitter-@ChiefJarrell-filtered.txt-shallow-20200711-184249-bmt3k-00000.warc.gz 27988560 download   job
urls-archive.max.fan-twitter-@ChiefJarrell-filtered.txt-shallow-20200711-184249-bmt3k-00000.warc.os.cdx.gz 27358 download
urls-archive.max.fan-twitter-@ChiefJarrell-filtered.txt-shallow-20200711-184249-bmt3k-meta.warc.gz 18872 download   job
urls-archive.max.fan-twitter-@ChiefJarrell-filtered.txt-shallow-20200711-184249-bmt3k-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChiefJarrell-filtered.txt-shallow-20200711-184249-bmt3k-urls.txt 8220 download
urls-archive.max.fan-twitter-@ChiefJarrell-filtered.txt-shallow-20200711-184249-bmt3k.json 339 download   job
urls-archive.max.fan-twitter-@ChiefMcNeilSPD-filtered.txt-shallow-20200711-184104-ac0fw-00000.warc.gz 89977551 download   job
urls-archive.max.fan-twitter-@ChiefMcNeilSPD-filtered.txt-shallow-20200711-184104-ac0fw-00000.warc.os.cdx.gz 102801 download
urls-archive.max.fan-twitter-@ChiefMcNeilSPD-filtered.txt-shallow-20200711-184104-ac0fw.json 343 download   job
urls-archive.max.fan-twitter-@ChiefMcgrath-filtered.txt-shallow-20200711-184104-b4vr1-00000.warc.gz 43281562 download   job
urls-archive.max.fan-twitter-@ChiefMcgrath-filtered.txt-shallow-20200711-184104-b4vr1-00000.warc.os.cdx.gz 59551 download
urls-archive.max.fan-twitter-@ChiefMcgrath-filtered.txt-shallow-20200711-184104-b4vr1-meta.warc.gz 36396 download   job
urls-archive.max.fan-twitter-@ChiefMcgrath-filtered.txt-shallow-20200711-184104-b4vr1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChiefMcgrath-filtered.txt-shallow-20200711-184104-b4vr1-urls.txt 10611 download
urls-archive.max.fan-twitter-@ChiefMcgrath-filtered.txt-shallow-20200711-184104-b4vr1.json 339 download   job
urls-archive.max.fan-twitter-@ChiefMillerCPD-filtered.txt-shallow-20200711-183925-adr1j-00000.warc.gz 136865904 download   job
urls-archive.max.fan-twitter-@ChiefMillerCPD-filtered.txt-shallow-20200711-183925-adr1j-00000.warc.os.cdx.gz 153516 download
urls-archive.max.fan-twitter-@ChiefMillerCPD-filtered.txt-shallow-20200711-183925-adr1j-urls.txt 40609 download
urls-archive.max.fan-twitter-@ChiefMillerCPD-filtered.txt-shallow-20200711-183925-adr1j.json 343 download   job
urls-archive.max.fan-twitter-@ChiefShughes-filtered.txt-shallow-20200711-183646-dwmc9-00000.warc.gz 210239690 download   job
urls-archive.max.fan-twitter-@ChiefShughes-filtered.txt-shallow-20200711-183646-dwmc9-00000.warc.os.cdx.gz 242568 download
urls-archive.max.fan-twitter-@ChiefShughes-filtered.txt-shallow-20200711-183646-dwmc9-urls.txt 54660 download
urls-archive.max.fan-twitter-@ChiefShughes-filtered.txt-shallow-20200711-183646-dwmc9.json 339 download   job
urls-archive.max.fan-twitter-@ChiefStevenson1-filtered.txt-shallow-20200711-183519-7txud-00000.warc.gz 9297853 download   job
urls-archive.max.fan-twitter-@ChiefStevenson1-filtered.txt-shallow-20200711-183519-7txud-00000.warc.os.cdx.gz 20239 download
urls-archive.max.fan-twitter-@ChiefStevenson1-filtered.txt-shallow-20200711-183519-7txud-meta.warc.gz 15294 download   job
urls-archive.max.fan-twitter-@ChiefStevenson1-filtered.txt-shallow-20200711-183519-7txud-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ChiefStevenson1-filtered.txt-shallow-20200711-183519-7txud-urls.txt 2835 download
urls-archive.max.fan-twitter-@ChiefStevenson1-filtered.txt-shallow-20200711-183519-7txud.json 345 download   job
urls-archive.max.fan-twitter-@Chief_Navarro-filtered.txt-shallow-20200711-183831-955d4-00000.warc.gz 48036305 download   job
urls-archive.max.fan-twitter-@Chief_Navarro-filtered.txt-shallow-20200711-183831-955d4-00000.warc.os.cdx.gz 40150 download
urls-archive.max.fan-twitter-@Chief_Navarro-filtered.txt-shallow-20200711-183831-955d4-meta.warc.gz 25542 download   job
urls-archive.max.fan-twitter-@Chief_Navarro-filtered.txt-shallow-20200711-183831-955d4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Chief_Navarro-filtered.txt-shallow-20200711-183831-955d4-urls.txt 10980 download
urls-archive.max.fan-twitter-@Chiefsmith201-filtered.txt-shallow-20200711-183642-3pj60-00000.warc.gz 60126555 download   job
urls-archive.max.fan-twitter-@Chiefsmith201-filtered.txt-shallow-20200711-183642-3pj60-00000.warc.os.cdx.gz 73399 download
urls-archive.max.fan-twitter-@Chiefsmith201-filtered.txt-shallow-20200711-183642-3pj60-meta.warc.gz 43491 download   job
urls-archive.max.fan-twitter-@Chiefsmith201-filtered.txt-shallow-20200711-183642-3pj60-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Chiefsmith201-filtered.txt-shallow-20200711-183642-3pj60-urls.txt 25804 download
urls-archive.max.fan-twitter-@Chiefsmith201-filtered.txt-shallow-20200711-183642-3pj60.json 341 download   job
urls-archive.max.fan-twitter-@CityofLowellMA-filtered.txt-shallow-20200711-183347-1pvr4-meta.warc.gz 107203 download   job
urls-archive.max.fan-twitter-@CityofLowellMA-filtered.txt-shallow-20200711-183347-1pvr4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CityofLowellMA-filtered.txt-shallow-20200711-183347-1pvr4-urls.txt 83532 download
urls-archive.max.fan-twitter-@CohassetPolice-filtered.txt-shallow-20200711-183341-d29sk-urls.txt 60025 download
urls-archive.max.fan-twitter-@DanversPolice-filtered.txt-shallow-20200711-183149-t2wj9-00000.warc.gz 271022009 download   job
urls-archive.max.fan-twitter-@DanversPolice-filtered.txt-shallow-20200711-183149-t2wj9-00000.warc.os.cdx.gz 347153 download
urls-archive.max.fan-twitter-@DanversPolice-filtered.txt-shallow-20200711-183149-t2wj9-urls.txt 148572 download
urls-archive.max.fan-twitter-@DedhamPD-filtered.txt-shallow-20200711-183149-5r1nz-meta.warc.gz 138879 download   job
urls-archive.max.fan-twitter-@DedhamPD-filtered.txt-shallow-20200711-183149-5r1nz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DedhamPD-filtered.txt-shallow-20200711-183149-5r1nz-urls.txt 76626 download
urls-archive.max.fan-twitter-@DedhamPD-filtered.txt-shallow-20200711-183149-5r1nz.json 331 download   job
urls-archive.max.fan-twitter-@DeerfieldMAPD-filtered.txt-shallow-20200711-183148-agg5q-meta.warc.gz 32523 download   job
urls-archive.max.fan-twitter-@DeerfieldMAPD-filtered.txt-shallow-20200711-183148-agg5q-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DeerfieldMAPD-filtered.txt-shallow-20200711-183148-agg5q.json 341 download   job
urls-archive.max.fan-twitter-@Douglasmapd-filtered.txt-shallow-20200711-183148-n1zz0-00000.warc.gz 78759114 download   job
urls-archive.max.fan-twitter-@Douglasmapd-filtered.txt-shallow-20200711-183148-n1zz0-00000.warc.os.cdx.gz 97103 download
urls-archive.max.fan-twitter-@Douglasmapd-filtered.txt-shallow-20200711-183148-n1zz0-meta.warc.gz 55725 download   job
urls-archive.max.fan-twitter-@Douglasmapd-filtered.txt-shallow-20200711-183148-n1zz0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Douglasmapd-filtered.txt-shallow-20200711-183148-n1zz0-urls.txt 56075 download
urls-archive.max.fan-twitter-@Douglasmapd-filtered.txt-shallow-20200711-183148-n1zz0.json 337 download   job
urls-archive.max.fan-twitter-@DoverMAChief-filtered.txt-shallow-20200711-183148-555re-00000.warc.gz 112763243 download   job
urls-archive.max.fan-twitter-@DoverMAChief-filtered.txt-shallow-20200711-183148-555re-00000.warc.os.cdx.gz 121759 download
urls-archive.max.fan-twitter-@DoverMAChief-filtered.txt-shallow-20200711-183148-555re-meta.warc.gz 69456 download   job
urls-archive.max.fan-twitter-@DoverMAChief-filtered.txt-shallow-20200711-183148-555re-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DoverMAChief-filtered.txt-shallow-20200711-183148-555re.json 339 download   job
urls-archive.max.fan-twitter-@DoverNHPolice-filtered.txt-shallow-20200711-183028-dmf4q-meta.warc.gz 46508 download   job
urls-archive.max.fan-twitter-@DoverNHPolice-filtered.txt-shallow-20200711-183028-dmf4q-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DoverNHPolice-filtered.txt-shallow-20200711-183028-dmf4q-urls.txt 34599 download
urls-archive.max.fan-twitter-@DoverNHPolice-filtered.txt-shallow-20200711-183028-dmf4q.json 341 download   job
urls-archive.max.fan-twitter-@DracutMAPD-filtered.txt-shallow-20200711-183027-8u86f-00000.warc.gz 18722194 download   job
urls-archive.max.fan-twitter-@DracutMAPD-filtered.txt-shallow-20200711-183027-8u86f-00000.warc.os.cdx.gz 35986 download
urls-archive.max.fan-twitter-@DracutMAPD-filtered.txt-shallow-20200711-183027-8u86f-meta.warc.gz 23720 download   job
urls-archive.max.fan-twitter-@DracutMAPD-filtered.txt-shallow-20200711-183027-8u86f-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DracutMAPD-filtered.txt-shallow-20200711-183027-8u86f-urls.txt 8236 download
urls-archive.max.fan-twitter-@DracutMAPD-filtered.txt-shallow-20200711-183027-8u86f.json 335 download   job
urls-archive.max.fan-twitter-@DunstablePD-filtered.txt-shallow-20200711-183025-bb196-00000.warc.gz 12345949 download   job
urls-archive.max.fan-twitter-@DunstablePD-filtered.txt-shallow-20200711-183025-bb196-00000.warc.os.cdx.gz 19601 download
urls-archive.max.fan-twitter-@DunstablePD-filtered.txt-shallow-20200711-183025-bb196-meta.warc.gz 14942 download   job
urls-archive.max.fan-twitter-@DunstablePD-filtered.txt-shallow-20200711-183025-bb196-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@DunstablePD-filtered.txt-shallow-20200711-183025-bb196-urls.txt 10150 download
urls-archive.max.fan-twitter-@DunstablePD-filtered.txt-shallow-20200711-183025-bb196.json 337 download   job
urls-archive.max.fan-twitter-@Duxbury_Police-filtered.txt-shallow-20200711-183024-77dec-00000.warc.gz 637466410 download   job
urls-archive.max.fan-twitter-@Duxbury_Police-filtered.txt-shallow-20200711-183024-77dec-00000.warc.os.cdx.gz 583184 download
urls-archive.max.fan-twitter-@Duxbury_Police-filtered.txt-shallow-20200711-183024-77dec-meta.warc.gz 311479 download   job
urls-archive.max.fan-twitter-@Duxbury_Police-filtered.txt-shallow-20200711-183024-77dec-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Duxbury_Police-filtered.txt-shallow-20200711-183024-77dec-urls.txt 198895 download
urls-archive.max.fan-twitter-@Duxbury_Police-filtered.txt-shallow-20200711-183024-77dec.json 343 download   job
urls-archive.max.fan-twitter-@EBPolice-filtered.txt-shallow-20200711-182815-6cvjb-00000.warc.gz 142322753 download   job
urls-archive.max.fan-twitter-@EBPolice-filtered.txt-shallow-20200711-182815-6cvjb-00000.warc.os.cdx.gz 156113 download
urls-archive.max.fan-twitter-@EBPolice-filtered.txt-shallow-20200711-182815-6cvjb-meta.warc.gz 87677 download   job
urls-archive.max.fan-twitter-@EBPolice-filtered.txt-shallow-20200711-182815-6cvjb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@EBPolice-filtered.txt-shallow-20200711-182815-6cvjb-urls.txt 94759 download
urls-archive.max.fan-twitter-@EastonMapd-filtered.txt-shallow-20200711-182907-ehreb-00000.warc.gz 17698594 download   job
urls-archive.max.fan-twitter-@EastonMapd-filtered.txt-shallow-20200711-182907-ehreb-00000.warc.os.cdx.gz 28171 download
urls-archive.max.fan-twitter-@EastonMapd-filtered.txt-shallow-20200711-182907-ehreb-meta.warc.gz 19843 download   job
urls-archive.max.fan-twitter-@EastonMapd-filtered.txt-shallow-20200711-182907-ehreb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@EastonMapd-filtered.txt-shallow-20200711-182907-ehreb-urls.txt 8543 download
urls-archive.max.fan-twitter-@Essex_PD-filtered.txt-shallow-20200711-182812-153d0-00000.warc.gz 19066970 download   job
urls-archive.max.fan-twitter-@Essex_PD-filtered.txt-shallow-20200711-182812-153d0-00000.warc.os.cdx.gz 30625 download
urls-archive.max.fan-twitter-@FallRiverPD-filtered.txt-shallow-20200711-182634-93u43-urls.txt 21246 download
urls-archive.max.fan-twitter-@Fisher_Police-filtered.txt-shallow-20200711-182412-5mgg2-00000.warc.gz 25607226 download   job
urls-archive.max.fan-twitter-@Fisher_Police-filtered.txt-shallow-20200711-182412-5mgg2-00000.warc.os.cdx.gz 31606 download
urls-archive.max.fan-twitter-@Fisher_Police-filtered.txt-shallow-20200711-182412-5mgg2-meta.warc.gz 21501 download   job
urls-archive.max.fan-twitter-@Fisher_Police-filtered.txt-shallow-20200711-182412-5mgg2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Fisher_Police-filtered.txt-shallow-20200711-182412-5mgg2-urls.txt 14597 download
urls-archive.max.fan-twitter-@Fisher_Police-filtered.txt-shallow-20200711-182412-5mgg2.json 341 download   job
urls-archive.max.fan-twitter-@FitchburgPolice-filtered.txt-shallow-20200711-182412-4ev11-meta.warc.gz 75383 download   job
urls-archive.max.fan-twitter-@FitchburgPolice-filtered.txt-shallow-20200711-182412-4ev11-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@FitchburgPolice-filtered.txt-shallow-20200711-182412-4ev11-urls.txt 97210 download
urls-archive.max.fan-twitter-@FitchburgPolice-filtered.txt-shallow-20200711-182412-4ev11.json 345 download   job
urls-archive.max.fan-twitter-@FoxboroughPD-filtered.txt-shallow-20200711-182412-3dsyj-00000.warc.gz 41827540 download   job
urls-archive.max.fan-twitter-@FoxboroughPD-filtered.txt-shallow-20200711-182412-3dsyj-00000.warc.os.cdx.gz 71472 download
urls-archive.max.fan-twitter-@FoxboroughPD-filtered.txt-shallow-20200711-182412-3dsyj-meta.warc.gz 43046 download   job
urls-archive.max.fan-twitter-@FoxboroughPD-filtered.txt-shallow-20200711-182412-3dsyj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@FoxboroughPD-filtered.txt-shallow-20200711-182412-3dsyj.json 339 download   job
urls-archive.max.fan-twitter-@FraminghamFire-filtered.txt-shallow-20200711-181850-dw4a1-00000.warc.gz 295987432 download   job
urls-archive.max.fan-twitter-@FraminghamFire-filtered.txt-shallow-20200711-181850-dw4a1-00000.warc.os.cdx.gz 186348 download
urls-archive.max.fan-twitter-@FraminghamFire-filtered.txt-shallow-20200711-181850-dw4a1-meta.warc.gz 102672 download   job
urls-archive.max.fan-twitter-@FraminghamFire-filtered.txt-shallow-20200711-181850-dw4a1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@FraminghamFire-filtered.txt-shallow-20200711-181850-dw4a1.json 343 download   job
urls-archive.max.fan-twitter-@FraminghamPD-filtered.txt-shallow-20200711-181848-df29w-meta.warc.gz 212380 download   job
urls-archive.max.fan-twitter-@FraminghamPD-filtered.txt-shallow-20200711-181848-df29w-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@GTownSheriff-filtered.txt-shallow-20200711-181825-6be0a-00000.warc.gz 17040914 download   job
urls-archive.max.fan-twitter-@GTownSheriff-filtered.txt-shallow-20200711-181825-6be0a-00000.warc.os.cdx.gz 25373 download
urls-archive.max.fan-twitter-@GTownSheriff-filtered.txt-shallow-20200711-181825-6be0a-meta.warc.gz 17804 download   job
urls-archive.max.fan-twitter-@GTownSheriff-filtered.txt-shallow-20200711-181825-6be0a-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@GTownSheriff-filtered.txt-shallow-20200711-181825-6be0a-urls.txt 5399 download
urls-archive.max.fan-twitter-@GTownSheriff-filtered.txt-shallow-20200711-181825-6be0a.json 339 download   job
urls-archive.max.fan-twitter-@GardnerMassPD-filtered.txt-shallow-20200711-181841-eym54-00000.warc.gz 6696781 download   job
urls-archive.max.fan-twitter-@GardnerMassPD-filtered.txt-shallow-20200711-181841-eym54-00000.warc.os.cdx.gz 15587 download
urls-archive.max.fan-twitter-@GardnerMassPD-filtered.txt-shallow-20200711-181841-eym54-meta.warc.gz 12670 download   job
urls-archive.max.fan-twitter-@GardnerMassPD-filtered.txt-shallow-20200711-181841-eym54-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@GardnerMassPD-filtered.txt-shallow-20200711-181841-eym54.json 341 download   job
urls-archive.max.fan-twitter-@GloucesterPD-filtered.txt-shallow-20200711-181836-852ej-00000.warc.gz 56729479 download   job
urls-archive.max.fan-twitter-@GloucesterPD-filtered.txt-shallow-20200711-181836-852ej-00000.warc.os.cdx.gz 101891 download
urls-archive.max.fan-twitter-@GloucesterPD-filtered.txt-shallow-20200711-181836-852ej-meta.warc.gz 58628 download   job
urls-archive.max.fan-twitter-@GloucesterPD-filtered.txt-shallow-20200711-181836-852ej-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@GloucesterPD-filtered.txt-shallow-20200711-181836-852ej-urls.txt 46439 download
urls-archive.max.fan-twitter-@GraftonPolice-filtered.txt-shallow-20200711-181832-2z650-meta.warc.gz 101352 download   job
urls-archive.max.fan-twitter-@GraftonPolice-filtered.txt-shallow-20200711-181832-2z650-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@GraftonPolice-filtered.txt-shallow-20200711-181832-2z650-urls.txt 76619 download
urls-archive.max.fan-twitter-@GraftonPolice-filtered.txt-shallow-20200711-181832-2z650.json 341 download   job
urls-archive.max.fan-twitter-@GreenfieldPD-filtered.txt-shallow-20200711-181825-cvvlc-meta.warc.gz 130392 download   job
urls-archive.max.fan-twitter-@GreenfieldPD-filtered.txt-shallow-20200711-181825-cvvlc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@GreenfieldPD-filtered.txt-shallow-20200711-181825-cvvlc-urls.txt 218143 download
urls-archive.max.fan-twitter-@GreenfieldPD-filtered.txt-shallow-20200711-181825-cvvlc.json 339 download   job
urls-archive.max.fan-twitter-@GrovelandPolice-filtered.txt-shallow-20200711-181829-9fd9h-meta.warc.gz 27766 download   job
urls-archive.max.fan-twitter-@GrovelandPolice-filtered.txt-shallow-20200711-181829-9fd9h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@GrovelandPolice-filtered.txt-shallow-20200711-181829-9fd9h-urls.txt 15162 download
urls-archive.max.fan-twitter-@GrovelandPolice-filtered.txt-shallow-20200711-181829-9fd9h.json 345 download   job
urls-archive.max.fan-twitter-@HPDMASSCOP-filtered.txt-shallow-20200711-181233-802iq-00000.warc.gz 2586979 download   job
urls-archive.max.fan-twitter-@HPDMASSCOP-filtered.txt-shallow-20200711-181233-802iq-00000.warc.os.cdx.gz 6097 download
urls-archive.max.fan-twitter-@HPDMASSCOP-filtered.txt-shallow-20200711-181233-802iq-meta.warc.gz 7343 download   job
urls-archive.max.fan-twitter-@HPDMASSCOP-filtered.txt-shallow-20200711-181233-802iq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HPDMASSCOP-filtered.txt-shallow-20200711-181233-802iq-urls.txt 969 download
urls-archive.max.fan-twitter-@HPDMASSCOP-filtered.txt-shallow-20200711-181233-802iq.json 335 download   job
urls-archive.max.fan-twitter-@HamptonNHPD-filtered.txt-shallow-20200711-181821-ckvst-00000.warc.gz 107615803 download   job
urls-archive.max.fan-twitter-@HamptonNHPD-filtered.txt-shallow-20200711-181821-ckvst-00000.warc.os.cdx.gz 137902 download
urls-archive.max.fan-twitter-@HamptonNHPD-filtered.txt-shallow-20200711-181821-ckvst-meta.warc.gz 78192 download   job
urls-archive.max.fan-twitter-@HamptonNHPD-filtered.txt-shallow-20200711-181821-ckvst-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HanoverPolice-filtered.txt-shallow-20200711-181729-79ov8-00000.warc.gz 418382610 download   job
urls-archive.max.fan-twitter-@HanoverPolice-filtered.txt-shallow-20200711-181729-79ov8-00000.warc.os.cdx.gz 423001 download
urls-archive.max.fan-twitter-@HanoverPolice-filtered.txt-shallow-20200711-181729-79ov8-meta.warc.gz 227896 download   job
urls-archive.max.fan-twitter-@HanoverPolice-filtered.txt-shallow-20200711-181729-79ov8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HanoverPolice-filtered.txt-shallow-20200711-181729-79ov8.json 341 download   job
urls-archive.max.fan-twitter-@HansonMAPolice-filtered.txt-shallow-20200711-181728-cruo5-00000.warc.gz 101649917 download   job
urls-archive.max.fan-twitter-@HansonMAPolice-filtered.txt-shallow-20200711-181728-cruo5-00000.warc.os.cdx.gz 126515 download
urls-archive.max.fan-twitter-@HansonMAPolice-filtered.txt-shallow-20200711-181728-cruo5-meta.warc.gz 72105 download   job
urls-archive.max.fan-twitter-@HansonMAPolice-filtered.txt-shallow-20200711-181728-cruo5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HansonMAPolice-filtered.txt-shallow-20200711-181728-cruo5-urls.txt 78225 download
urls-archive.max.fan-twitter-@HaverhillPolice-filtered.txt-shallow-20200711-181607-ddh2z-00000.warc.gz 78940491 download   job
urls-archive.max.fan-twitter-@HaverhillPolice-filtered.txt-shallow-20200711-181607-ddh2z-00000.warc.os.cdx.gz 136364 download
urls-archive.max.fan-twitter-@HaverhillPolice-filtered.txt-shallow-20200711-181607-ddh2z-urls.txt 71261 download
urls-archive.max.fan-twitter-@HaverhillPolice-filtered.txt-shallow-20200711-181607-ddh2z.json 345 download   job
urls-archive.max.fan-twitter-@HolbrookPolice-filtered.txt-shallow-20200711-181537-1vsqp-00000.warc.gz 2233343 download   job
urls-archive.max.fan-twitter-@HolbrookPolice-filtered.txt-shallow-20200711-181537-1vsqp-00000.warc.os.cdx.gz 5528 download
urls-archive.max.fan-twitter-@HolbrookPolice-filtered.txt-shallow-20200711-181537-1vsqp-meta.warc.gz 7013 download   job
urls-archive.max.fan-twitter-@HolbrookPolice-filtered.txt-shallow-20200711-181537-1vsqp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HolbrookPolice-filtered.txt-shallow-20200711-181537-1vsqp-urls.txt 1389 download
urls-archive.max.fan-twitter-@HolbrookPolice-filtered.txt-shallow-20200711-181537-1vsqp.json 343 download   job
urls-archive.max.fan-twitter-@Holbrook_PD-filtered.txt-shallow-20200711-181607-5kbv8-meta.warc.gz 16051 download   job
urls-archive.max.fan-twitter-@Holbrook_PD-filtered.txt-shallow-20200711-181607-5kbv8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Holbrook_PD-filtered.txt-shallow-20200711-181607-5kbv8-urls.txt 8559 download
urls-archive.max.fan-twitter-@Holbrook_PD-filtered.txt-shallow-20200711-181607-5kbv8.json 337 download   job
urls-archive.max.fan-twitter-@HollandMAPD-filtered.txt-shallow-20200711-181350-3mb7j-00000.warc.gz 56227910 download   job
urls-archive.max.fan-twitter-@HollandMAPD-filtered.txt-shallow-20200711-181350-3mb7j-00000.warc.os.cdx.gz 57022 download
urls-archive.max.fan-twitter-@HollandMAPD-filtered.txt-shallow-20200711-181350-3mb7j-meta.warc.gz 34824 download   job
urls-archive.max.fan-twitter-@HollandMAPD-filtered.txt-shallow-20200711-181350-3mb7j-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HollandMAPD-filtered.txt-shallow-20200711-181350-3mb7j-urls.txt 19265 download
urls-archive.max.fan-twitter-@HollandMAPD-filtered.txt-shallow-20200711-181350-3mb7j.json 337 download   job
urls-archive.max.fan-twitter-@HollisPolice-filtered.txt-shallow-20200711-181348-c0tgq-00000.warc.gz 88026819 download   job
urls-archive.max.fan-twitter-@HollisPolice-filtered.txt-shallow-20200711-181348-c0tgq-00000.warc.os.cdx.gz 106509 download
urls-archive.max.fan-twitter-@HollisPolice-filtered.txt-shallow-20200711-181348-c0tgq-meta.warc.gz 61680 download   job
urls-archive.max.fan-twitter-@HollisPolice-filtered.txt-shallow-20200711-181348-c0tgq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HollisPolice-filtered.txt-shallow-20200711-181348-c0tgq-urls.txt 62539 download
urls-archive.max.fan-twitter-@HollistonFD-filtered.txt-shallow-20200711-181348-4845e-00000.warc.gz 46718667 download   job
urls-archive.max.fan-twitter-@HollistonFD-filtered.txt-shallow-20200711-181348-4845e-00000.warc.os.cdx.gz 63205 download
urls-archive.max.fan-twitter-@HollistonFD-filtered.txt-shallow-20200711-181348-4845e-meta.warc.gz 38497 download   job
urls-archive.max.fan-twitter-@HollistonFD-filtered.txt-shallow-20200711-181348-4845e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HollistonFD-filtered.txt-shallow-20200711-181348-4845e-urls.txt 23659 download
urls-archive.max.fan-twitter-@HollistonFD-filtered.txt-shallow-20200711-181348-4845e.json 337 download   job
urls-archive.max.fan-twitter-@HollistonPolice-filtered.txt-shallow-20200711-181348-c8k2u-00000.warc.gz 322440045 download   job
urls-archive.max.fan-twitter-@HollistonPolice-filtered.txt-shallow-20200711-181348-c8k2u-00000.warc.os.cdx.gz 349958 download
urls-archive.max.fan-twitter-@HollistonPolice-filtered.txt-shallow-20200711-181348-c8k2u.json 345 download   job
urls-archive.max.fan-twitter-@HopkintonFire-filtered.txt-shallow-20200711-181257-dwhtm-00000.warc.gz 57444289 download   job
urls-archive.max.fan-twitter-@HopkintonFire-filtered.txt-shallow-20200711-181257-dwhtm-00000.warc.os.cdx.gz 59412 download
urls-archive.max.fan-twitter-@HopkintonFire-filtered.txt-shallow-20200711-181257-dwhtm-meta.warc.gz 36510 download   job
urls-archive.max.fan-twitter-@HopkintonFire-filtered.txt-shallow-20200711-181257-dwhtm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HopkintonFire-filtered.txt-shallow-20200711-181257-dwhtm-urls.txt 21343 download
urls-archive.max.fan-twitter-@HubPD-filtered.txt-shallow-20200711-181230-11e94-meta.warc.gz 58923 download   job
urls-archive.max.fan-twitter-@HubPD-filtered.txt-shallow-20200711-181230-11e94-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@HubPD-filtered.txt-shallow-20200711-181230-11e94-urls.txt 37458 download
urls-archive.max.fan-twitter-@HudsonMaPD-filtered.txt-shallow-20200711-181202-39tpb-00000.warc.gz 36825091 download   job
urls-archive.max.fan-twitter-@HudsonMaPD-filtered.txt-shallow-20200711-181202-39tpb-00000.warc.os.cdx.gz 64273 download
urls-archive.max.fan-twitter-@HudsonMaPD-filtered.txt-shallow-20200711-181202-39tpb-meta.warc.gz 39226 download   job
urls-archive.max.fan-twitter-@HudsonMaPD-filtered.txt-shallow-20200711-181202-39tpb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LAPDChiefMoore-filtered.txt-shallow-20200711-181025-32ilu-meta.warc.gz 268451 download   job
urls-archive.max.fan-twitter-@LAPDChiefMoore-filtered.txt-shallow-20200711-181025-32ilu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LAStatePolice-filtered.txt-shallow-20200711-181019-9h9u4-00000.warc.gz 525532922 download   job
urls-archive.max.fan-twitter-@LAStatePolice-filtered.txt-shallow-20200711-181019-9h9u4-00000.warc.os.cdx.gz 693075 download
urls-archive.max.fan-twitter-@LAStatePolice-filtered.txt-shallow-20200711-181019-9h9u4-meta.warc.gz 371623 download   job
urls-archive.max.fan-twitter-@LAStatePolice-filtered.txt-shallow-20200711-181019-9h9u4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LAStatePolice-filtered.txt-shallow-20200711-181019-9h9u4-urls.txt 275461 download
urls-archive.max.fan-twitter-@LAStatePolice-filtered.txt-shallow-20200711-181019-9h9u4.json 341 download   job
urls-archive.max.fan-twitter-@LPD_MA-filtered.txt-shallow-20200711-175746-5cre5-00000.warc.gz 48924371 download   job
urls-archive.max.fan-twitter-@LPD_MA-filtered.txt-shallow-20200711-175746-5cre5-00000.warc.os.cdx.gz 58850 download
urls-archive.max.fan-twitter-@LPD_MA-filtered.txt-shallow-20200711-175746-5cre5-meta.warc.gz 36050 download   job
urls-archive.max.fan-twitter-@LPD_MA-filtered.txt-shallow-20200711-175746-5cre5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LPD_MA-filtered.txt-shallow-20200711-175746-5cre5-urls.txt 35665 download
urls-archive.max.fan-twitter-@LPD_MA-filtered.txt-shallow-20200711-175746-5cre5.json 327 download   job
urls-archive.max.fan-twitter-@LaconiaNHPolice-filtered.txt-shallow-20200711-181137-b3y4a-00000.warc.gz 115629328 download   job
urls-archive.max.fan-twitter-@LaconiaNHPolice-filtered.txt-shallow-20200711-181137-b3y4a-00000.warc.os.cdx.gz 103446 download
urls-archive.max.fan-twitter-@LaconiaNHPolice-filtered.txt-shallow-20200711-181137-b3y4a-meta.warc.gz 59212 download   job
urls-archive.max.fan-twitter-@LaconiaNHPolice-filtered.txt-shallow-20200711-181137-b3y4a-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LaconiaNHPolice-filtered.txt-shallow-20200711-181137-b3y4a.json 345 download   job
urls-archive.max.fan-twitter-@LanesboroughPD-filtered.txt-shallow-20200711-181134-20rz5-00000.warc.gz 13172190 download   job
urls-archive.max.fan-twitter-@LanesboroughPD-filtered.txt-shallow-20200711-181134-20rz5-00000.warc.os.cdx.gz 18745 download
urls-archive.max.fan-twitter-@LanesboroughPD-filtered.txt-shallow-20200711-181134-20rz5-meta.warc.gz 14413 download   job
urls-archive.max.fan-twitter-@LanesboroughPD-filtered.txt-shallow-20200711-181134-20rz5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LanesboroughPD-filtered.txt-shallow-20200711-181134-20rz5-urls.txt 12188 download
urls-archive.max.fan-twitter-@LanesboroughPD-filtered.txt-shallow-20200711-181134-20rz5.json 343 download   job
urls-archive.max.fan-twitter-@LeeMAPD-filtered.txt-shallow-20200711-180956-8newu-00000.warc.gz 28674367 download   job
urls-archive.max.fan-twitter-@LeeMAPD-filtered.txt-shallow-20200711-180956-8newu-00000.warc.os.cdx.gz 35009 download
urls-archive.max.fan-twitter-@LeeMAPD-filtered.txt-shallow-20200711-180956-8newu-meta.warc.gz 23152 download   job
urls-archive.max.fan-twitter-@LeeMAPD-filtered.txt-shallow-20200711-180956-8newu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LeeMAPD-filtered.txt-shallow-20200711-180956-8newu-urls.txt 7625 download
urls-archive.max.fan-twitter-@LeeMAPD-filtered.txt-shallow-20200711-180956-8newu.json 329 download   job
urls-archive.max.fan-twitter-@LeominsterPD-filtered.txt-shallow-20200711-180930-42hin-00000.warc.gz 4648036 download   job
urls-archive.max.fan-twitter-@LeominsterPD-filtered.txt-shallow-20200711-180930-42hin-00000.warc.os.cdx.gz 14679 download
urls-archive.max.fan-twitter-@LeominsterPD-filtered.txt-shallow-20200711-180930-42hin-urls.txt 2502 download
urls-archive.max.fan-twitter-@LeominsterPD-filtered.txt-shallow-20200711-180930-42hin.json 339 download   job
urls-archive.max.fan-twitter-@LexingtonPolice-filtered.txt-shallow-20200711-180929-42ah3-00000.warc.gz 62004213 download   job
urls-archive.max.fan-twitter-@LexingtonPolice-filtered.txt-shallow-20200711-180929-42ah3-00000.warc.os.cdx.gz 80573 download
urls-archive.max.fan-twitter-@LexingtonPolice-filtered.txt-shallow-20200711-180929-42ah3-meta.warc.gz 47337 download   job
urls-archive.max.fan-twitter-@LexingtonPolice-filtered.txt-shallow-20200711-180929-42ah3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LexingtonPolice-filtered.txt-shallow-20200711-180929-42ah3-urls.txt 40938 download
urls-archive.max.fan-twitter-@LexingtonPolice-filtered.txt-shallow-20200711-180929-42ah3.json 345 download   job
urls-archive.max.fan-twitter-@LowellFireDept-filtered.txt-shallow-20200711-175815-79u23-00000.warc.gz 62395682 download   job
urls-archive.max.fan-twitter-@LowellFireDept-filtered.txt-shallow-20200711-175815-79u23-00000.warc.os.cdx.gz 93504 download
urls-archive.max.fan-twitter-@LowellFireDept-filtered.txt-shallow-20200711-175815-79u23-meta.warc.gz 54701 download   job
urls-archive.max.fan-twitter-@LowellFireDept-filtered.txt-shallow-20200711-175815-79u23-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LowellFireDept-filtered.txt-shallow-20200711-175815-79u23-urls.txt 27703 download
urls-archive.max.fan-twitter-@LowellFireDept-filtered.txt-shallow-20200711-175815-79u23.json 343 download   job
urls-archive.max.fan-twitter-@LowellPD-filtered.txt-shallow-20200711-175751-ekzoz-00000.warc.gz 659191890 download   job
urls-archive.max.fan-twitter-@LowellPD-filtered.txt-shallow-20200711-175751-ekzoz-00000.warc.os.cdx.gz 804339 download
urls-archive.max.fan-twitter-@LowellPD-filtered.txt-shallow-20200711-175751-ekzoz-meta.warc.gz 431064 download   job
urls-archive.max.fan-twitter-@LowellPD-filtered.txt-shallow-20200711-175751-ekzoz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LowellPD-filtered.txt-shallow-20200711-175751-ekzoz-urls.txt 237866 download
urls-archive.max.fan-twitter-@LowellPD-filtered.txt-shallow-20200711-175751-ekzoz.json 331 download   job
urls-archive.max.fan-twitter-@LynchburgPolice-filtered.txt-shallow-20200711-175745-af9w7-00000.warc.gz 244438771 download   job
urls-archive.max.fan-twitter-@LynchburgPolice-filtered.txt-shallow-20200711-175745-af9w7-00000.warc.os.cdx.gz 288212 download
urls-archive.max.fan-twitter-@LynchburgPolice-filtered.txt-shallow-20200711-175745-af9w7-meta.warc.gz 158057 download   job
urls-archive.max.fan-twitter-@LynchburgPolice-filtered.txt-shallow-20200711-175745-af9w7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LynchburgPolice-filtered.txt-shallow-20200711-175745-af9w7-urls.txt 154287 download
urls-archive.max.fan-twitter-@LynchburgPolice-filtered.txt-shallow-20200711-175745-af9w7.json 345 download   job
urls-archive.max.fan-twitter-@LynnPoliceDept-filtered.txt-shallow-20200711-175722-6d25e-00000.warc.gz 116770963 download   job
urls-archive.max.fan-twitter-@LynnPoliceDept-filtered.txt-shallow-20200711-175722-6d25e-00000.warc.os.cdx.gz 178108 download
urls-archive.max.fan-twitter-@LynnPoliceDept-filtered.txt-shallow-20200711-175722-6d25e-meta.warc.gz 99576 download   job
urls-archive.max.fan-twitter-@LynnPoliceDept-filtered.txt-shallow-20200711-175722-6d25e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LynnPoliceDept-filtered.txt-shallow-20200711-175722-6d25e-urls.txt 61917 download
urls-archive.max.fan-twitter-@LynnPoliceDept-filtered.txt-shallow-20200711-175722-6d25e.json 343 download   job
urls-archive.max.fan-twitter-@MAEnviroPolice-filtered.txt-shallow-20200711-175721-9826c-00000.warc.gz 176615949 download   job
urls-archive.max.fan-twitter-@MAEnviroPolice-filtered.txt-shallow-20200711-175721-9826c-00000.warc.os.cdx.gz 203269 download
urls-archive.max.fan-twitter-@MAEnviroPolice-filtered.txt-shallow-20200711-175721-9826c-meta.warc.gz 112292 download   job
urls-archive.max.fan-twitter-@MAEnviroPolice-filtered.txt-shallow-20200711-175721-9826c-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MAEnviroPolice-filtered.txt-shallow-20200711-175721-9826c-urls.txt 43063 download
urls-archive.max.fan-twitter-@MaldenPolice-filtered.txt-shallow-20200711-175721-dl82j-00000.warc.gz 183320810 download   job
urls-archive.max.fan-twitter-@MaldenPolice-filtered.txt-shallow-20200711-175721-dl82j-00000.warc.os.cdx.gz 255273 download
urls-archive.max.fan-twitter-@MaldenPolice-filtered.txt-shallow-20200711-175721-dl82j-meta.warc.gz 140390 download   job
urls-archive.max.fan-twitter-@MaldenPolice-filtered.txt-shallow-20200711-175721-dl82j-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MaldenPolice-filtered.txt-shallow-20200711-175721-dl82j-urls.txt 74223 download
urls-archive.max.fan-twitter-@MaldenPolice-filtered.txt-shallow-20200711-175721-dl82j.json 339 download   job
urls-archive.max.fan-twitter-@ManchesterMAPD-filtered.txt-shallow-20200711-175720-13epu-00000.warc.gz 131304590 download   job
urls-archive.max.fan-twitter-@ManchesterMAPD-filtered.txt-shallow-20200711-175720-13epu-00000.warc.os.cdx.gz 154964 download
urls-archive.max.fan-twitter-@ManchesterMAPD-filtered.txt-shallow-20200711-175720-13epu-meta.warc.gz 87523 download   job
urls-archive.max.fan-twitter-@ManchesterMAPD-filtered.txt-shallow-20200711-175720-13epu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ManchesterMAPD-filtered.txt-shallow-20200711-175720-13epu-urls.txt 79441 download
urls-archive.max.fan-twitter-@MansfieldMAPD-filtered.txt-shallow-20200711-175720-cg459-00000.warc.gz 236549712 download   job
urls-archive.max.fan-twitter-@MansfieldMAPD-filtered.txt-shallow-20200711-175720-cg459-00000.warc.os.cdx.gz 292088 download
urls-archive.max.fan-twitter-@MansfieldMAPD-filtered.txt-shallow-20200711-175720-cg459-meta.warc.gz 157319 download   job
urls-archive.max.fan-twitter-@MansfieldMAPD-filtered.txt-shallow-20200711-175720-cg459-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MansfieldMAPD-filtered.txt-shallow-20200711-175720-cg459-urls.txt 203709 download
urls-archive.max.fan-twitter-@MansfieldMAPD-filtered.txt-shallow-20200711-175720-cg459.json 341 download   job
urls-archive.max.fan-twitter-@MarlboroughMaPD-filtered.txt-shallow-20200711-175719-5i7h7-meta.warc.gz 44262 download   job
urls-archive.max.fan-twitter-@MarlboroughMaPD-filtered.txt-shallow-20200711-175719-5i7h7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MarlboroughMaPD-filtered.txt-shallow-20200711-175719-5i7h7-urls.txt 62337 download
urls-archive.max.fan-twitter-@NEvnen-filtered.txt-shallow-20200711-205744-ao09g-00000.warc.gz 24493426 download   job
urls-archive.max.fan-twitter-@NEvnen-filtered.txt-shallow-20200711-205744-ao09g-00000.warc.os.cdx.gz 32820 download
urls-archive.max.fan-twitter-@NEvnen-filtered.txt-shallow-20200711-205744-ao09g-meta.warc.gz 21833 download   job
urls-archive.max.fan-twitter-@NEvnen-filtered.txt-shallow-20200711-205744-ao09g-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NEvnen-filtered.txt-shallow-20200711-205744-ao09g-urls.txt 6534 download
urls-archive.max.fan-twitter-@NEvnen-filtered.txt-shallow-20200711-205744-ao09g.json 327 download   job
urls-archive.max.fan-twitter-@NHC_Surge-filtered.txt-shallow-20200711-203800-6bgo7-meta.warc.gz 108591 download   job
urls-archive.max.fan-twitter-@NHC_Surge-filtered.txt-shallow-20200711-203800-6bgo7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NHC_Surge-filtered.txt-shallow-20200711-203800-6bgo7.json 333 download   job
urls-archive.max.fan-twitter-@NHC_TAFB-filtered.txt-shallow-20200711-203759-7j1iy-00000.warc.gz 626848197 download   job
urls-archive.max.fan-twitter-@NHC_TAFB-filtered.txt-shallow-20200711-203759-7j1iy-00000.warc.os.cdx.gz 268379 download
urls-archive.max.fan-twitter-@NHC_TAFB-filtered.txt-shallow-20200711-203759-7j1iy.json 331 download   job
urls-archive.max.fan-twitter-@NHSecretary-filtered.txt-shallow-20200711-203639-crry8-00000.warc.gz 14796132 download   job
urls-archive.max.fan-twitter-@NHSecretary-filtered.txt-shallow-20200711-203639-crry8-00000.warc.os.cdx.gz 24253 download
urls-archive.max.fan-twitter-@NHSecretary-filtered.txt-shallow-20200711-203639-crry8-meta.warc.gz 17281 download   job
urls-archive.max.fan-twitter-@NHSecretary-filtered.txt-shallow-20200711-203639-crry8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NHSecretary-filtered.txt-shallow-20200711-203639-crry8-urls.txt 4661 download
urls-archive.max.fan-twitter-@NHSecretary-filtered.txt-shallow-20200711-203639-crry8.json 337 download   job
urls-archive.max.fan-twitter-@NJStateDept-filtered.txt-shallow-20200711-201918-9myfh-00000.warc.gz 128938219 download   job
urls-archive.max.fan-twitter-@NJStateDept-filtered.txt-shallow-20200711-201918-9myfh-00000.warc.os.cdx.gz 151136 download
urls-archive.max.fan-twitter-@NJStateDept-filtered.txt-shallow-20200711-201918-9myfh-meta.warc.gz 84708 download   job
urls-archive.max.fan-twitter-@NJStateDept-filtered.txt-shallow-20200711-201918-9myfh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NJStateDept-filtered.txt-shallow-20200711-201918-9myfh-urls.txt 44464 download
urls-archive.max.fan-twitter-@NJStateDept-filtered.txt-shallow-20200711-201918-9myfh.json 337 download   job
urls-archive.max.fan-twitter-@NLADA-filtered.txt-shallow-20200711-201917-chrfc-meta.warc.gz 35661 download   job
urls-archive.max.fan-twitter-@NLADA-filtered.txt-shallow-20200711-201917-chrfc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NLADA-filtered.txt-shallow-20200711-201917-chrfc-urls.txt 21835 download
urls-archive.max.fan-twitter-@NLADA-filtered.txt-shallow-20200711-201917-chrfc.json 325 download   job
urls-archive.max.fan-twitter-@NMSecOfState-filtered.txt-shallow-20200711-201644-2cldj-urls.txt 55960 download
urls-archive.max.fan-twitter-@NPSHistory-filtered.txt-shallow-20200711-201426-7omtb-00000.warc.gz 2587389 download   job
urls-archive.max.fan-twitter-@NPSHistory-filtered.txt-shallow-20200711-201426-7omtb-00000.warc.os.cdx.gz 7258 download
urls-archive.max.fan-twitter-@NPSHistory-filtered.txt-shallow-20200711-201426-7omtb-meta.warc.gz 8022 download   job
urls-archive.max.fan-twitter-@NPSHistory-filtered.txt-shallow-20200711-201426-7omtb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NPSHistory-filtered.txt-shallow-20200711-201426-7omtb-urls.txt 1378 download
urls-archive.max.fan-twitter-@NPSUrban-filtered.txt-shallow-20200711-201424-5uyj5-00000.warc.gz 53427445 download   job
urls-archive.max.fan-twitter-@NPSUrban-filtered.txt-shallow-20200711-201424-5uyj5-00000.warc.os.cdx.gz 87553 download
urls-archive.max.fan-twitter-@NPSUrban-filtered.txt-shallow-20200711-201424-5uyj5-meta.warc.gz 51516 download   job
urls-archive.max.fan-twitter-@NPSUrban-filtered.txt-shallow-20200711-201424-5uyj5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NPSUrban-filtered.txt-shallow-20200711-201424-5uyj5-urls.txt 18920 download
urls-archive.max.fan-twitter-@NPSUrban-filtered.txt-shallow-20200711-201424-5uyj5.json 331 download   job
urls-archive.max.fan-twitter-@NVGOP-filtered.txt-shallow-20200711-200651-7gsqb-urls.txt 215001 download
urls-archive.max.fan-twitter-@NVSOS-filtered.txt-shallow-20200711-200650-9azux-meta.warc.gz 102763 download   job
urls-archive.max.fan-twitter-@NVSOS-filtered.txt-shallow-20200711-200650-9azux-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NWSWilmingtonNC-filtered.txt-shallow-20200711-194602-54uue-urls.txt 477661 download
urls-archive.max.fan-twitter-@NWSWilmingtonNC-filtered.txt-shallow-20200711-194602-54uue.json 345 download   job
urls-archive.max.fan-twitter-@NYCHealthCommr-filtered.txt-shallow-20200711-194600-1uwu0-00000.warc.gz 21303073 download   job
urls-archive.max.fan-twitter-@NYCHealthCommr-filtered.txt-shallow-20200711-194600-1uwu0-00000.warc.os.cdx.gz 58527 download
urls-archive.max.fan-twitter-@NYCHealthCommr-filtered.txt-shallow-20200711-194600-1uwu0-meta.warc.gz 35695 download   job
urls-archive.max.fan-twitter-@NYCHealthCommr-filtered.txt-shallow-20200711-194600-1uwu0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYCHealthCommr-filtered.txt-shallow-20200711-194600-1uwu0-urls.txt 4712 download
urls-archive.max.fan-twitter-@NYCHealthCommr-filtered.txt-shallow-20200711-194600-1uwu0.json 343 download   job
urls-archive.max.fan-twitter-@NYCPBA-filtered.txt-shallow-20200711-194538-ldr8v-00000.warc.gz 123641267 download   job
urls-archive.max.fan-twitter-@NYCPBA-filtered.txt-shallow-20200711-194538-ldr8v-00000.warc.os.cdx.gz 337673 download
urls-archive.max.fan-twitter-@NYCPBA-filtered.txt-shallow-20200711-194538-ldr8v-meta.warc.gz 184439 download   job
urls-archive.max.fan-twitter-@NYCPBA-filtered.txt-shallow-20200711-194538-ldr8v-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYCPBA-filtered.txt-shallow-20200711-194538-ldr8v-urls.txt 24624 download
urls-archive.max.fan-twitter-@NYCPBA-filtered.txt-shallow-20200711-194538-ldr8v.json 327 download   job
urls-archive.max.fan-twitter-@NYCPBA_GC-filtered.txt-shallow-20200711-194559-5c1c8-00000.warc.gz 7322876 download   job
urls-archive.max.fan-twitter-@NYCPBA_GC-filtered.txt-shallow-20200711-194559-5c1c8-00000.warc.os.cdx.gz 33163 download
urls-archive.max.fan-twitter-@NYCPBA_GC-filtered.txt-shallow-20200711-194559-5c1c8-meta.warc.gz 21955 download   job
urls-archive.max.fan-twitter-@NYCPBA_GC-filtered.txt-shallow-20200711-194559-5c1c8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYCPBA_GC-filtered.txt-shallow-20200711-194559-5c1c8-urls.txt 2166 download
urls-archive.max.fan-twitter-@NYCPBA_GC-filtered.txt-shallow-20200711-194559-5c1c8.json 333 download   job
urls-archive.max.fan-twitter-@NYCPDDEA-filtered.txt-shallow-20200711-194537-cj2th-00000.warc.gz 5708591 download   job
urls-archive.max.fan-twitter-@NYCPDDEA-filtered.txt-shallow-20200711-194537-cj2th-00000.warc.os.cdx.gz 24627 download
urls-archive.max.fan-twitter-@NYCPDDEA-filtered.txt-shallow-20200711-194537-cj2th-meta.warc.gz 17482 download   job
urls-archive.max.fan-twitter-@NYCPDDEA-filtered.txt-shallow-20200711-194537-cj2th-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYCPDDEA-filtered.txt-shallow-20200711-194537-cj2th-urls.txt 1232 download
urls-archive.max.fan-twitter-@NYCPDDEA-filtered.txt-shallow-20200711-194537-cj2th.json 331 download   job
urls-archive.max.fan-twitter-@NYPD100Pct-filtered.txt-shallow-20200711-194534-9htao-00000.warc.gz 3062962 download   job
urls-archive.max.fan-twitter-@NYPD100Pct-filtered.txt-shallow-20200711-194534-9htao-00000.warc.os.cdx.gz 10259 download
urls-archive.max.fan-twitter-@NYPD100Pct-filtered.txt-shallow-20200711-194534-9htao-meta.warc.gz 9633 download   job
urls-archive.max.fan-twitter-@NYPD100Pct-filtered.txt-shallow-20200711-194534-9htao-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD100Pct-filtered.txt-shallow-20200711-194534-9htao-urls.txt 638 download
urls-archive.max.fan-twitter-@NYPD100Pct-filtered.txt-shallow-20200711-194534-9htao.json 335 download   job
urls-archive.max.fan-twitter-@NYPD101Pct-filtered.txt-shallow-20200711-194534-6p0w8-00000.warc.gz 6669248 download   job
urls-archive.max.fan-twitter-@NYPD101Pct-filtered.txt-shallow-20200711-194534-6p0w8-00000.warc.os.cdx.gz 16474 download
urls-archive.max.fan-twitter-@NYPD101Pct-filtered.txt-shallow-20200711-194534-6p0w8-meta.warc.gz 13007 download   job
urls-archive.max.fan-twitter-@NYPD101Pct-filtered.txt-shallow-20200711-194534-6p0w8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD101Pct-filtered.txt-shallow-20200711-194534-6p0w8-urls.txt 1160 download
urls-archive.max.fan-twitter-@NYPD101Pct-filtered.txt-shallow-20200711-194534-6p0w8.json 335 download   job
urls-archive.max.fan-twitter-@NYPD102Pct-filtered.txt-shallow-20200711-194510-8kau8-00000.warc.gz 2032640 download   job
urls-archive.max.fan-twitter-@NYPD102Pct-filtered.txt-shallow-20200711-194510-8kau8-00000.warc.os.cdx.gz 6576 download
urls-archive.max.fan-twitter-@NYPD102Pct-filtered.txt-shallow-20200711-194510-8kau8-meta.warc.gz 7618 download   job
urls-archive.max.fan-twitter-@NYPD102Pct-filtered.txt-shallow-20200711-194510-8kau8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD102Pct-filtered.txt-shallow-20200711-194510-8kau8-urls.txt 348 download
urls-archive.max.fan-twitter-@NYPD102Pct-filtered.txt-shallow-20200711-194510-8kau8.json 335 download   job
urls-archive.max.fan-twitter-@NYPD103Pct-filtered.txt-shallow-20200711-194506-d939e-00000.warc.gz 7385668 download   job
urls-archive.max.fan-twitter-@NYPD103Pct-filtered.txt-shallow-20200711-194506-d939e-00000.warc.os.cdx.gz 10501 download
urls-archive.max.fan-twitter-@NYPD103Pct-filtered.txt-shallow-20200711-194506-d939e-meta.warc.gz 9782 download   job
urls-archive.max.fan-twitter-@NYPD103Pct-filtered.txt-shallow-20200711-194506-d939e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD103Pct-filtered.txt-shallow-20200711-194506-d939e-urls.txt 986 download
urls-archive.max.fan-twitter-@NYPD103Pct-filtered.txt-shallow-20200711-194506-d939e.json 335 download   job
urls-archive.max.fan-twitter-@NYPD104Pct-filtered.txt-shallow-20200711-194505-aohnr-00000.warc.gz 12527073 download   job
urls-archive.max.fan-twitter-@NYPD104Pct-filtered.txt-shallow-20200711-194505-aohnr-00000.warc.os.cdx.gz 26138 download
urls-archive.max.fan-twitter-@NYPD104Pct-filtered.txt-shallow-20200711-194505-aohnr-meta.warc.gz 18019 download   job
urls-archive.max.fan-twitter-@NYPD104Pct-filtered.txt-shallow-20200711-194505-aohnr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD104Pct-filtered.txt-shallow-20200711-194505-aohnr-urls.txt 2548 download
urls-archive.max.fan-twitter-@NYPD104Pct-filtered.txt-shallow-20200711-194505-aohnr.json 335 download   job
urls-archive.max.fan-twitter-@NYPD105Pct-filtered.txt-shallow-20200711-194505-alvtx-00000.warc.gz 12180141 download   job
urls-archive.max.fan-twitter-@NYPD105Pct-filtered.txt-shallow-20200711-194505-alvtx-00000.warc.os.cdx.gz 14886 download
urls-archive.max.fan-twitter-@NYPD105Pct-filtered.txt-shallow-20200711-194505-alvtx-meta.warc.gz 12142 download   job
urls-archive.max.fan-twitter-@NYPD105Pct-filtered.txt-shallow-20200711-194505-alvtx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD105Pct-filtered.txt-shallow-20200711-194505-alvtx-urls.txt 1740 download
urls-archive.max.fan-twitter-@NYPD105Pct-filtered.txt-shallow-20200711-194505-alvtx.json 335 download   job
urls-archive.max.fan-twitter-@NYPD106Pct-filtered.txt-shallow-20200711-194503-b29nb-00000.warc.gz 6719574 download   job
urls-archive.max.fan-twitter-@NYPD106Pct-filtered.txt-shallow-20200711-194503-b29nb-00000.warc.os.cdx.gz 14016 download
urls-archive.max.fan-twitter-@NYPD106Pct-filtered.txt-shallow-20200711-194503-b29nb-meta.warc.gz 11689 download   job
urls-archive.max.fan-twitter-@NYPD106Pct-filtered.txt-shallow-20200711-194503-b29nb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD106Pct-filtered.txt-shallow-20200711-194503-b29nb-urls.txt 1218 download
urls-archive.max.fan-twitter-@NYPD106Pct-filtered.txt-shallow-20200711-194503-b29nb.json 335 download   job
urls-archive.max.fan-twitter-@NYPD107Pct-filtered.txt-shallow-20200711-194440-2nmfh-00000.warc.gz 7177470 download   job
urls-archive.max.fan-twitter-@NYPD107Pct-filtered.txt-shallow-20200711-194440-2nmfh-00000.warc.os.cdx.gz 12931 download
urls-archive.max.fan-twitter-@NYPD107Pct-filtered.txt-shallow-20200711-194440-2nmfh-meta.warc.gz 11112 download   job
urls-archive.max.fan-twitter-@NYPD107Pct-filtered.txt-shallow-20200711-194440-2nmfh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD107Pct-filtered.txt-shallow-20200711-194440-2nmfh-urls.txt 1271 download
urls-archive.max.fan-twitter-@NYPD107Pct-filtered.txt-shallow-20200711-194440-2nmfh.json 335 download   job
urls-archive.max.fan-twitter-@NYPD108Pct-filtered.txt-shallow-20200711-194438-eljp2-00000.warc.gz 6414088 download   job
urls-archive.max.fan-twitter-@NYPD108Pct-filtered.txt-shallow-20200711-194438-eljp2-00000.warc.os.cdx.gz 12196 download
urls-archive.max.fan-twitter-@NYPD108Pct-filtered.txt-shallow-20200711-194438-eljp2-meta.warc.gz 10698 download   job
urls-archive.max.fan-twitter-@NYPD108Pct-filtered.txt-shallow-20200711-194438-eljp2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD108Pct-filtered.txt-shallow-20200711-194438-eljp2-urls.txt 1102 download
urls-archive.max.fan-twitter-@NYPD108Pct-filtered.txt-shallow-20200711-194438-eljp2.json 335 download   job
urls-archive.max.fan-twitter-@NYPD109Pct-filtered.txt-shallow-20200711-194437-9mwdn-00000.warc.gz 3843907 download   job
urls-archive.max.fan-twitter-@NYPD109Pct-filtered.txt-shallow-20200711-194437-9mwdn-00000.warc.os.cdx.gz 9444 download
urls-archive.max.fan-twitter-@NYPD109Pct-filtered.txt-shallow-20200711-194437-9mwdn-meta.warc.gz 9220 download   job
urls-archive.max.fan-twitter-@NYPD109Pct-filtered.txt-shallow-20200711-194437-9mwdn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD109Pct-filtered.txt-shallow-20200711-194437-9mwdn-urls.txt 754 download
urls-archive.max.fan-twitter-@NYPD109Pct-filtered.txt-shallow-20200711-194437-9mwdn.json 335 download   job
urls-archive.max.fan-twitter-@NYPD10Pct-filtered.txt-shallow-20200711-194437-7ec4e-00000.warc.gz 4786037 download   job
urls-archive.max.fan-twitter-@NYPD10Pct-filtered.txt-shallow-20200711-194437-7ec4e-00000.warc.os.cdx.gz 11829 download
urls-archive.max.fan-twitter-@NYPD10Pct-filtered.txt-shallow-20200711-194437-7ec4e-meta.warc.gz 10495 download   job
urls-archive.max.fan-twitter-@NYPD10Pct-filtered.txt-shallow-20200711-194437-7ec4e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD10Pct-filtered.txt-shallow-20200711-194437-7ec4e-urls.txt 798 download
urls-archive.max.fan-twitter-@NYPD10Pct-filtered.txt-shallow-20200711-194437-7ec4e.json 333 download   job
urls-archive.max.fan-twitter-@NYPD110Pct-filtered.txt-shallow-20200711-194436-8kgdu-00000.warc.gz 6479568 download   job
urls-archive.max.fan-twitter-@NYPD110Pct-filtered.txt-shallow-20200711-194436-8kgdu-00000.warc.os.cdx.gz 12005 download
urls-archive.max.fan-twitter-@NYPD110Pct-filtered.txt-shallow-20200711-194436-8kgdu-meta.warc.gz 10599 download   job
urls-archive.max.fan-twitter-@NYPD110Pct-filtered.txt-shallow-20200711-194436-8kgdu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD110Pct-filtered.txt-shallow-20200711-194436-8kgdu-urls.txt 870 download
urls-archive.max.fan-twitter-@NYPD110Pct-filtered.txt-shallow-20200711-194436-8kgdu.json 335 download   job
urls-archive.max.fan-twitter-@NYPD111Pct-filtered.txt-shallow-20200711-194417-2okdh-00000.warc.gz 6466784 download   job
urls-archive.max.fan-twitter-@NYPD111Pct-filtered.txt-shallow-20200711-194417-2okdh-00000.warc.os.cdx.gz 12681 download
urls-archive.max.fan-twitter-@NYPD111Pct-filtered.txt-shallow-20200711-194417-2okdh-meta.warc.gz 10965 download   job
urls-archive.max.fan-twitter-@NYPD111Pct-filtered.txt-shallow-20200711-194417-2okdh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD111Pct-filtered.txt-shallow-20200711-194417-2okdh-urls.txt 986 download
urls-archive.max.fan-twitter-@NYPD111Pct-filtered.txt-shallow-20200711-194417-2okdh.json 335 download   job
urls-archive.max.fan-twitter-@NYPD112Pct-filtered.txt-shallow-20200711-194414-d5dcs-00000.warc.gz 3688194 download   job
urls-archive.max.fan-twitter-@NYPD112Pct-filtered.txt-shallow-20200711-194414-d5dcs-00000.warc.os.cdx.gz 9577 download
urls-archive.max.fan-twitter-@NYPD112Pct-filtered.txt-shallow-20200711-194414-d5dcs-meta.warc.gz 9265 download   job
urls-archive.max.fan-twitter-@NYPD112Pct-filtered.txt-shallow-20200711-194414-d5dcs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD112Pct-filtered.txt-shallow-20200711-194414-d5dcs-urls.txt 870 download
urls-archive.max.fan-twitter-@NYPD112Pct-filtered.txt-shallow-20200711-194414-d5dcs.json 335 download   job
urls-archive.max.fan-twitter-@NYPD113Pct-filtered.txt-shallow-20200711-194411-dce0s-00000.warc.gz 6503285 download   job
urls-archive.max.fan-twitter-@NYPD113Pct-filtered.txt-shallow-20200711-194411-dce0s-00000.warc.os.cdx.gz 14435 download
urls-archive.max.fan-twitter-@NYPD113Pct-filtered.txt-shallow-20200711-194411-dce0s-meta.warc.gz 11977 download   job
urls-archive.max.fan-twitter-@NYPD113Pct-filtered.txt-shallow-20200711-194411-dce0s-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD113Pct-filtered.txt-shallow-20200711-194411-dce0s-urls.txt 1275 download
urls-archive.max.fan-twitter-@NYPD113Pct-filtered.txt-shallow-20200711-194411-dce0s.json 335 download   job
urls-archive.max.fan-twitter-@NYPD114Pct-filtered.txt-shallow-20200711-194410-9oax6-00000.warc.gz 3563017 download   job
urls-archive.max.fan-twitter-@NYPD114Pct-filtered.txt-shallow-20200711-194410-9oax6-00000.warc.os.cdx.gz 8977 download
urls-archive.max.fan-twitter-@NYPD114Pct-filtered.txt-shallow-20200711-194410-9oax6-meta.warc.gz 8941 download   job
urls-archive.max.fan-twitter-@NYPD114Pct-filtered.txt-shallow-20200711-194410-9oax6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD114Pct-filtered.txt-shallow-20200711-194410-9oax6-urls.txt 406 download
urls-archive.max.fan-twitter-@NYPD114Pct-filtered.txt-shallow-20200711-194410-9oax6.json 335 download   job
urls-archive.max.fan-twitter-@NYPD115Pct-filtered.txt-shallow-20200711-194410-d3bcw-00000.warc.gz 5231850 download   job
urls-archive.max.fan-twitter-@NYPD115Pct-filtered.txt-shallow-20200711-194410-d3bcw-00000.warc.os.cdx.gz 16247 download
urls-archive.max.fan-twitter-@NYPD115Pct-filtered.txt-shallow-20200711-194410-d3bcw-meta.warc.gz 12824 download   job
urls-archive.max.fan-twitter-@NYPD115Pct-filtered.txt-shallow-20200711-194410-d3bcw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD115Pct-filtered.txt-shallow-20200711-194410-d3bcw-urls.txt 1682 download
urls-archive.max.fan-twitter-@NYPD115Pct-filtered.txt-shallow-20200711-194410-d3bcw.json 335 download   job
urls-archive.max.fan-twitter-@NYPD13Pct-filtered.txt-shallow-20200711-194350-c63ht-00000.warc.gz 7283662 download   job
urls-archive.max.fan-twitter-@NYPD13Pct-filtered.txt-shallow-20200711-194350-c63ht-00000.warc.os.cdx.gz 7716 download
urls-archive.max.fan-twitter-@NYPD13Pct-filtered.txt-shallow-20200711-194350-c63ht-meta.warc.gz 8186 download   job
urls-archive.max.fan-twitter-@NYPD13Pct-filtered.txt-shallow-20200711-194350-c63ht-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD13Pct-filtered.txt-shallow-20200711-194350-c63ht-urls.txt 399 download
urls-archive.max.fan-twitter-@NYPD13Pct-filtered.txt-shallow-20200711-194350-c63ht.json 333 download   job
urls-archive.max.fan-twitter-@NYPD17Pct-filtered.txt-shallow-20200711-194346-6z0ze-00000.warc.gz 2022204 download   job
urls-archive.max.fan-twitter-@NYPD17Pct-filtered.txt-shallow-20200711-194346-6z0ze-00000.warc.os.cdx.gz 6709 download
urls-archive.max.fan-twitter-@NYPD17Pct-filtered.txt-shallow-20200711-194346-6z0ze-meta.warc.gz 7691 download   job
urls-archive.max.fan-twitter-@NYPD17Pct-filtered.txt-shallow-20200711-194346-6z0ze-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD17Pct-filtered.txt-shallow-20200711-194346-6z0ze-urls.txt 285 download
urls-archive.max.fan-twitter-@NYPD17Pct-filtered.txt-shallow-20200711-194346-6z0ze.json 333 download   job
urls-archive.max.fan-twitter-@NYPD19Pct-filtered.txt-shallow-20200711-194344-2jbx8-00000.warc.gz 18988415 download   job
urls-archive.max.fan-twitter-@NYPD19Pct-filtered.txt-shallow-20200711-194344-2jbx8-00000.warc.os.cdx.gz 43593 download
urls-archive.max.fan-twitter-@NYPD19Pct-filtered.txt-shallow-20200711-194344-2jbx8-meta.warc.gz 27293 download   job
urls-archive.max.fan-twitter-@NYPD19Pct-filtered.txt-shallow-20200711-194344-2jbx8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD19Pct-filtered.txt-shallow-20200711-194344-2jbx8-urls.txt 3933 download
urls-archive.max.fan-twitter-@NYPD19Pct-filtered.txt-shallow-20200711-194344-2jbx8.json 333 download   job
urls-archive.max.fan-twitter-@NYPD1Pct-filtered.txt-shallow-20200711-194342-dleey-00000.warc.gz 3608098 download   job
urls-archive.max.fan-twitter-@NYPD1Pct-filtered.txt-shallow-20200711-194342-dleey-00000.warc.os.cdx.gz 8400 download
urls-archive.max.fan-twitter-@NYPD1Pct-filtered.txt-shallow-20200711-194342-dleey-meta.warc.gz 8608 download   job
urls-archive.max.fan-twitter-@NYPD1Pct-filtered.txt-shallow-20200711-194342-dleey-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD1Pct-filtered.txt-shallow-20200711-194342-dleey-urls.txt 392 download
urls-archive.max.fan-twitter-@NYPD1Pct-filtered.txt-shallow-20200711-194342-dleey.json 331 download   job
urls-archive.max.fan-twitter-@NYPD20Pct-filtered.txt-shallow-20200711-194340-cncc5-00000.warc.gz 3535607 download   job
urls-archive.max.fan-twitter-@NYPD20Pct-filtered.txt-shallow-20200711-194340-cncc5-00000.warc.os.cdx.gz 9293 download
urls-archive.max.fan-twitter-@NYPD20Pct-filtered.txt-shallow-20200711-194340-cncc5-meta.warc.gz 9056 download   job
urls-archive.max.fan-twitter-@NYPD20Pct-filtered.txt-shallow-20200711-194340-cncc5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD20Pct-filtered.txt-shallow-20200711-194340-cncc5-urls.txt 342 download
urls-archive.max.fan-twitter-@NYPD20Pct-filtered.txt-shallow-20200711-194340-cncc5.json 333 download   job
urls-archive.max.fan-twitter-@NYPD23Pct-filtered.txt-shallow-20200711-194320-e1rq3-00000.warc.gz 3041149 download   job
urls-archive.max.fan-twitter-@NYPD23Pct-filtered.txt-shallow-20200711-194320-e1rq3-00000.warc.os.cdx.gz 9830 download
urls-archive.max.fan-twitter-@NYPD23Pct-filtered.txt-shallow-20200711-194320-e1rq3-meta.warc.gz 9417 download   job
urls-archive.max.fan-twitter-@NYPD23Pct-filtered.txt-shallow-20200711-194320-e1rq3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD23Pct-filtered.txt-shallow-20200711-194320-e1rq3-urls.txt 399 download
urls-archive.max.fan-twitter-@NYPD23Pct-filtered.txt-shallow-20200711-194320-e1rq3.json 333 download   job
urls-archive.max.fan-twitter-@NYPD24Pct-filtered.txt-shallow-20200711-194320-6unom-00000.warc.gz 8236866 download   job
urls-archive.max.fan-twitter-@NYPD24Pct-filtered.txt-shallow-20200711-194320-6unom-00000.warc.os.cdx.gz 18263 download
urls-archive.max.fan-twitter-@NYPD24Pct-filtered.txt-shallow-20200711-194320-6unom-meta.warc.gz 13881 download   job
urls-archive.max.fan-twitter-@NYPD24Pct-filtered.txt-shallow-20200711-194320-6unom-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD24Pct-filtered.txt-shallow-20200711-194320-6unom-urls.txt 1938 download
urls-archive.max.fan-twitter-@NYPD24Pct-filtered.txt-shallow-20200711-194320-6unom.json 333 download   job
urls-archive.max.fan-twitter-@NYPD25Pct-filtered.txt-shallow-20200711-194315-7cjef-00000.warc.gz 3861531 download   job
urls-archive.max.fan-twitter-@NYPD25Pct-filtered.txt-shallow-20200711-194315-7cjef-00000.warc.os.cdx.gz 9716 download
urls-archive.max.fan-twitter-@NYPD25Pct-filtered.txt-shallow-20200711-194315-7cjef-meta.warc.gz 9369 download   job
urls-archive.max.fan-twitter-@NYPD25Pct-filtered.txt-shallow-20200711-194315-7cjef-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD25Pct-filtered.txt-shallow-20200711-194315-7cjef-urls.txt 456 download
urls-archive.max.fan-twitter-@NYPD25Pct-filtered.txt-shallow-20200711-194315-7cjef.json 333 download   job
urls-archive.max.fan-twitter-@NYPD26Pct-filtered.txt-shallow-20200711-194315-3ziu7-00000.warc.gz 6147224 download   job
urls-archive.max.fan-twitter-@NYPD26Pct-filtered.txt-shallow-20200711-194315-3ziu7-00000.warc.os.cdx.gz 11667 download
urls-archive.max.fan-twitter-@NYPD26Pct-filtered.txt-shallow-20200711-194315-3ziu7-meta.warc.gz 10448 download   job
urls-archive.max.fan-twitter-@NYPD26Pct-filtered.txt-shallow-20200711-194315-3ziu7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD26Pct-filtered.txt-shallow-20200711-194315-3ziu7-urls.txt 741 download
urls-archive.max.fan-twitter-@NYPD26Pct-filtered.txt-shallow-20200711-194315-3ziu7.json 333 download   job
urls-archive.max.fan-twitter-@NYPD28Pct-filtered.txt-shallow-20200711-194248-ad7h6-00000.warc.gz 6328029 download   job
urls-archive.max.fan-twitter-@NYPD28Pct-filtered.txt-shallow-20200711-194248-ad7h6-00000.warc.os.cdx.gz 11380 download
urls-archive.max.fan-twitter-@NYPD28Pct-filtered.txt-shallow-20200711-194248-ad7h6-meta.warc.gz 10254 download   job
urls-archive.max.fan-twitter-@NYPD28Pct-filtered.txt-shallow-20200711-194248-ad7h6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD28Pct-filtered.txt-shallow-20200711-194248-ad7h6-urls.txt 1140 download
urls-archive.max.fan-twitter-@NYPD28Pct-filtered.txt-shallow-20200711-194248-ad7h6.json 333 download   job
urls-archive.max.fan-twitter-@NYPD30Pct-filtered.txt-shallow-20200711-194247-1oavc-00000.warc.gz 6997170 download   job
urls-archive.max.fan-twitter-@NYPD30Pct-filtered.txt-shallow-20200711-194247-1oavc-00000.warc.os.cdx.gz 12923 download
urls-archive.max.fan-twitter-@NYPD30Pct-filtered.txt-shallow-20200711-194247-1oavc-meta.warc.gz 11091 download   job
urls-archive.max.fan-twitter-@NYPD30Pct-filtered.txt-shallow-20200711-194247-1oavc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD30Pct-filtered.txt-shallow-20200711-194247-1oavc-urls.txt 1025 download
urls-archive.max.fan-twitter-@NYPD30Pct-filtered.txt-shallow-20200711-194247-1oavc.json 333 download   job
urls-archive.max.fan-twitter-@NYPD32Pct-filtered.txt-shallow-20200711-194246-5qyhd-00000.warc.gz 2875973 download   job
urls-archive.max.fan-twitter-@NYPD32Pct-filtered.txt-shallow-20200711-194246-5qyhd-00000.warc.os.cdx.gz 9142 download
urls-archive.max.fan-twitter-@NYPD32Pct-filtered.txt-shallow-20200711-194246-5qyhd-meta.warc.gz 9072 download   job
urls-archive.max.fan-twitter-@NYPD32Pct-filtered.txt-shallow-20200711-194246-5qyhd-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD32Pct-filtered.txt-shallow-20200711-194246-5qyhd-urls.txt 570 download
urls-archive.max.fan-twitter-@NYPD32Pct-filtered.txt-shallow-20200711-194246-5qyhd.json 333 download   job
urls-archive.max.fan-twitter-@NYPD34Pct-filtered.txt-shallow-20200711-194246-b9ffe-00000.warc.gz 2094416 download   job
urls-archive.max.fan-twitter-@NYPD34Pct-filtered.txt-shallow-20200711-194246-b9ffe-00000.warc.os.cdx.gz 7877 download
urls-archive.max.fan-twitter-@NYPD34Pct-filtered.txt-shallow-20200711-194246-b9ffe-meta.warc.gz 8352 download   job
urls-archive.max.fan-twitter-@NYPD34Pct-filtered.txt-shallow-20200711-194246-b9ffe-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD34Pct-filtered.txt-shallow-20200711-194246-b9ffe-urls.txt 342 download
urls-archive.max.fan-twitter-@NYPD34Pct-filtered.txt-shallow-20200711-194246-b9ffe.json 333 download   job
urls-archive.max.fan-twitter-@NYPD40Pct-filtered.txt-shallow-20200711-194246-3k8cw-00000.warc.gz 2232255 download   job
urls-archive.max.fan-twitter-@NYPD40Pct-filtered.txt-shallow-20200711-194246-3k8cw-00000.warc.os.cdx.gz 6695 download
urls-archive.max.fan-twitter-@NYPD40Pct-filtered.txt-shallow-20200711-194246-3k8cw-meta.warc.gz 7675 download   job
urls-archive.max.fan-twitter-@NYPD40Pct-filtered.txt-shallow-20200711-194246-3k8cw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD40Pct-filtered.txt-shallow-20200711-194246-3k8cw-urls.txt 285 download
urls-archive.max.fan-twitter-@NYPD40Pct-filtered.txt-shallow-20200711-194246-3k8cw.json 333 download   job
urls-archive.max.fan-twitter-@NYPD41Pct-filtered.txt-shallow-20200711-194202-bbo9f-00000.warc.gz 7968562 download   job
urls-archive.max.fan-twitter-@NYPD41Pct-filtered.txt-shallow-20200711-194202-bbo9f-00000.warc.os.cdx.gz 15173 download
urls-archive.max.fan-twitter-@NYPD41Pct-filtered.txt-shallow-20200711-194202-bbo9f-meta.warc.gz 12294 download   job
urls-archive.max.fan-twitter-@NYPD41Pct-filtered.txt-shallow-20200711-194202-bbo9f-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD41Pct-filtered.txt-shallow-20200711-194202-bbo9f-urls.txt 1139 download
urls-archive.max.fan-twitter-@NYPD41Pct-filtered.txt-shallow-20200711-194202-bbo9f.json 333 download   job
urls-archive.max.fan-twitter-@NYPD42Pct-filtered.txt-shallow-20200711-194202-9khmz-00000.warc.gz 4912560 download   job
urls-archive.max.fan-twitter-@NYPD42Pct-filtered.txt-shallow-20200711-194202-9khmz-00000.warc.os.cdx.gz 13674 download
urls-archive.max.fan-twitter-@NYPD42Pct-filtered.txt-shallow-20200711-194202-9khmz-meta.warc.gz 11535 download   job
urls-archive.max.fan-twitter-@NYPD42Pct-filtered.txt-shallow-20200711-194202-9khmz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD42Pct-filtered.txt-shallow-20200711-194202-9khmz-urls.txt 969 download
urls-archive.max.fan-twitter-@NYPD42Pct-filtered.txt-shallow-20200711-194202-9khmz.json 333 download   job
urls-archive.max.fan-twitter-@NYPD43Pct-filtered.txt-shallow-20200711-194159-8tz32-00000.warc.gz 4399612 download   job
urls-archive.max.fan-twitter-@NYPD43Pct-filtered.txt-shallow-20200711-194159-8tz32-00000.warc.os.cdx.gz 12499 download
urls-archive.max.fan-twitter-@NYPD43Pct-filtered.txt-shallow-20200711-194159-8tz32-meta.warc.gz 10875 download   job
urls-archive.max.fan-twitter-@NYPD43Pct-filtered.txt-shallow-20200711-194159-8tz32-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD43Pct-filtered.txt-shallow-20200711-194159-8tz32-urls.txt 968 download
urls-archive.max.fan-twitter-@NYPD43Pct-filtered.txt-shallow-20200711-194159-8tz32.json 333 download   job
urls-archive.max.fan-twitter-@NYPD44Pct-filtered.txt-shallow-20200711-194158-elgsk-00000.warc.gz 5799606 download   job
urls-archive.max.fan-twitter-@NYPD44Pct-filtered.txt-shallow-20200711-194158-elgsk-00000.warc.os.cdx.gz 14776 download
urls-archive.max.fan-twitter-@NYPD44Pct-filtered.txt-shallow-20200711-194158-elgsk-meta.warc.gz 12065 download   job
urls-archive.max.fan-twitter-@NYPD44Pct-filtered.txt-shallow-20200711-194158-elgsk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD44Pct-filtered.txt-shallow-20200711-194158-elgsk-urls.txt 912 download
urls-archive.max.fan-twitter-@NYPD44Pct-filtered.txt-shallow-20200711-194158-elgsk.json 333 download   job
urls-archive.max.fan-twitter-@NYPD45Pct-filtered.txt-shallow-20200711-194041-4rm5v-00000.warc.gz 3364335 download   job
urls-archive.max.fan-twitter-@NYPD45Pct-filtered.txt-shallow-20200711-194041-4rm5v-00000.warc.os.cdx.gz 12265 download
urls-archive.max.fan-twitter-@NYPD45Pct-filtered.txt-shallow-20200711-194041-4rm5v-meta.warc.gz 10736 download   job
urls-archive.max.fan-twitter-@NYPD45Pct-filtered.txt-shallow-20200711-194041-4rm5v-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD45Pct-filtered.txt-shallow-20200711-194041-4rm5v-urls.txt 741 download
urls-archive.max.fan-twitter-@NYPD45Pct-filtered.txt-shallow-20200711-194041-4rm5v.json 333 download   job
urls-archive.max.fan-twitter-@NYflyfisher-filtered.txt-shallow-20200711-194536-a0fq1-00000.warc.gz 3719345 download   job
urls-archive.max.fan-twitter-@NYflyfisher-filtered.txt-shallow-20200711-194536-a0fq1-00000.warc.os.cdx.gz 6432 download
urls-archive.max.fan-twitter-@NYflyfisher-filtered.txt-shallow-20200711-194536-a0fq1-meta.warc.gz 7527 download   job
urls-archive.max.fan-twitter-@NYflyfisher-filtered.txt-shallow-20200711-194536-a0fq1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYflyfisher-filtered.txt-shallow-20200711-194536-a0fq1-urls.txt 936 download
urls-archive.max.fan-twitter-@NYflyfisher-filtered.txt-shallow-20200711-194536-a0fq1.json 337 download   job
urls-archive.max.fan-twitter-@NicoleDubre17-filtered.txt-shallow-20200711-203638-a44tk.json 341 download   job
urls-archive.max.fan-twitter-@NimrodAndrew-filtered.txt-shallow-20200711-202101-27kpu-00000.warc.gz 220450317 download   job
urls-archive.max.fan-twitter-@NimrodAndrew-filtered.txt-shallow-20200711-202101-27kpu-00000.warc.os.cdx.gz 204424 download
urls-archive.max.fan-twitter-@NimrodAndrew-filtered.txt-shallow-20200711-202101-27kpu-meta.warc.gz 111482 download   job
urls-archive.max.fan-twitter-@NimrodAndrew-filtered.txt-shallow-20200711-202101-27kpu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NimrodAndrew-filtered.txt-shallow-20200711-202101-27kpu-urls.txt 126385 download
urls-archive.max.fan-twitter-@NortheastNPS-filtered.txt-shallow-20200711-201637-aq61t-00000.warc.gz 25104796 download   job
urls-archive.max.fan-twitter-@NortheastNPS-filtered.txt-shallow-20200711-201637-aq61t-00000.warc.os.cdx.gz 46952 download
urls-archive.max.fan-twitter-@NortheastNPS-filtered.txt-shallow-20200711-201637-aq61t-meta.warc.gz 29675 download   job
urls-archive.max.fan-twitter-@NortheastNPS-filtered.txt-shallow-20200711-201637-aq61t-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NortheastNPS-filtered.txt-shallow-20200711-201637-aq61t-urls.txt 15576 download
urls-archive.max.fan-twitter-@NotifyLA-filtered.txt-shallow-20200711-201611-dmlnb-00000.warc.gz 18205789 download   job
urls-archive.max.fan-twitter-@NotifyLA-filtered.txt-shallow-20200711-201611-dmlnb-00000.warc.os.cdx.gz 67822 download
urls-archive.max.fan-twitter-@NotifyLA-filtered.txt-shallow-20200711-201611-dmlnb-meta.warc.gz 40716 download   job
urls-archive.max.fan-twitter-@NotifyLA-filtered.txt-shallow-20200711-201611-dmlnb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NotifyLA-filtered.txt-shallow-20200711-201611-dmlnb-urls.txt 7961 download
urls-archive.max.fan-twitter-@NuLawLab-filtered.txt-shallow-20200711-200652-2yba6-00000.warc.gz 316463348 download   job
urls-archive.max.fan-twitter-@NuLawLab-filtered.txt-shallow-20200711-200652-2yba6-00000.warc.os.cdx.gz 331773 download
urls-archive.max.fan-twitter-@NuLawLab-filtered.txt-shallow-20200711-200652-2yba6-meta.warc.gz 176885 download   job
urls-archive.max.fan-twitter-@NuLawLab-filtered.txt-shallow-20200711-200652-2yba6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NuLawLab-filtered.txt-shallow-20200711-200652-2yba6-urls.txt 267129 download
urls-archive.max.fan-twitter-@NutmegNews-filtered.txt-shallow-20200711-200651-4ngwn-00000.warc.gz 207862119 download   job
urls-archive.max.fan-twitter-@NutmegNews-filtered.txt-shallow-20200711-200651-4ngwn-00000.warc.os.cdx.gz 276388 download
urls-archive.max.fan-twitter-@NutmegNews-filtered.txt-shallow-20200711-200651-4ngwn-urls.txt 82794 download
urls-archive.max.fan-twitter-@NutmegNews-filtered.txt-shallow-20200711-200651-4ngwn.json 335 download   job
urls-archive.max.fan-twitter-@arwgilbert-filtered.txt-shallow-20200711-192911-a4r2k-00000.warc.gz 27890410 download   job
urls-archive.max.fan-twitter-@arwgilbert-filtered.txt-shallow-20200711-192911-a4r2k-00000.warc.os.cdx.gz 35047 download
urls-archive.max.fan-twitter-@arwgilbert-filtered.txt-shallow-20200711-192911-a4r2k-meta.warc.gz 23382 download   job
urls-archive.max.fan-twitter-@arwgilbert-filtered.txt-shallow-20200711-192911-a4r2k-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@arwgilbert-filtered.txt-shallow-20200711-192911-a4r2k-urls.txt 11187 download
urls-archive.max.fan-twitter-@arwgilbert-filtered.txt-shallow-20200711-192911-a4r2k.json 335 download   job
urls-archive.max.fan-twitter-@bpdpatrol_145-filtered.txt-shallow-20200711-190714-cpmud-00000.warc.gz 181850442 download   job
urls-archive.max.fan-twitter-@bpdpatrol_145-filtered.txt-shallow-20200711-190714-cpmud-00000.warc.os.cdx.gz 201883 download
urls-archive.max.fan-twitter-@bpdpatrol_145-filtered.txt-shallow-20200711-190714-cpmud-meta.warc.gz 112148 download   job
urls-archive.max.fan-twitter-@bpdpatrol_145-filtered.txt-shallow-20200711-190714-cpmud-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bpdpatrol_145-filtered.txt-shallow-20200711-190714-cpmud-urls.txt 85034 download
urls-archive.max.fan-twitter-@bpdpatrol_145-filtered.txt-shallow-20200711-190714-cpmud.json 341 download   job
urls-archive.max.fan-twitter-@bpdschools-filtered.txt-shallow-20200711-190714-8wmrg-00000.warc.gz 24657490 download   job
urls-archive.max.fan-twitter-@bpdschools-filtered.txt-shallow-20200711-190714-8wmrg-00000.warc.os.cdx.gz 25460 download
urls-archive.max.fan-twitter-@bpdschools-filtered.txt-shallow-20200711-190714-8wmrg-meta.warc.gz 17948 download   job
urls-archive.max.fan-twitter-@bpdschools-filtered.txt-shallow-20200711-190714-8wmrg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bpdschools-filtered.txt-shallow-20200711-190714-8wmrg-urls.txt 4132 download
urls-archive.max.fan-twitter-@bpdschools-filtered.txt-shallow-20200711-190714-8wmrg.json 335 download   job
urls-archive.max.fan-twitter-@burlingtonpd-filtered.txt-shallow-20200711-184910-dky6f-00000.warc.gz 661619956 download   job
urls-archive.max.fan-twitter-@burlingtonpd-filtered.txt-shallow-20200711-184910-dky6f-00000.warc.os.cdx.gz 817557 download
urls-archive.max.fan-twitter-@burlingtonpd-filtered.txt-shallow-20200711-184910-dky6f-meta.warc.gz 440457 download   job
urls-archive.max.fan-twitter-@burlingtonpd-filtered.txt-shallow-20200711-184910-dky6f-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@burlingtonpd-filtered.txt-shallow-20200711-184910-dky6f-urls.txt 425731 download
urls-archive.max.fan-twitter-@burlingtonpd-filtered.txt-shallow-20200711-184910-dky6f.json 339 download   job
urls-archive.max.fan-twitter-@chief_mosley-filtered.txt-shallow-20200711-183924-ar6u8-00000.warc.gz 92160440 download   job
urls-archive.max.fan-twitter-@chief_mosley-filtered.txt-shallow-20200711-183924-ar6u8-00000.warc.os.cdx.gz 150151 download
urls-archive.max.fan-twitter-@chief_mosley-filtered.txt-shallow-20200711-183924-ar6u8-meta.warc.gz 84369 download   job
urls-archive.max.fan-twitter-@chief_mosley-filtered.txt-shallow-20200711-183924-ar6u8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chief_mosley-filtered.txt-shallow-20200711-183924-ar6u8-urls.txt 32820 download
urls-archive.max.fan-twitter-@chief_mosley-filtered.txt-shallow-20200711-183924-ar6u8.json 339 download   job
urls-archive.max.fan-twitter-@chiefagonzalez-filtered.txt-shallow-20200711-184609-5k0rv-00000.warc.gz 145449499 download   job
urls-archive.max.fan-twitter-@chiefagonzalez-filtered.txt-shallow-20200711-184609-5k0rv-00000.warc.os.cdx.gz 185701 download
urls-archive.max.fan-twitter-@chiefagonzalez-filtered.txt-shallow-20200711-184609-5k0rv-meta.warc.gz 103503 download   job
urls-archive.max.fan-twitter-@chiefagonzalez-filtered.txt-shallow-20200711-184609-5k0rv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chiefagonzalez-filtered.txt-shallow-20200711-184609-5k0rv-urls.txt 40858 download
urls-archive.max.fan-twitter-@chiefagonzalez-filtered.txt-shallow-20200711-184609-5k0rv.json 343 download   job
urls-archive.max.fan-twitter-@chiefbradleyupd-filtered.txt-shallow-20200711-184515-9pg1j-00000.warc.gz 22959392 download   job
urls-archive.max.fan-twitter-@chiefbradleyupd-filtered.txt-shallow-20200711-184515-9pg1j-00000.warc.os.cdx.gz 25702 download
urls-archive.max.fan-twitter-@chiefbradleyupd-filtered.txt-shallow-20200711-184515-9pg1j-meta.warc.gz 17975 download   job
urls-archive.max.fan-twitter-@chiefbradleyupd-filtered.txt-shallow-20200711-184515-9pg1j-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chiefbradleyupd-filtered.txt-shallow-20200711-184515-9pg1j.json 345 download   job
urls-archive.max.fan-twitter-@chiefmacsween-filtered.txt-shallow-20200711-184245-3ncpy-00000.warc.gz 9651601 download   job
urls-archive.max.fan-twitter-@chiefmacsween-filtered.txt-shallow-20200711-184245-3ncpy-00000.warc.os.cdx.gz 26124 download
urls-archive.max.fan-twitter-@chiefmacsween-filtered.txt-shallow-20200711-184245-3ncpy-meta.warc.gz 18238 download   job
urls-archive.max.fan-twitter-@chiefmacsween-filtered.txt-shallow-20200711-184245-3ncpy-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chiefmacsween-filtered.txt-shallow-20200711-184245-3ncpy-urls.txt 2013 download
urls-archive.max.fan-twitter-@chiefmacsween-filtered.txt-shallow-20200711-184245-3ncpy.json 341 download   job
urls-archive.max.fan-twitter-@chiefpaulcell-filtered.txt-shallow-20200711-183831-5tv6n-00000.warc.gz 2538 download   job
urls-archive.max.fan-twitter-@chiefpaulcell-filtered.txt-shallow-20200711-183831-5tv6n-00000.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@chiefpaulcell-filtered.txt-shallow-20200711-183831-5tv6n-urls.txt 0 download
urls-archive.max.fan-twitter-@chiefpaulcell-filtered.txt-shallow-20200711-183831-5tv6n.json 341 download   job
urls-archive.max.fan-twitter-@dovermapd-filtered.txt-shallow-20200711-183147-3s7xi-meta.warc.gz 110456 download   job
urls-archive.max.fan-twitter-@dovermapd-filtered.txt-shallow-20200711-183147-3s7xi-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@dovermapd-filtered.txt-shallow-20200711-183147-3s7xi.json 333 download   job
urls-archive.max.fan-twitter-@elpdsro-filtered.txt-shallow-20200711-182812-6xew2-meta.warc.gz 3397 download   job
urls-archive.max.fan-twitter-@elpdsro-filtered.txt-shallow-20200711-182812-6xew2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@elpdsro-filtered.txt-shallow-20200711-182812-6xew2.json 329 download   job
urls-archive.max.fan-twitter-@fema-filtered.txt-shallow-20200711-182633-7gdkv-00000.warc.gz 1400982626 download   job
urls-archive.max.fan-twitter-@fema-filtered.txt-shallow-20200711-182633-7gdkv-00000.warc.os.cdx.gz 3400131 download
urls-archive.max.fan-twitter-@fema-filtered.txt-shallow-20200711-182633-7gdkv-urls.txt 608446 download
urls-archive.max.fan-twitter-@femaregion1-filtered.txt-shallow-20200711-182630-2o5ok-00000.warc.gz 426793904 download   job
urls-archive.max.fan-twitter-@femaregion1-filtered.txt-shallow-20200711-182630-2o5ok-00000.warc.os.cdx.gz 747246 download
urls-archive.max.fan-twitter-@femaregion1-filtered.txt-shallow-20200711-182630-2o5ok-meta.warc.gz 402148 download   job
urls-archive.max.fan-twitter-@femaregion1-filtered.txt-shallow-20200711-182630-2o5ok-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@femaregion1-filtered.txt-shallow-20200711-182630-2o5ok-urls.txt 288030 download
urls-archive.max.fan-twitter-@femaregion1-filtered.txt-shallow-20200711-182630-2o5ok.json 337 download   job
urls-archive.max.fan-twitter-@holyokepolice-filtered.txt-shallow-20200711-181256-9ckp3-00000.warc.gz 44341419 download   job
urls-archive.max.fan-twitter-@holyokepolice-filtered.txt-shallow-20200711-181256-9ckp3-00000.warc.os.cdx.gz 62508 download
urls-archive.max.fan-twitter-@holyokepolice-filtered.txt-shallow-20200711-181256-9ckp3-meta.warc.gz 37826 download   job
urls-archive.max.fan-twitter-@holyokepolice-filtered.txt-shallow-20200711-181256-9ckp3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@holyokepolice-filtered.txt-shallow-20200711-181256-9ckp3-urls.txt 24108 download
urls-archive.max.fan-twitter-@holyokepolice-filtered.txt-shallow-20200711-181256-9ckp3.json 341 download   job
urls-archive.max.fan-twitter-@htownfockey2014-filtered.txt-shallow-20200711-181233-bjqvf-00000.warc.gz 6498476 download   job
urls-archive.max.fan-twitter-@htownfockey2014-filtered.txt-shallow-20200711-181233-bjqvf-00000.warc.os.cdx.gz 12054 download
urls-archive.max.fan-twitter-@htownfockey2014-filtered.txt-shallow-20200711-181233-bjqvf-meta.warc.gz 10647 download   job
urls-archive.max.fan-twitter-@htownfockey2014-filtered.txt-shallow-20200711-181233-bjqvf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@htownfockey2014-filtered.txt-shallow-20200711-181233-bjqvf.json 345 download   job
urls-archive.max.fan-twitter-@htowngrlssoccer-filtered.txt-shallow-20200711-181231-bakxm-00000.warc.gz 2169142 download   job
urls-archive.max.fan-twitter-@htowngrlssoccer-filtered.txt-shallow-20200711-181231-bakxm-00000.warc.os.cdx.gz 5331 download
urls-archive.max.fan-twitter-@htowngrlssoccer-filtered.txt-shallow-20200711-181231-bakxm-meta.warc.gz 6838 download   job
urls-archive.max.fan-twitter-@htowngrlssoccer-filtered.txt-shallow-20200711-181231-bakxm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@htowngrlssoccer-filtered.txt-shallow-20200711-181231-bakxm-urls.txt 744 download
urls-archive.max.fan-twitter-@kpd_policechief-filtered.txt-shallow-20200711-181137-shsvc-00000.warc.gz 5972144 download   job
urls-archive.max.fan-twitter-@kpd_policechief-filtered.txt-shallow-20200711-181137-shsvc-00000.warc.os.cdx.gz 10493 download
urls-archive.max.fan-twitter-@kpd_policechief-filtered.txt-shallow-20200711-181137-shsvc.json 345 download   job
urls-archive.max.fan-twitter-@lawrencepolice-filtered.txt-shallow-20200711-181019-44j9d-00000.warc.gz 1335717 download   job
urls-archive.max.fan-twitter-@lawrencepolice-filtered.txt-shallow-20200711-181019-44j9d-00000.warc.os.cdx.gz 4197 download
urls-archive.max.fan-twitter-@lawrencepolice-filtered.txt-shallow-20200711-181019-44j9d-meta.warc.gz 6201 download   job
urls-archive.max.fan-twitter-@lawrencepolice-filtered.txt-shallow-20200711-181019-44j9d-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lawrencepolice-filtered.txt-shallow-20200711-181019-44j9d.json 343 download   job
urls-archive.max.fan-twitter-@nikolajcw-filtered.txt-shallow-20200711-202114-cw55g-meta.warc.gz 369764 download   job
urls-archive.max.fan-twitter-@nikolajcw-filtered.txt-shallow-20200711-202114-cw55g-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nikolajcw-filtered.txt-shallow-20200711-202114-cw55g-urls.txt 55140 download
urls-archive.max.fan-twitter-@nnasserx-filtered.txt-shallow-20200711-201642-daasb-00000.warc.gz 10299772 download   job
urls-archive.max.fan-twitter-@nnasserx-filtered.txt-shallow-20200711-201642-daasb-00000.warc.os.cdx.gz 10931 download
urls-archive.max.fan-twitter-@nnasserx-filtered.txt-shallow-20200711-201642-daasb-meta.warc.gz 9972 download   job
urls-archive.max.fan-twitter-@nnasserx-filtered.txt-shallow-20200711-201642-daasb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nnasserx-filtered.txt-shallow-20200711-201642-daasb-urls.txt 3960 download
urls-archive.max.fan-twitter-@nnasserx-filtered.txt-shallow-20200711-201642-daasb.json 331 download   job
urls-archive.max.fan-twitter-@noreensnasir-filtered.txt-shallow-20200711-201639-ax0ev-00000.warc.gz 326147503 download   job
urls-archive.max.fan-twitter-@noreensnasir-filtered.txt-shallow-20200711-201639-ax0ev-00000.warc.os.cdx.gz 371362 download
urls-archive.max.fan-twitter-@noreensnasir-filtered.txt-shallow-20200711-201639-ax0ev-meta.warc.gz 199644 download   job
urls-archive.max.fan-twitter-@noreensnasir-filtered.txt-shallow-20200711-201639-ax0ev-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@npmadigan-filtered.txt-shallow-20200711-201428-2he4r-meta.warc.gz 17040 download   job
urls-archive.max.fan-twitter-@npmadigan-filtered.txt-shallow-20200711-201428-2he4r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@npmadigan-filtered.txt-shallow-20200711-201428-2he4r.json 333 download   job
urls-archive.max.fan-twitter-@nymongolia-filtered.txt-shallow-20200711-194535-2ogaj-00000.warc.gz 69680264 download   job
urls-archive.max.fan-twitter-@nymongolia-filtered.txt-shallow-20200711-194535-2ogaj-00000.warc.os.cdx.gz 76000 download
urls-archive.max.fan-twitter-@nymongolia-filtered.txt-shallow-20200711-194535-2ogaj-meta.warc.gz 44987 download   job
urls-archive.max.fan-twitter-@nymongolia-filtered.txt-shallow-20200711-194535-2ogaj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nymongolia-filtered.txt-shallow-20200711-194535-2ogaj-urls.txt 26375 download
urls-archive.max.fan-twitter-@nymongolia-filtered.txt-shallow-20200711-194535-2ogaj.json 335 download   job
urls-archive.max.fan-twitter-@nytimes-filtered.txt-shallow-20200710-213818-4f3nw-00007.warc.gz 5368713580 download   job
urls-archive.max.fan-twitter-@nytimes-filtered.txt-shallow-20200710-213818-4f3nw-00007.warc.os.cdx.gz 14507066 download
urls-transfer.notkiska.pw-suntuubi.com-subdomains-inf-20200105-191743-9m75g-00200.warc.gz 5368862546 download   job
urls-transfer.notkiska.pw-suntuubi.com-subdomains-inf-20200105-191743-9m75g-00200.warc.os.cdx.gz 1887719 download
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00118.warc.gz 5378423588 download   job
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00118.warc.os.cdx.gz 1822640 download
watergoesred.wordpress.com-inf-20200711-173553-bpz6t-00000.warc.gz 1154479710 download   job
watergoesred.wordpress.com-inf-20200711-173553-bpz6t-00000.warc.os.cdx.gz 463823 download
watergoesred.wordpress.com-inf-20200711-173553-bpz6t-meta.warc.gz 312193 download   job
watergoesred.wordpress.com-inf-20200711-173553-bpz6t-meta.warc.os.cdx.gz 47 download
watergoesred.wordpress.com-inf-20200711-173553-bpz6t.json 251 download   job
wilhelmsgames.wordpress.com-inf-20200711-173555-k3zba-00000.warc.gz 1433895468 download   job
wilhelmsgames.wordpress.com-inf-20200711-173555-k3zba-00000.warc.os.cdx.gz 576575 download
wilhelmsgames.wordpress.com-inf-20200711-173555-k3zba-meta.warc.gz 398405 download   job
wilhelmsgames.wordpress.com-inf-20200711-173555-k3zba-meta.warc.os.cdx.gz 47 download
wilhelmsgames.wordpress.com-inf-20200711-173555-k3zba.json 252 download   job
wilper.wordpress.com-inf-20200711-173603-8w7oy-meta.warc.gz 621819 download   job
wilper.wordpress.com-inf-20200711-173603-8w7oy-meta.warc.os.cdx.gz 47 download
wilper.wordpress.com-inf-20200711-173603-8w7oy.json 245 download   job
winstonp.wordpress.com-inf-20200711-173615-6fre4-00000.warc.gz 5434122425 download   job
winstonp.wordpress.com-inf-20200711-173615-6fre4-00000.warc.os.cdx.gz 518561 download
winstonp.wordpress.com-inf-20200711-173615-6fre4-00001.warc.gz 5442139415 download   job
winstonp.wordpress.com-inf-20200711-173615-6fre4-00001.warc.os.cdx.gz 38300 download
winstonp.wordpress.com-inf-20200711-173615-6fre4-00002.warc.gz 5369617507 download   job
winstonp.wordpress.com-inf-20200711-173615-6fre4-00002.warc.os.cdx.gz 454177 download
winstonp.wordpress.com-inf-20200711-173615-6fre4-00003.warc.gz 5394751379 download   job
winstonp.wordpress.com-inf-20200711-173615-6fre4-00003.warc.os.cdx.gz 423248 download
winstonp.wordpress.com-inf-20200711-173615-6fre4-meta.warc.gz 1830190 download   job
winstonp.wordpress.com-inf-20200711-173615-6fre4-meta.warc.os.cdx.gz 47 download
winstonp.wordpress.com-inf-20200711-173615-6fre4.json 247 download   job
www.12377.cn-inf-20200711-122213-b397n-00000.warc.gz 5368719230 download   job
www.12377.cn-inf-20200711-122213-b397n-00000.warc.os.cdx.gz 2756727 download
www.mathway.com-inf-20200610-011458-6sruz-00021.warc.gz 5368732338 download   job
www.mathway.com-inf-20200610-011458-6sruz-00021.warc.os.cdx.gz 20916531 download
www.mudcrutch.com-inf-20200710-231811-ablr0-00001.warc.gz 5369089284 download   job
www.mudcrutch.com-inf-20200710-231811-ablr0-00001.warc.os.cdx.gz 3750494 download
www.qiagen.com-inf-20200621-061202-1wax4-00023.warc.gz 5368851996 download   job
www.qiagen.com-inf-20200621-061202-1wax4-00023.warc.os.cdx.gz 3407486 download
www.raspberrypi.org-inf-20200707-192424-bv6p7-00027.warc.gz 5369649880 download   job
www.raspberrypi.org-inf-20200707-192424-bv6p7-00027.warc.os.cdx.gz 2741934 download