Item archiveteam_archivebot_go_20200711050003

View on Internet Archive

Filename Size
archiveteam_archivebot_go_20200711050003.cdx.gz 89208151 download
archiveteam_archivebot_go_20200711050003.cdx.idx 79701 download
archiveteam_archivebot_go_20200711050003_files.xml 0 download
archiveteam_archivebot_go_20200711050003_meta.sqlite 677888 download
archiveteam_archivebot_go_20200711050003_meta.xml 969 download
arcteryxkorea.tistory.com-inf-20200711-014011-advvs-00000.warc.gz 5368824943 download   job
arcteryxkorea.tistory.com-inf-20200711-014011-advvs-00000.warc.os.cdx.gz 1525379 download
bestgreen.tistory.com-inf-20200711-014136-3cop8-meta.warc.gz 193734 download   job
bestgreen.tistory.com-inf-20200711-014136-3cop8-meta.warc.os.cdx.gz 47 download
bestgreen.tistory.com-inf-20200711-014136-3cop8.json 246 download   job
bestgreen.tistory.com-inf-20200711-014137-culn9-00000.warc.gz 167531404 download   job
bestgreen.tistory.com-inf-20200711-014137-culn9-00000.warc.os.cdx.gz 239613 download
bestgreen.tistory.com-inf-20200711-014137-culn9.json 255 download   job
chang1.tistory.com-inf-20200711-021325-3gu6e-00000.warc.gz 13889162 download   job
chang1.tistory.com-inf-20200711-021325-3gu6e-00000.warc.os.cdx.gz 16067 download
chang1.tistory.com-inf-20200711-021325-3gu6e-meta.warc.gz 19272 download   job
chang1.tistory.com-inf-20200711-021325-3gu6e-meta.warc.os.cdx.gz 47 download
chang1.tistory.com-inf-20200711-021325-3gu6e.json 252 download   job
detailog.tistory.com-inf-20200711-021222-9g4mp-00000.warc.gz 1049748697 download   job
detailog.tistory.com-inf-20200711-021222-9g4mp-00000.warc.os.cdx.gz 1127924 download
detailog.tistory.com-inf-20200711-021222-9g4mp-meta.warc.gz 723576 download   job
detailog.tistory.com-inf-20200711-021222-9g4mp-meta.warc.os.cdx.gz 47 download
detailog.tistory.com-inf-20200711-021222-9g4mp.json 245 download   job
detailog.tistory.com-inf-20200711-021238-f40xd-00000.warc.gz 25445874 download   job
detailog.tistory.com-inf-20200711-021238-f40xd-00000.warc.os.cdx.gz 69379 download
detailog.tistory.com-inf-20200711-021238-f40xd.json 254 download   job
devmae.tistory.com-inf-20200711-000928-1uzcj-00000.warc.gz 3444200268 download   job
devmae.tistory.com-inf-20200711-000928-1uzcj-00000.warc.os.cdx.gz 1686228 download
devmae.tistory.com-inf-20200711-000928-1uzcj-meta.warc.gz 1103404 download   job
devmae.tistory.com-inf-20200711-000928-1uzcj-meta.warc.os.cdx.gz 47 download
devmae.tistory.com-inf-20200711-000928-1uzcj.json 243 download   job
eggnara.tistory.com-inf-20200711-014314-69mu4-meta.warc.gz 86694 download   job
eggnara.tistory.com-inf-20200711-014314-69mu4-meta.warc.os.cdx.gz 47 download
eggnara.tistory.com-inf-20200711-014314-69mu4.json 253 download   job
ektoplazm.com-inf-20200704-233408-66i1h-00022.warc.gz 5575776763 download   job
ektoplazm.com-inf-20200704-233408-66i1h-00022.warc.os.cdx.gz 7131 download
forum.cdaction.pl-inf-20200428-110001-eq14m-00119.warc.gz 5368722578 download   job
forum.cdaction.pl-inf-20200428-110001-eq14m-00119.warc.os.cdx.gz 9726717 download
forums.nextgames.com-inf-20200709-160247-15pvo-00005.warc.gz 5368733635 download   job
forums.nextgames.com-inf-20200709-160247-15pvo-00005.warc.os.cdx.gz 2354902 download
history/files/urls-transfer.notkiska.pw-twitter-%23WorldRefugeeDay-shallow-20200605-213315-5wxzx-00067.warc.gz.~1~ 5424563976 download
history/files/www.12371.cn-inf-20200709-194054-1lotk-00009.warc.gz.~1~ 5397122490 download
ibabo.tistory.com-inf-20200711-000937-ct2mk-00000.warc.gz 535115311 download   job
ibabo.tistory.com-inf-20200711-000937-ct2mk-00000.warc.os.cdx.gz 1080037 download
ibabo.tistory.com-inf-20200711-000937-ct2mk-meta.warc.gz 707358 download   job
ibabo.tistory.com-inf-20200711-000937-ct2mk-meta.warc.os.cdx.gz 47 download
ibabo.tistory.com-inf-20200711-000937-ct2mk.json 242 download   job
insp.tistory.com-inf-20200711-014159-3hq7h-00000.warc.gz 1013738705 download   job
insp.tistory.com-inf-20200711-014159-3hq7h-00000.warc.os.cdx.gz 693738 download
insp.tistory.com-inf-20200711-014159-3hq7h-meta.warc.gz 435623 download   job
insp.tistory.com-inf-20200711-014159-3hq7h-meta.warc.os.cdx.gz 47 download
insp.tistory.com-inf-20200711-014159-3hq7h.json 241 download   job
insp.tistory.com-inf-20200711-014234-lyad1-00000.warc.gz 75464241 download   job
insp.tistory.com-inf-20200711-014234-lyad1-00000.warc.os.cdx.gz 335402 download
insp.tistory.com-inf-20200711-014234-lyad1-meta.warc.gz 251136 download   job
insp.tistory.com-inf-20200711-014234-lyad1-meta.warc.os.cdx.gz 47 download
insp.tistory.com-inf-20200711-014234-lyad1.json 250 download   job
luc.devroye.org-inf-20200629-195003-6kmq5-00047.warc.gz 5372603008 download   job
luc.devroye.org-inf-20200629-195003-6kmq5-00047.warc.os.cdx.gz 4177712 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00088.warc.gz 5661833759 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00088.warc.os.cdx.gz 5181 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00089.warc.gz 5788607280 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00089.warc.os.cdx.gz 2439 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00090.warc.gz 6024332912 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00090.warc.os.cdx.gz 3129 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00091.warc.gz 5552453651 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00091.warc.os.cdx.gz 2603 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00092.warc.gz 5800923218 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00092.warc.os.cdx.gz 46758 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00093.warc.gz 5555135557 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00093.warc.os.cdx.gz 20434 download
new.12377.cn-inf-20200710-201841-4uz15-00001.warc.gz 5443099915 download   job
new.12377.cn-inf-20200710-201841-4uz15-00001.warc.os.cdx.gz 1568329 download
player.fm-inf-20200501-233943-6recr-00679.warc.gz 5475921143 download   job
player.fm-inf-20200501-233943-6recr-00679.warc.os.cdx.gz 972891 download
rideup.tistory.com-inf-20200711-001006-5beu9-00000.warc.gz 1445609267 download   job
rideup.tistory.com-inf-20200711-001006-5beu9-00000.warc.os.cdx.gz 1024641 download
rideup.tistory.com-inf-20200711-001006-5beu9-meta.warc.gz 696869 download   job
rideup.tistory.com-inf-20200711-001006-5beu9-meta.warc.os.cdx.gz 47 download
serimbook.tistory.com-inf-20200711-005418-7a8ga-00000.warc.gz 357488274 download   job
serimbook.tistory.com-inf-20200711-005418-7a8ga-00000.warc.os.cdx.gz 625721 download
serimbook.tistory.com-inf-20200711-005418-7a8ga-meta.warc.gz 420917 download   job
serimbook.tistory.com-inf-20200711-005418-7a8ga-meta.warc.os.cdx.gz 47 download
serimbook.tistory.com-inf-20200711-005418-7a8ga.json 246 download   job
sh1r.tistory.com-inf-20200711-005427-4muz8-00000.warc.gz 993342965 download   job
sh1r.tistory.com-inf-20200711-005427-4muz8-00000.warc.os.cdx.gz 1554091 download
sh1r.tistory.com-inf-20200711-005427-4muz8-meta.warc.gz 1012855 download   job
sh1r.tistory.com-inf-20200711-005427-4muz8-meta.warc.os.cdx.gz 47 download
sh1r.tistory.com-inf-20200711-005427-4muz8.json 241 download   job
urls-archive.max.fan-jobs.txt-shallow-20200711-045009-78icp-00000.warc.gz 97380942 download   job
urls-archive.max.fan-jobs.txt-shallow-20200711-045009-78icp-00000.warc.os.cdx.gz 93691 download
urls-archive.max.fan-jobs.txt-shallow-20200711-045009-78icp-meta.warc.gz 50799 download   job
urls-archive.max.fan-jobs.txt-shallow-20200711-045009-78icp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-jobs.txt-shallow-20200711-045009-78icp-urls.txt 101615 download
urls-archive.max.fan-jobs.txt-shallow-20200711-045009-78icp.json 282 download   job
urls-archive.max.fan-police.txt-shallow-20200711-045026-ahfjq-00000.warc.gz 7269450 download   job
urls-archive.max.fan-police.txt-shallow-20200711-045026-ahfjq-00000.warc.os.cdx.gz 13646 download
urls-archive.max.fan-police.txt-shallow-20200711-045026-ahfjq-meta.warc.gz 10567 download   job
urls-archive.max.fan-police.txt-shallow-20200711-045026-ahfjq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-police.txt-shallow-20200711-045026-ahfjq-urls.txt 15685 download
urls-archive.max.fan-police.txt-shallow-20200711-045026-ahfjq.json 290 download   job
urls-archive.max.fan-twitter-@MITPolice-filtered.txt-shallow-20200711-034927-1c9b5-00000.warc.gz 152610057 download   job
urls-archive.max.fan-twitter-@MITPolice-filtered.txt-shallow-20200711-034927-1c9b5-00000.warc.os.cdx.gz 249693 download
urls-archive.max.fan-twitter-@MITPolice-filtered.txt-shallow-20200711-034927-1c9b5-meta.warc.gz 137618 download   job
urls-archive.max.fan-twitter-@MITPolice-filtered.txt-shallow-20200711-034927-1c9b5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MITPolice-filtered.txt-shallow-20200711-034927-1c9b5-urls.txt 57594 download
urls-archive.max.fan-twitter-@MITPolice-filtered.txt-shallow-20200711-034927-1c9b5.json 333 download   job
urls-archive.max.fan-twitter-@MaynardPolice-filtered.txt-shallow-20200711-045142-b07d4-00000.warc.gz 38454596 download   job
urls-archive.max.fan-twitter-@MaynardPolice-filtered.txt-shallow-20200711-045142-b07d4-00000.warc.os.cdx.gz 49262 download
urls-archive.max.fan-twitter-@MaynardPolice-filtered.txt-shallow-20200711-045142-b07d4-meta.warc.gz 31020 download   job
urls-archive.max.fan-twitter-@MaynardPolice-filtered.txt-shallow-20200711-045142-b07d4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MaynardPolice-filtered.txt-shallow-20200711-045142-b07d4-urls.txt 17489 download
urls-archive.max.fan-twitter-@MaynardPolice-filtered.txt-shallow-20200711-045142-b07d4.json 341 download   job
urls-archive.max.fan-twitter-@Maynard_MAFire-filtered.txt-shallow-20200711-045143-8dqx7-00000.warc.gz 36104894 download   job
urls-archive.max.fan-twitter-@Maynard_MAFire-filtered.txt-shallow-20200711-045143-8dqx7-00000.warc.os.cdx.gz 43752 download
urls-archive.max.fan-twitter-@Maynard_MAFire-filtered.txt-shallow-20200711-045143-8dqx7-meta.warc.gz 28050 download   job
urls-archive.max.fan-twitter-@Maynard_MAFire-filtered.txt-shallow-20200711-045143-8dqx7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Maynard_MAFire-filtered.txt-shallow-20200711-045143-8dqx7-urls.txt 15597 download
urls-archive.max.fan-twitter-@Maynard_MAFire-filtered.txt-shallow-20200711-045143-8dqx7.json 343 download   job
urls-archive.max.fan-twitter-@MedfieldPolice-filtered.txt-shallow-20200711-044904-8o2m6-00000.warc.gz 85980219 download   job
urls-archive.max.fan-twitter-@MedfieldPolice-filtered.txt-shallow-20200711-044904-8o2m6-00000.warc.os.cdx.gz 99957 download
urls-archive.max.fan-twitter-@MedfieldPolice-filtered.txt-shallow-20200711-044904-8o2m6-meta.warc.gz 58250 download   job
urls-archive.max.fan-twitter-@MedfieldPolice-filtered.txt-shallow-20200711-044904-8o2m6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MedfieldPolice-filtered.txt-shallow-20200711-044904-8o2m6-urls.txt 33533 download
urls-archive.max.fan-twitter-@MedfieldPolice-filtered.txt-shallow-20200711-044904-8o2m6.json 343 download   job
urls-archive.max.fan-twitter-@MerrCtySheriff-filtered.txt-shallow-20200711-044206-cc8re-00000.warc.gz 3339379 download   job
urls-archive.max.fan-twitter-@MerrCtySheriff-filtered.txt-shallow-20200711-044206-cc8re-00000.warc.os.cdx.gz 6193 download
urls-archive.max.fan-twitter-@MerrCtySheriff-filtered.txt-shallow-20200711-044206-cc8re-meta.warc.gz 7435 download   job
urls-archive.max.fan-twitter-@MerrCtySheriff-filtered.txt-shallow-20200711-044206-cc8re-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MerrCtySheriff-filtered.txt-shallow-20200711-044206-cc8re-urls.txt 2257 download
urls-archive.max.fan-twitter-@MerrCtySheriff-filtered.txt-shallow-20200711-044206-cc8re.json 343 download   job
urls-archive.max.fan-twitter-@Merrimack_Rep-filtered.txt-shallow-20200711-044206-f3qi3-00000.warc.gz 219222692 download   job
urls-archive.max.fan-twitter-@Merrimack_Rep-filtered.txt-shallow-20200711-044206-f3qi3-00000.warc.os.cdx.gz 216845 download
urls-archive.max.fan-twitter-@Merrimack_Rep-filtered.txt-shallow-20200711-044206-f3qi3-meta.warc.gz 119176 download   job
urls-archive.max.fan-twitter-@Merrimack_Rep-filtered.txt-shallow-20200711-044206-f3qi3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Merrimack_Rep-filtered.txt-shallow-20200711-044206-f3qi3-urls.txt 129794 download
urls-archive.max.fan-twitter-@Merrimack_Rep-filtered.txt-shallow-20200711-044206-f3qi3.json 341 download   job
urls-archive.max.fan-twitter-@Mheadpolice-filtered.txt-shallow-20200711-044131-7rove-00000.warc.gz 127777108 download   job
urls-archive.max.fan-twitter-@Mheadpolice-filtered.txt-shallow-20200711-044131-7rove-00000.warc.os.cdx.gz 166153 download
urls-archive.max.fan-twitter-@Mheadpolice-filtered.txt-shallow-20200711-044131-7rove-meta.warc.gz 92441 download   job
urls-archive.max.fan-twitter-@Mheadpolice-filtered.txt-shallow-20200711-044131-7rove-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Mheadpolice-filtered.txt-shallow-20200711-044131-7rove-urls.txt 119342 download
urls-archive.max.fan-twitter-@Mheadpolice-filtered.txt-shallow-20200711-044131-7rove.json 337 download   job
urls-archive.max.fan-twitter-@MiddleboroughPD-filtered.txt-shallow-20200711-035452-54nfr-00000.warc.gz 122281811 download   job
urls-archive.max.fan-twitter-@MiddleboroughPD-filtered.txt-shallow-20200711-035452-54nfr-00000.warc.os.cdx.gz 165128 download
urls-archive.max.fan-twitter-@MiddleboroughPD-filtered.txt-shallow-20200711-035452-54nfr-meta.warc.gz 91653 download   job
urls-archive.max.fan-twitter-@MiddleboroughPD-filtered.txt-shallow-20200711-035452-54nfr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MiddleboroughPD-filtered.txt-shallow-20200711-035452-54nfr-urls.txt 53797 download
urls-archive.max.fan-twitter-@MiddleboroughPD-filtered.txt-shallow-20200711-035452-54nfr.json 345 download   job
urls-archive.max.fan-twitter-@MiddletonMaPD-filtered.txt-shallow-20200711-035452-9g6uf-00000.warc.gz 43899363 download   job
urls-archive.max.fan-twitter-@MiddletonMaPD-filtered.txt-shallow-20200711-035452-9g6uf-00000.warc.os.cdx.gz 57192 download
urls-archive.max.fan-twitter-@MiddletonMaPD-filtered.txt-shallow-20200711-035452-9g6uf-meta.warc.gz 35372 download   job
urls-archive.max.fan-twitter-@MiddletonMaPD-filtered.txt-shallow-20200711-035452-9g6uf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MiddletonMaPD-filtered.txt-shallow-20200711-035452-9g6uf-urls.txt 30017 download
urls-archive.max.fan-twitter-@MiddletonMaPD-filtered.txt-shallow-20200711-035452-9g6uf.json 341 download   job
urls-archive.max.fan-twitter-@MillisPolice-filtered.txt-shallow-20200711-035116-6skkz-00000.warc.gz 38789849 download   job
urls-archive.max.fan-twitter-@MillisPolice-filtered.txt-shallow-20200711-035116-6skkz-00000.warc.os.cdx.gz 47745 download
urls-archive.max.fan-twitter-@MillisPolice-filtered.txt-shallow-20200711-035116-6skkz-meta.warc.gz 30048 download   job
urls-archive.max.fan-twitter-@MillisPolice-filtered.txt-shallow-20200711-035116-6skkz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MillisPolice-filtered.txt-shallow-20200711-035116-6skkz-urls.txt 16582 download
urls-archive.max.fan-twitter-@MillisPolice-filtered.txt-shallow-20200711-035116-6skkz.json 339 download   job
urls-archive.max.fan-twitter-@MiltonPolice-filtered.txt-shallow-20200711-035115-er7vr-00000.warc.gz 73431179 download   job
urls-archive.max.fan-twitter-@MiltonPolice-filtered.txt-shallow-20200711-035115-er7vr-00000.warc.os.cdx.gz 117808 download
urls-archive.max.fan-twitter-@MiltonPolice-filtered.txt-shallow-20200711-035115-er7vr-meta.warc.gz 67800 download   job
urls-archive.max.fan-twitter-@MiltonPolice-filtered.txt-shallow-20200711-035115-er7vr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MiltonPolice-filtered.txt-shallow-20200711-035115-er7vr-urls.txt 67785 download
urls-archive.max.fan-twitter-@MiltonPolice-filtered.txt-shallow-20200711-035115-er7vr.json 339 download   job
urls-archive.max.fan-twitter-@NH_StatePolice-filtered.txt-shallow-20200711-032714-dedzx-00000.warc.gz 792202212 download   job
urls-archive.max.fan-twitter-@NH_StatePolice-filtered.txt-shallow-20200711-032714-dedzx-00000.warc.os.cdx.gz 678433 download
urls-archive.max.fan-twitter-@NH_StatePolice-filtered.txt-shallow-20200711-032714-dedzx-meta.warc.gz 362428 download   job
urls-archive.max.fan-twitter-@NH_StatePolice-filtered.txt-shallow-20200711-032714-dedzx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NH_StatePolice-filtered.txt-shallow-20200711-032714-dedzx-urls.txt 241845 download
urls-archive.max.fan-twitter-@NH_StatePolice-filtered.txt-shallow-20200711-032714-dedzx.json 343 download   job
urls-archive.max.fan-twitter-@NMStatePolice-filtered.txt-shallow-20200711-032712-5askd-00000.warc.gz 518385603 download   job
urls-archive.max.fan-twitter-@NMStatePolice-filtered.txt-shallow-20200711-032712-5askd-00000.warc.os.cdx.gz 558987 download
urls-archive.max.fan-twitter-@NMStatePolice-filtered.txt-shallow-20200711-032712-5askd-meta.warc.gz 300015 download   job
urls-archive.max.fan-twitter-@NMStatePolice-filtered.txt-shallow-20200711-032712-5askd-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NMStatePolice-filtered.txt-shallow-20200711-032712-5askd-urls.txt 203831 download
urls-archive.max.fan-twitter-@NMStatePolice-filtered.txt-shallow-20200711-032712-5askd.json 341 download   job
urls-archive.max.fan-twitter-@NYPDnews-filtered.txt-shallow-20200710-230342-d2yyj-00000.warc.gz 4054551802 download   job
urls-archive.max.fan-twitter-@NYPDnews-filtered.txt-shallow-20200710-230342-d2yyj-00000.warc.os.cdx.gz 7545407 download
urls-archive.max.fan-twitter-@NYPDnews-filtered.txt-shallow-20200710-230342-d2yyj-meta.warc.gz 4015957 download   job
urls-archive.max.fan-twitter-@NYPDnews-filtered.txt-shallow-20200710-230342-d2yyj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDnews-filtered.txt-shallow-20200710-230342-d2yyj-urls.txt 1265522 download
urls-archive.max.fan-twitter-@NYPDnews-filtered.txt-shallow-20200710-230342-d2yyj.json 331 download   job
urls-archive.max.fan-twitter-@NahantPolice-filtered.txt-shallow-20200711-034858-8ow7r-00000.warc.gz 12185913 download   job
urls-archive.max.fan-twitter-@NahantPolice-filtered.txt-shallow-20200711-034858-8ow7r-00000.warc.os.cdx.gz 19967 download
urls-archive.max.fan-twitter-@NahantPolice-filtered.txt-shallow-20200711-034858-8ow7r-meta.warc.gz 15233 download   job
urls-archive.max.fan-twitter-@NahantPolice-filtered.txt-shallow-20200711-034858-8ow7r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NahantPolice-filtered.txt-shallow-20200711-034858-8ow7r-urls.txt 9792 download
urls-archive.max.fan-twitter-@NahantPolice-filtered.txt-shallow-20200711-034858-8ow7r.json 339 download   job
urls-archive.max.fan-twitter-@NantucketPolice-filtered.txt-shallow-20200711-033849-sm6pj-00000.warc.gz 230122094 download   job
urls-archive.max.fan-twitter-@NantucketPolice-filtered.txt-shallow-20200711-033849-sm6pj-00000.warc.os.cdx.gz 264919 download
urls-archive.max.fan-twitter-@NantucketPolice-filtered.txt-shallow-20200711-033849-sm6pj-meta.warc.gz 145873 download   job
urls-archive.max.fan-twitter-@NantucketPolice-filtered.txt-shallow-20200711-033849-sm6pj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NantucketPolice-filtered.txt-shallow-20200711-033849-sm6pj-urls.txt 100726 download
urls-archive.max.fan-twitter-@NantucketPolice-filtered.txt-shallow-20200711-033849-sm6pj.json 345 download   job
urls-archive.max.fan-twitter-@NashuaPolice-filtered.txt-shallow-20200711-033644-aashy-00000.warc.gz 353587158 download   job
urls-archive.max.fan-twitter-@NashuaPolice-filtered.txt-shallow-20200711-033644-aashy-00000.warc.os.cdx.gz 380432 download
urls-archive.max.fan-twitter-@NashuaPolice-filtered.txt-shallow-20200711-033644-aashy-meta.warc.gz 206471 download   job
urls-archive.max.fan-twitter-@NashuaPolice-filtered.txt-shallow-20200711-033644-aashy-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NashuaPolice-filtered.txt-shallow-20200711-033644-aashy-urls.txt 114705 download
urls-archive.max.fan-twitter-@NashuaPolice-filtered.txt-shallow-20200711-033644-aashy.json 339 download   job
urls-archive.max.fan-twitter-@NeedhamPolice-filtered.txt-shallow-20200711-033228-1pjp3-00000.warc.gz 49256485 download   job
urls-archive.max.fan-twitter-@NeedhamPolice-filtered.txt-shallow-20200711-033228-1pjp3-00000.warc.os.cdx.gz 72778 download
urls-archive.max.fan-twitter-@NeedhamPolice-filtered.txt-shallow-20200711-033228-1pjp3-meta.warc.gz 43341 download   job
urls-archive.max.fan-twitter-@NeedhamPolice-filtered.txt-shallow-20200711-033228-1pjp3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NeedhamPolice-filtered.txt-shallow-20200711-033228-1pjp3-urls.txt 38249 download
urls-archive.max.fan-twitter-@NeedhamPolice-filtered.txt-shallow-20200711-033228-1pjp3.json 341 download   job
urls-archive.max.fan-twitter-@NewEnglandPD-filtered.txt-shallow-20200711-032805-by1g9-00000.warc.gz 2204397 download   job
urls-archive.max.fan-twitter-@NewEnglandPD-filtered.txt-shallow-20200711-032805-by1g9-00000.warc.os.cdx.gz 5413 download
urls-archive.max.fan-twitter-@NewEnglandPD-filtered.txt-shallow-20200711-032805-by1g9-meta.warc.gz 6906 download   job
urls-archive.max.fan-twitter-@NewEnglandPD-filtered.txt-shallow-20200711-032805-by1g9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NewEnglandPD-filtered.txt-shallow-20200711-032805-by1g9-urls.txt 1003 download
urls-archive.max.fan-twitter-@NewEnglandPD-filtered.txt-shallow-20200711-032805-by1g9.json 339 download   job
urls-archive.max.fan-twitter-@NewtonFireDept-filtered.txt-shallow-20200711-032802-68uvn-00000.warc.gz 300630151 download   job
urls-archive.max.fan-twitter-@NewtonFireDept-filtered.txt-shallow-20200711-032802-68uvn-00000.warc.os.cdx.gz 334434 download
urls-archive.max.fan-twitter-@NewtonFireDept-filtered.txt-shallow-20200711-032802-68uvn-meta.warc.gz 181073 download   job
urls-archive.max.fan-twitter-@NewtonFireDept-filtered.txt-shallow-20200711-032802-68uvn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NewtonFireDept-filtered.txt-shallow-20200711-032802-68uvn-urls.txt 125095 download
urls-archive.max.fan-twitter-@NewtonFireDept-filtered.txt-shallow-20200711-032802-68uvn.json 343 download   job
urls-archive.max.fan-twitter-@NorfolkMAPolice-filtered.txt-shallow-20200711-032613-15umm-00000.warc.gz 18187710 download   job
urls-archive.max.fan-twitter-@NorfolkMAPolice-filtered.txt-shallow-20200711-032613-15umm-00000.warc.os.cdx.gz 31370 download
urls-archive.max.fan-twitter-@NorfolkMAPolice-filtered.txt-shallow-20200711-032613-15umm-meta.warc.gz 21285 download   job
urls-archive.max.fan-twitter-@NorfolkMAPolice-filtered.txt-shallow-20200711-032613-15umm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NorfolkMAPolice-filtered.txt-shallow-20200711-032613-15umm-urls.txt 9192 download
urls-archive.max.fan-twitter-@NorfolkMAPolice-filtered.txt-shallow-20200711-032613-15umm.json 345 download   job
urls-archive.max.fan-twitter-@NorfolkSheriff-filtered.txt-shallow-20200711-032612-9zm74-00000.warc.gz 233100600 download   job
urls-archive.max.fan-twitter-@NorfolkSheriff-filtered.txt-shallow-20200711-032612-9zm74-00000.warc.os.cdx.gz 183994 download
urls-archive.max.fan-twitter-@NorfolkSheriff-filtered.txt-shallow-20200711-032612-9zm74-meta.warc.gz 101070 download   job
urls-archive.max.fan-twitter-@NorfolkSheriff-filtered.txt-shallow-20200711-032612-9zm74-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NorfolkSheriff-filtered.txt-shallow-20200711-032612-9zm74-urls.txt 55607 download
urls-archive.max.fan-twitter-@NorfolkSheriff-filtered.txt-shallow-20200711-032612-9zm74.json 343 download   job
urls-archive.max.fan-twitter-@NorthamptonPD-filtered.txt-shallow-20200711-031412-a4kon-00000.warc.gz 139332664 download   job
urls-archive.max.fan-twitter-@NorthamptonPD-filtered.txt-shallow-20200711-031412-a4kon-00000.warc.os.cdx.gz 160273 download
urls-archive.max.fan-twitter-@NorthamptonPD-filtered.txt-shallow-20200711-031412-a4kon-meta.warc.gz 89629 download   job
urls-archive.max.fan-twitter-@NorthamptonPD-filtered.txt-shallow-20200711-031412-a4kon-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NorthamptonPD-filtered.txt-shallow-20200711-031412-a4kon-urls.txt 50527 download
urls-archive.max.fan-twitter-@NorthamptonPD-filtered.txt-shallow-20200711-031412-a4kon.json 341 download   job
urls-archive.max.fan-twitter-@NorthboroughPD-filtered.txt-shallow-20200711-031409-cqowi-00000.warc.gz 25647098 download   job
urls-archive.max.fan-twitter-@NorthboroughPD-filtered.txt-shallow-20200711-031409-cqowi-00000.warc.os.cdx.gz 38750 download
urls-archive.max.fan-twitter-@NorthboroughPD-filtered.txt-shallow-20200711-031409-cqowi-meta.warc.gz 25297 download   job
urls-archive.max.fan-twitter-@NorthboroughPD-filtered.txt-shallow-20200711-031409-cqowi-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NorthboroughPD-filtered.txt-shallow-20200711-031409-cqowi-urls.txt 18548 download
urls-archive.max.fan-twitter-@NorthboroughPD-filtered.txt-shallow-20200711-031409-cqowi.json 343 download   job
urls-archive.max.fan-twitter-@NortonMaPolice-filtered.txt-shallow-20200711-031343-7j3ur-00000.warc.gz 142150284 download   job
urls-archive.max.fan-twitter-@NortonMaPolice-filtered.txt-shallow-20200711-031343-7j3ur-00000.warc.os.cdx.gz 219795 download
urls-archive.max.fan-twitter-@NortonMaPolice-filtered.txt-shallow-20200711-031343-7j3ur-meta.warc.gz 121763 download   job
urls-archive.max.fan-twitter-@NortonMaPolice-filtered.txt-shallow-20200711-031343-7j3ur-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NortonMaPolice-filtered.txt-shallow-20200711-031343-7j3ur-urls.txt 109730 download
urls-archive.max.fan-twitter-@NortonMaPolice-filtered.txt-shallow-20200711-031343-7j3ur.json 343 download   job
urls-archive.max.fan-twitter-@OEMLowell-filtered.txt-shallow-20200711-030937-btqtz-00000.warc.gz 44203149 download   job
urls-archive.max.fan-twitter-@OEMLowell-filtered.txt-shallow-20200711-030937-btqtz-00000.warc.os.cdx.gz 56494 download
urls-archive.max.fan-twitter-@OEMLowell-filtered.txt-shallow-20200711-030937-btqtz-meta.warc.gz 34980 download   job
urls-archive.max.fan-twitter-@OEMLowell-filtered.txt-shallow-20200711-030937-btqtz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OEMLowell-filtered.txt-shallow-20200711-030937-btqtz-urls.txt 21812 download
urls-archive.max.fan-twitter-@OEMLowell-filtered.txt-shallow-20200711-030937-btqtz.json 333 download   job
urls-archive.max.fan-twitter-@ONS_Chinatown-filtered.txt-shallow-20200711-030936-885x7-00000.warc.gz 92749090 download   job
urls-archive.max.fan-twitter-@ONS_Chinatown-filtered.txt-shallow-20200711-030936-885x7-00000.warc.os.cdx.gz 72398 download
urls-archive.max.fan-twitter-@ONS_Chinatown-filtered.txt-shallow-20200711-030936-885x7-meta.warc.gz 42644 download   job
urls-archive.max.fan-twitter-@ONS_Chinatown-filtered.txt-shallow-20200711-030936-885x7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ONS_Chinatown-filtered.txt-shallow-20200711-030936-885x7-urls.txt 15494 download
urls-archive.max.fan-twitter-@ONS_Chinatown-filtered.txt-shallow-20200711-030936-885x7.json 341 download   job
urls-archive.max.fan-twitter-@ORStatePolice-filtered.txt-shallow-20200711-030932-79o7h-00000.warc.gz 1056699413 download   job
urls-archive.max.fan-twitter-@ORStatePolice-filtered.txt-shallow-20200711-030932-79o7h-00000.warc.os.cdx.gz 1478298 download
urls-archive.max.fan-twitter-@ORStatePolice-filtered.txt-shallow-20200711-030932-79o7h-meta.warc.gz 789530 download   job
urls-archive.max.fan-twitter-@ORStatePolice-filtered.txt-shallow-20200711-030932-79o7h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ORStatePolice-filtered.txt-shallow-20200711-030932-79o7h-urls.txt 449664 download
urls-archive.max.fan-twitter-@ORStatePolice-filtered.txt-shallow-20200711-030932-79o7h.json 341 download   job
urls-archive.max.fan-twitter-@OrleansPolice-filtered.txt-shallow-20200711-030934-9bj4t-00000.warc.gz 206055573 download   job
urls-archive.max.fan-twitter-@OrleansPolice-filtered.txt-shallow-20200711-030934-9bj4t-00000.warc.os.cdx.gz 201270 download
urls-archive.max.fan-twitter-@OrleansPolice-filtered.txt-shallow-20200711-030934-9bj4t-meta.warc.gz 112110 download   job
urls-archive.max.fan-twitter-@OrleansPolice-filtered.txt-shallow-20200711-030934-9bj4t-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OrleansPolice-filtered.txt-shallow-20200711-030934-9bj4t-urls.txt 84100 download
urls-archive.max.fan-twitter-@OrleansPolice-filtered.txt-shallow-20200711-030934-9bj4t.json 341 download   job
urls-archive.max.fan-twitter-@OxfordPD_MA-filtered.txt-shallow-20200711-030933-4czd6-00000.warc.gz 84802963 download   job
urls-archive.max.fan-twitter-@OxfordPD_MA-filtered.txt-shallow-20200711-030933-4czd6-00000.warc.os.cdx.gz 82925 download
urls-archive.max.fan-twitter-@OxfordPD_MA-filtered.txt-shallow-20200711-030933-4czd6-meta.warc.gz 48444 download   job
urls-archive.max.fan-twitter-@OxfordPD_MA-filtered.txt-shallow-20200711-030933-4czd6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OxfordPD_MA-filtered.txt-shallow-20200711-030933-4czd6-urls.txt 20890 download
urls-archive.max.fan-twitter-@OxfordPD_MA-filtered.txt-shallow-20200711-030933-4czd6.json 337 download   job
urls-archive.max.fan-twitter-@PAStatePolice-filtered.txt-shallow-20200711-025532-3oe12-00000.warc.gz 133221345 download   job
urls-archive.max.fan-twitter-@PAStatePolice-filtered.txt-shallow-20200711-025532-3oe12-00000.warc.os.cdx.gz 383191 download
urls-archive.max.fan-twitter-@PAStatePolice-filtered.txt-shallow-20200711-025532-3oe12-meta.warc.gz 208930 download   job
urls-archive.max.fan-twitter-@PAStatePolice-filtered.txt-shallow-20200711-025532-3oe12-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PAStatePolice-filtered.txt-shallow-20200711-025532-3oe12-urls.txt 69164 download
urls-archive.max.fan-twitter-@PAStatePolice-filtered.txt-shallow-20200711-025532-3oe12.json 341 download   job
urls-archive.max.fan-twitter-@PDGeorgetownMA-filtered.txt-shallow-20200711-025532-6l43x-00000.warc.gz 70626479 download   job
urls-archive.max.fan-twitter-@PDGeorgetownMA-filtered.txt-shallow-20200711-025532-6l43x-00000.warc.os.cdx.gz 102457 download
urls-archive.max.fan-twitter-@PDGeorgetownMA-filtered.txt-shallow-20200711-025532-6l43x-meta.warc.gz 59498 download   job
urls-archive.max.fan-twitter-@PDGeorgetownMA-filtered.txt-shallow-20200711-025532-6l43x-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PDGeorgetownMA-filtered.txt-shallow-20200711-025532-6l43x-urls.txt 46535 download
urls-archive.max.fan-twitter-@PDGeorgetownMA-filtered.txt-shallow-20200711-025532-6l43x.json 343 download   job
urls-archive.max.fan-twitter-@PelhamNHPolice-filtered.txt-shallow-20200711-025253-acyg4-00000.warc.gz 432629418 download   job
urls-archive.max.fan-twitter-@PelhamNHPolice-filtered.txt-shallow-20200711-025253-acyg4-00000.warc.os.cdx.gz 439255 download
urls-archive.max.fan-twitter-@PelhamNHPolice-filtered.txt-shallow-20200711-025253-acyg4-meta.warc.gz 235358 download   job
urls-archive.max.fan-twitter-@PelhamNHPolice-filtered.txt-shallow-20200711-025253-acyg4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PelhamNHPolice-filtered.txt-shallow-20200711-025253-acyg4-urls.txt 225382 download
urls-archive.max.fan-twitter-@PelhamNHPolice-filtered.txt-shallow-20200711-025253-acyg4.json 343 download   job
urls-archive.max.fan-twitter-@PembrokePD-filtered.txt-shallow-20200711-025251-9lvt4-00000.warc.gz 37276689 download   job
urls-archive.max.fan-twitter-@PembrokePD-filtered.txt-shallow-20200711-025251-9lvt4-00000.warc.os.cdx.gz 52305 download
urls-archive.max.fan-twitter-@PembrokePD-filtered.txt-shallow-20200711-025251-9lvt4-meta.warc.gz 32328 download   job
urls-archive.max.fan-twitter-@PembrokePD-filtered.txt-shallow-20200711-025251-9lvt4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PembrokePD-filtered.txt-shallow-20200711-025251-9lvt4-urls.txt 27605 download
urls-archive.max.fan-twitter-@PembrokePD-filtered.txt-shallow-20200711-025251-9lvt4.json 335 download   job
urls-archive.max.fan-twitter-@PembrokePolice-filtered.txt-shallow-20200711-025250-6px32-00000.warc.gz 6247315 download   job
urls-archive.max.fan-twitter-@PembrokePolice-filtered.txt-shallow-20200711-025250-6px32-00000.warc.os.cdx.gz 15729 download
urls-archive.max.fan-twitter-@PembrokePolice-filtered.txt-shallow-20200711-025250-6px32-meta.warc.gz 12834 download   job
urls-archive.max.fan-twitter-@PembrokePolice-filtered.txt-shallow-20200711-025250-6px32-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PembrokePolice-filtered.txt-shallow-20200711-025250-6px32-urls.txt 4758 download
urls-archive.max.fan-twitter-@PembrokePolice-filtered.txt-shallow-20200711-025250-6px32.json 343 download   job
urls-archive.max.fan-twitter-@PetershamPolice-filtered.txt-shallow-20200711-025200-7e7i4-00000.warc.gz 1967125 download   job
urls-archive.max.fan-twitter-@PetershamPolice-filtered.txt-shallow-20200711-025200-7e7i4-00000.warc.os.cdx.gz 4590 download
urls-archive.max.fan-twitter-@PetershamPolice-filtered.txt-shallow-20200711-025200-7e7i4-meta.warc.gz 6421 download   job
urls-archive.max.fan-twitter-@PetershamPolice-filtered.txt-shallow-20200711-025200-7e7i4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PetershamPolice-filtered.txt-shallow-20200711-025200-7e7i4-urls.txt 124 download
urls-archive.max.fan-twitter-@PetershamPolice-filtered.txt-shallow-20200711-025200-7e7i4.json 345 download   job
urls-archive.max.fan-twitter-@PlymouthNHPD-filtered.txt-shallow-20200711-025159-5ldgv-meta.warc.gz 8204 download   job
urls-archive.max.fan-twitter-@PlymouthNHPD-filtered.txt-shallow-20200711-025159-5ldgv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PlymouthNHPD-filtered.txt-shallow-20200711-025159-5ldgv-urls.txt 2183 download
urls-archive.max.fan-twitter-@PlymouthSheriff-filtered.txt-shallow-20200711-024517-be8r5-00000.warc.gz 144140517 download   job
urls-archive.max.fan-twitter-@PlymouthSheriff-filtered.txt-shallow-20200711-024517-be8r5-00000.warc.os.cdx.gz 160940 download
urls-archive.max.fan-twitter-@PlymouthSheriff-filtered.txt-shallow-20200711-024517-be8r5-meta.warc.gz 89719 download   job
urls-archive.max.fan-twitter-@PlymouthSheriff-filtered.txt-shallow-20200711-024517-be8r5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PlymouthSheriff-filtered.txt-shallow-20200711-024517-be8r5-urls.txt 66382 download
urls-archive.max.fan-twitter-@PlymouthSheriff-filtered.txt-shallow-20200711-024517-be8r5.json 345 download   job
urls-archive.max.fan-twitter-@Plymouth_Police-filtered.txt-shallow-20200711-024522-695lw-00000.warc.gz 115109917 download   job
urls-archive.max.fan-twitter-@Plymouth_Police-filtered.txt-shallow-20200711-024522-695lw-00000.warc.os.cdx.gz 146118 download
urls-archive.max.fan-twitter-@Plymouth_Police-filtered.txt-shallow-20200711-024522-695lw-urls.txt 27498 download
urls-archive.max.fan-twitter-@RIStatePolice-filtered.txt-shallow-20200711-023611-drtmo-00000.warc.gz 616363888 download   job
urls-archive.max.fan-twitter-@RIStatePolice-filtered.txt-shallow-20200711-023611-drtmo-00000.warc.os.cdx.gz 864963 download
urls-archive.max.fan-twitter-@RIStatePolice-filtered.txt-shallow-20200711-023611-drtmo-meta.warc.gz 463733 download   job
urls-archive.max.fan-twitter-@RIStatePolice-filtered.txt-shallow-20200711-023611-drtmo-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RIStatePolice-filtered.txt-shallow-20200711-023611-drtmo-urls.txt 278039 download
urls-archive.max.fan-twitter-@RIStatePolice-filtered.txt-shallow-20200711-023611-drtmo.json 341 download   job
urls-archive.max.fan-twitter-@RPD02370-filtered.txt-shallow-20200711-023117-f33wk-00000.warc.gz 38678406 download   job
urls-archive.max.fan-twitter-@RPD02370-filtered.txt-shallow-20200711-023117-f33wk-00000.warc.os.cdx.gz 61050 download
urls-archive.max.fan-twitter-@RPD02370-filtered.txt-shallow-20200711-023117-f33wk-urls.txt 18655 download
urls-archive.max.fan-twitter-@RPD02370-filtered.txt-shallow-20200711-023117-f33wk.json 331 download   job
urls-archive.max.fan-twitter-@RaymondPolice-filtered.txt-shallow-20200711-024449-28i8t-00000.warc.gz 2541 download   job
urls-archive.max.fan-twitter-@RaymondPolice-filtered.txt-shallow-20200711-024449-28i8t-00000.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RaymondPolice-filtered.txt-shallow-20200711-024449-28i8t-meta.warc.gz 3414 download   job
urls-archive.max.fan-twitter-@RaymondPolice-filtered.txt-shallow-20200711-024449-28i8t-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RaymondPolice-filtered.txt-shallow-20200711-024449-28i8t-urls.txt 0 download
urls-archive.max.fan-twitter-@RaymondPolice-filtered.txt-shallow-20200711-024449-28i8t.json 341 download   job
urls-archive.max.fan-twitter-@ReadingPolice-filtered.txt-shallow-20200711-024230-4wfxf-00000.warc.gz 265354703 download   job
urls-archive.max.fan-twitter-@ReadingPolice-filtered.txt-shallow-20200711-024230-4wfxf-00000.warc.os.cdx.gz 329256 download
urls-archive.max.fan-twitter-@ReadingPolice-filtered.txt-shallow-20200711-024230-4wfxf-meta.warc.gz 179940 download   job
urls-archive.max.fan-twitter-@ReadingPolice-filtered.txt-shallow-20200711-024230-4wfxf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ReadingPolice-filtered.txt-shallow-20200711-024230-4wfxf-urls.txt 161559 download
urls-archive.max.fan-twitter-@ReadingPolice-filtered.txt-shallow-20200711-024230-4wfxf.json 341 download   job
urls-archive.max.fan-twitter-@RehobothPD-filtered.txt-shallow-20200711-024228-44tqz-00000.warc.gz 37237432 download   job
urls-archive.max.fan-twitter-@RehobothPD-filtered.txt-shallow-20200711-024228-44tqz-00000.warc.os.cdx.gz 55081 download
urls-archive.max.fan-twitter-@RehobothPD-filtered.txt-shallow-20200711-024228-44tqz-meta.warc.gz 33973 download   job
urls-archive.max.fan-twitter-@RehobothPD-filtered.txt-shallow-20200711-024228-44tqz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RepKevinHonan-filtered.txt-shallow-20200711-024039-euj9a-00000.warc.gz 59194799 download   job
urls-archive.max.fan-twitter-@RepKevinHonan-filtered.txt-shallow-20200711-024039-euj9a-00000.warc.os.cdx.gz 88235 download
urls-archive.max.fan-twitter-@RepKevinHonan-filtered.txt-shallow-20200711-024039-euj9a-meta.warc.gz 51625 download   job
urls-archive.max.fan-twitter-@RepKevinHonan-filtered.txt-shallow-20200711-024039-euj9a-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RepKevinHonan-filtered.txt-shallow-20200711-024039-euj9a-urls.txt 22625 download
urls-archive.max.fan-twitter-@RepKevinHonan-filtered.txt-shallow-20200711-024039-euj9a.json 341 download   job
urls-archive.max.fan-twitter-@RepStanley-filtered.txt-shallow-20200711-023823-f2c0u-00000.warc.gz 476170014 download   job
urls-archive.max.fan-twitter-@RepStanley-filtered.txt-shallow-20200711-023823-f2c0u-00000.warc.os.cdx.gz 414150 download
urls-archive.max.fan-twitter-@RepStanley-filtered.txt-shallow-20200711-023823-f2c0u-meta.warc.gz 221015 download   job
urls-archive.max.fan-twitter-@RepStanley-filtered.txt-shallow-20200711-023823-f2c0u-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RepStanley-filtered.txt-shallow-20200711-023823-f2c0u-urls.txt 179603 download
urls-archive.max.fan-twitter-@RepStanley-filtered.txt-shallow-20200711-023823-f2c0u.json 335 download   job
urls-archive.max.fan-twitter-@RepTomConroy-filtered.txt-shallow-20200711-023822-4f84d-00000.warc.gz 26524438 download   job
urls-archive.max.fan-twitter-@RepTomConroy-filtered.txt-shallow-20200711-023822-4f84d-00000.warc.os.cdx.gz 30765 download
urls-archive.max.fan-twitter-@RepTomConroy-filtered.txt-shallow-20200711-023822-4f84d-meta.warc.gz 21055 download   job
urls-archive.max.fan-twitter-@RepTomConroy-filtered.txt-shallow-20200711-023822-4f84d-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RepTomConroy-filtered.txt-shallow-20200711-023822-4f84d-urls.txt 24115 download
urls-archive.max.fan-twitter-@RockSheriffNH-filtered.txt-shallow-20200711-023119-buhk0-meta.warc.gz 52836 download   job
urls-archive.max.fan-twitter-@RockSheriffNH-filtered.txt-shallow-20200711-023119-buhk0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RockSheriffNH-filtered.txt-shallow-20200711-023119-buhk0-urls.txt 65151 download
urls-archive.max.fan-twitter-@RockSheriffNH-filtered.txt-shallow-20200711-023119-buhk0.json 341 download   job
urls-archive.max.fan-twitter-@Rockcountyjail-filtered.txt-shallow-20200711-023610-9iltp-00000.warc.gz 96387740 download   job
urls-archive.max.fan-twitter-@Rockcountyjail-filtered.txt-shallow-20200711-023610-9iltp-00000.warc.os.cdx.gz 67251 download
urls-archive.max.fan-twitter-@Rockcountyjail-filtered.txt-shallow-20200711-023610-9iltp-meta.warc.gz 40444 download   job
urls-archive.max.fan-twitter-@Rockcountyjail-filtered.txt-shallow-20200711-023610-9iltp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Rockcountyjail-filtered.txt-shallow-20200711-023610-9iltp-urls.txt 31239 download
urls-archive.max.fan-twitter-@Rockcountyjail-filtered.txt-shallow-20200711-023610-9iltp.json 343 download   job
urls-archive.max.fan-twitter-@RocklandPolice-filtered.txt-shallow-20200711-023542-ar72h-00000.warc.gz 1044413 download   job
urls-archive.max.fan-twitter-@RocklandPolice-filtered.txt-shallow-20200711-023542-ar72h-00000.warc.os.cdx.gz 4096 download
urls-archive.max.fan-twitter-@RocklandPolice-filtered.txt-shallow-20200711-023542-ar72h-meta.warc.gz 6144 download   job
urls-archive.max.fan-twitter-@RocklandPolice-filtered.txt-shallow-20200711-023542-ar72h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@RocklandPolice-filtered.txt-shallow-20200711-023542-ar72h-urls.txt 159 download
urls-archive.max.fan-twitter-@RocklandPolice-filtered.txt-shallow-20200711-023542-ar72h.json 343 download   job
urls-archive.max.fan-twitter-@Rowley_PD-filtered.txt-shallow-20200711-023118-qqf1i-00000.warc.gz 35873542 download   job
urls-archive.max.fan-twitter-@Rowley_PD-filtered.txt-shallow-20200711-023118-qqf1i-00000.warc.os.cdx.gz 52725 download
urls-archive.max.fan-twitter-@Rowley_PD-filtered.txt-shallow-20200711-023118-qqf1i-meta.warc.gz 33031 download   job
urls-archive.max.fan-twitter-@Rowley_PD-filtered.txt-shallow-20200711-023118-qqf1i-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Rowley_PD-filtered.txt-shallow-20200711-023118-qqf1i-urls.txt 17955 download
urls-archive.max.fan-twitter-@Rowley_PD-filtered.txt-shallow-20200711-023118-qqf1i.json 333 download   job
urls-archive.max.fan-twitter-@SCDPS_PIO-filtered.txt-shallow-20200711-022645-36kxj-meta.warc.gz 237303 download   job
urls-archive.max.fan-twitter-@SCDPS_PIO-filtered.txt-shallow-20200711-022645-36kxj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SDHighwayPatrol-filtered.txt-shallow-20200711-021012-1myzq-00000.warc.gz 885398069 download   job
urls-archive.max.fan-twitter-@SDHighwayPatrol-filtered.txt-shallow-20200711-021012-1myzq-00000.warc.os.cdx.gz 1025343 download
urls-archive.max.fan-twitter-@SDHighwayPatrol-filtered.txt-shallow-20200711-021012-1myzq-meta.warc.gz 546516 download   job
urls-archive.max.fan-twitter-@SDHighwayPatrol-filtered.txt-shallow-20200711-021012-1myzq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SDHighwayPatrol-filtered.txt-shallow-20200711-021012-1myzq-urls.txt 413873 download
urls-archive.max.fan-twitter-@SDHighwayPatrol-filtered.txt-shallow-20200711-021012-1myzq.json 345 download   job
urls-archive.max.fan-twitter-@SPD_HQ-filtered.txt-shallow-20200711-014217-4v4oq.json 327 download   job
urls-archive.max.fan-twitter-@SalemMAPolice-filtered.txt-shallow-20200711-023116-6cdaf-00000.warc.gz 106530542 download   job
urls-archive.max.fan-twitter-@SalemMAPolice-filtered.txt-shallow-20200711-023116-6cdaf-00000.warc.os.cdx.gz 166588 download
urls-archive.max.fan-twitter-@SalemMAPolice-filtered.txt-shallow-20200711-023116-6cdaf-meta.warc.gz 92689 download   job
urls-archive.max.fan-twitter-@SalemMAPolice-filtered.txt-shallow-20200711-023116-6cdaf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SalemMAPolice-filtered.txt-shallow-20200711-023116-6cdaf-urls.txt 56529 download
urls-archive.max.fan-twitter-@SalemMAPolice-filtered.txt-shallow-20200711-023116-6cdaf.json 341 download   job
urls-archive.max.fan-twitter-@SalemNHPolice-filtered.txt-shallow-20200711-023114-98yjt-00000.warc.gz 122687195 download   job
urls-archive.max.fan-twitter-@SalemNHPolice-filtered.txt-shallow-20200711-023114-98yjt-00000.warc.os.cdx.gz 211959 download
urls-archive.max.fan-twitter-@SalemNHPolice-filtered.txt-shallow-20200711-023114-98yjt-meta.warc.gz 118168 download   job
urls-archive.max.fan-twitter-@SalemNHPolice-filtered.txt-shallow-20200711-023114-98yjt-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SalemNHPolice-filtered.txt-shallow-20200711-023114-98yjt-urls.txt 77450 download
urls-archive.max.fan-twitter-@SalemNHPolice-filtered.txt-shallow-20200711-023114-98yjt.json 341 download   job
urls-archive.max.fan-twitter-@SaugusPD-filtered.txt-shallow-20200711-022835-bfukg-00000.warc.gz 22949524 download   job
urls-archive.max.fan-twitter-@SaugusPD-filtered.txt-shallow-20200711-022835-bfukg-00000.warc.os.cdx.gz 53062 download
urls-archive.max.fan-twitter-@SaugusPD-filtered.txt-shallow-20200711-022835-bfukg-meta.warc.gz 33644 download   job
urls-archive.max.fan-twitter-@SaugusPD-filtered.txt-shallow-20200711-022835-bfukg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SaugusPD-filtered.txt-shallow-20200711-022835-bfukg-urls.txt 12757 download
urls-archive.max.fan-twitter-@SaugusPD-filtered.txt-shallow-20200711-022835-bfukg.json 331 download   job
urls-archive.max.fan-twitter-@SbgePolice-filtered.txt-shallow-20200711-022834-9g09r-meta.warc.gz 6590 download   job
urls-archive.max.fan-twitter-@SbgePolice-filtered.txt-shallow-20200711-022834-9g09r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SbgePolice-filtered.txt-shallow-20200711-022834-9g09r-urls.txt 114 download
urls-archive.max.fan-twitter-@SbgePolice-filtered.txt-shallow-20200711-022834-9g09r.json 335 download   job
urls-archive.max.fan-twitter-@ScituatePolice-filtered.txt-shallow-20200711-022644-64c94-00000.warc.gz 18314525 download   job
urls-archive.max.fan-twitter-@ScituatePolice-filtered.txt-shallow-20200711-022644-64c94-00000.warc.os.cdx.gz 34001 download
urls-archive.max.fan-twitter-@ScituatePolice-filtered.txt-shallow-20200711-022644-64c94-meta.warc.gz 22737 download   job
urls-archive.max.fan-twitter-@ScituatePolice-filtered.txt-shallow-20200711-022644-64c94-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ScituatePolice-filtered.txt-shallow-20200711-022644-64c94-urls.txt 11843 download
urls-archive.max.fan-twitter-@ScituatePolice-filtered.txt-shallow-20200711-022644-64c94.json 343 download   job
urls-archive.max.fan-twitter-@SgtDearthHPD-filtered.txt-shallow-20200711-021012-50p2v-00000.warc.gz 569136624 download   job
urls-archive.max.fan-twitter-@SgtDearthHPD-filtered.txt-shallow-20200711-021012-50p2v-00000.warc.os.cdx.gz 527433 download
urls-archive.max.fan-twitter-@SgtDearthHPD-filtered.txt-shallow-20200711-021012-50p2v-meta.warc.gz 282264 download   job
urls-archive.max.fan-twitter-@SgtDearthHPD-filtered.txt-shallow-20200711-021012-50p2v-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SgtDearthHPD-filtered.txt-shallow-20200711-021012-50p2v.json 339 download   job
urls-archive.max.fan-twitter-@SharonMAPolice-filtered.txt-shallow-20200711-020735-6l7mv-meta.warc.gz 146218 download   job
urls-archive.max.fan-twitter-@SharonMAPolice-filtered.txt-shallow-20200711-020735-6l7mv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SharonMAPolice-filtered.txt-shallow-20200711-020735-6l7mv-urls.txt 130364 download
urls-archive.max.fan-twitter-@SharonMAPolice-filtered.txt-shallow-20200711-020735-6l7mv.json 343 download   job
urls-archive.max.fan-twitter-@SherbornMAPD-filtered.txt-shallow-20200711-020733-4auqz-meta.warc.gz 29755 download   job
urls-archive.max.fan-twitter-@SherbornMAPD-filtered.txt-shallow-20200711-020733-4auqz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SherbornMAPD-filtered.txt-shallow-20200711-020733-4auqz-urls.txt 19296 download
urls-archive.max.fan-twitter-@SherbornMAPD-filtered.txt-shallow-20200711-020733-4auqz.json 339 download   job
urls-archive.max.fan-twitter-@SheriffBowler-filtered.txt-shallow-20200711-020640-2vn3f-meta.warc.gz 20693 download   job
urls-archive.max.fan-twitter-@SheriffBowler-filtered.txt-shallow-20200711-020640-2vn3f-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SheriffBowler-filtered.txt-shallow-20200711-020640-2vn3f-urls.txt 18402 download
urls-archive.max.fan-twitter-@ShirleyMAPD-filtered.txt-shallow-20200711-020639-7vw06-00000.warc.gz 8483848 download   job
urls-archive.max.fan-twitter-@ShirleyMAPD-filtered.txt-shallow-20200711-020639-7vw06-00000.warc.os.cdx.gz 13295 download
urls-archive.max.fan-twitter-@ShirleyMAPD-filtered.txt-shallow-20200711-020639-7vw06-meta.warc.gz 11404 download   job
urls-archive.max.fan-twitter-@ShirleyMAPD-filtered.txt-shallow-20200711-020639-7vw06-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ShirleyMAPD-filtered.txt-shallow-20200711-020639-7vw06-urls.txt 5220 download
urls-archive.max.fan-twitter-@ShirleyMAPD-filtered.txt-shallow-20200711-020639-7vw06.json 337 download   job
urls-archive.max.fan-twitter-@SomervillePD-filtered.txt-shallow-20200711-020638-350g8-00000.warc.gz 170600223 download   job
urls-archive.max.fan-twitter-@SomervillePD-filtered.txt-shallow-20200711-020638-350g8-00000.warc.os.cdx.gz 209344 download
urls-archive.max.fan-twitter-@SomervillePD-filtered.txt-shallow-20200711-020638-350g8-meta.warc.gz 116096 download   job
urls-archive.max.fan-twitter-@SomervillePD-filtered.txt-shallow-20200711-020638-350g8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@SomervillePD-filtered.txt-shallow-20200711-020638-350g8-urls.txt 62621 download
urls-archive.max.fan-twitter-@SomervillePD-filtered.txt-shallow-20200711-020638-350g8.json 339 download   job
urls-archive.max.fan-twitter-@TewksburyPD-filtered.txt-shallow-20200711-013416-91pco-00000.warc.gz 400779197 download   job
urls-archive.max.fan-twitter-@TewksburyPD-filtered.txt-shallow-20200711-013416-91pco-00000.warc.os.cdx.gz 483519 download
urls-archive.max.fan-twitter-@TewksburyPD-filtered.txt-shallow-20200711-013416-91pco-meta.warc.gz 262807 download   job
urls-archive.max.fan-twitter-@TewksburyPD-filtered.txt-shallow-20200711-013416-91pco-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@TewksburyPD-filtered.txt-shallow-20200711-013416-91pco-urls.txt 158719 download
urls-archive.max.fan-twitter-@TewksburyPD-filtered.txt-shallow-20200711-013416-91pco.json 337 download   job
urls-archive.max.fan-twitter-@TownofBrookline-filtered.txt-shallow-20200711-012831-1q1xn-meta.warc.gz 268386 download   job
urls-archive.max.fan-twitter-@TownofBrookline-filtered.txt-shallow-20200711-012831-1q1xn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@TownofBrookline-filtered.txt-shallow-20200711-012831-1q1xn-urls.txt 305969 download
urls-archive.max.fan-twitter-@TownofBrookline-filtered.txt-shallow-20200711-012831-1q1xn.json 345 download   job
urls-archive.max.fan-twitter-@UMassLowell-filtered.txt-shallow-20200711-012820-bcjwa-00000.warc.gz 1265930065 download   job
urls-archive.max.fan-twitter-@UMassLowell-filtered.txt-shallow-20200711-012820-bcjwa-00000.warc.os.cdx.gz 1409743 download
urls-archive.max.fan-twitter-@UMassLowell-filtered.txt-shallow-20200711-012820-bcjwa-meta.warc.gz 755248 download   job
urls-archive.max.fan-twitter-@UMassLowell-filtered.txt-shallow-20200711-012820-bcjwa-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@UMassLowell-filtered.txt-shallow-20200711-012820-bcjwa-urls.txt 740905 download
urls-archive.max.fan-twitter-@UMassLowell-filtered.txt-shallow-20200711-012820-bcjwa.json 337 download   job
urls-archive.max.fan-twitter-@USDOT-filtered.txt-shallow-20200711-012401-707qn-00000.warc.gz 561036299 download   job
urls-archive.max.fan-twitter-@USDOT-filtered.txt-shallow-20200711-012401-707qn-00000.warc.os.cdx.gz 1175668 download
urls-archive.max.fan-twitter-@USDOT-filtered.txt-shallow-20200711-012401-707qn-meta.warc.gz 628641 download   job
urls-archive.max.fan-twitter-@USDOT-filtered.txt-shallow-20200711-012401-707qn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@USDOT-filtered.txt-shallow-20200711-012401-707qn-urls.txt 214685 download
urls-archive.max.fan-twitter-@msosheriff-filtered.txt-shallow-20200711-034926-3q36f-00000.warc.gz 628747593 download   job
urls-archive.max.fan-twitter-@msosheriff-filtered.txt-shallow-20200711-034926-3q36f-00000.warc.os.cdx.gz 640771 download
urls-archive.max.fan-twitter-@msosheriff-filtered.txt-shallow-20200711-034926-3q36f-urls.txt 234725 download
urls-archive.max.fan-twitter-@msosheriff-filtered.txt-shallow-20200711-034926-3q36f.json 335 download   job
urls-archive.max.fan-twitter-@newburypd-filtered.txt-shallow-20200711-033228-992ez-00000.warc.gz 1139117 download   job
urls-archive.max.fan-twitter-@newburypd-filtered.txt-shallow-20200711-033228-992ez-00000.warc.os.cdx.gz 4351 download
urls-archive.max.fan-twitter-@newburypd-filtered.txt-shallow-20200711-033228-992ez-meta.warc.gz 6311 download   job
urls-archive.max.fan-twitter-@newburypd-filtered.txt-shallow-20200711-033228-992ez-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@newburypd-filtered.txt-shallow-20200711-033228-992ez-urls.txt 112 download
urls-archive.max.fan-twitter-@newburypd-filtered.txt-shallow-20200711-033228-992ez.json 333 download   job
urls-archive.max.fan-twitter-@nhshtown-filtered.txt-shallow-20200711-032715-befw1-00000.warc.gz 873019 download   job
urls-archive.max.fan-twitter-@nhshtown-filtered.txt-shallow-20200711-032715-befw1-00000.warc.os.cdx.gz 3925 download
urls-archive.max.fan-twitter-@nhshtown-filtered.txt-shallow-20200711-032715-befw1-meta.warc.gz 6059 download   job
urls-archive.max.fan-twitter-@nhshtown-filtered.txt-shallow-20200711-032715-befw1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nhshtown-filtered.txt-shallow-20200711-032715-befw1-urls.txt 55 download
urls-archive.max.fan-twitter-@nhshtown-filtered.txt-shallow-20200711-032715-befw1.json 331 download   job
urls-archive.max.fan-twitter-@north_andover-filtered.txt-shallow-20200711-031410-6qmag-00000.warc.gz 576651003 download   job
urls-archive.max.fan-twitter-@north_andover-filtered.txt-shallow-20200711-031410-6qmag-00000.warc.os.cdx.gz 560572 download
urls-archive.max.fan-twitter-@north_andover-filtered.txt-shallow-20200711-031410-6qmag-meta.warc.gz 306641 download   job
urls-archive.max.fan-twitter-@north_andover-filtered.txt-shallow-20200711-031410-6qmag-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@north_andover-filtered.txt-shallow-20200711-031410-6qmag-urls.txt 353829 download
urls-archive.max.fan-twitter-@north_andover-filtered.txt-shallow-20200711-031410-6qmag.json 341 download   job
urls-archive.max.fan-twitter-@northeasternpd-filtered.txt-shallow-20200711-031344-e4esr-00000.warc.gz 629254188 download   job
urls-archive.max.fan-twitter-@northeasternpd-filtered.txt-shallow-20200711-031344-e4esr-00000.warc.os.cdx.gz 549027 download
urls-archive.max.fan-twitter-@northeasternpd-filtered.txt-shallow-20200711-031344-e4esr-meta.warc.gz 293891 download   job
urls-archive.max.fan-twitter-@northeasternpd-filtered.txt-shallow-20200711-031344-e4esr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@northeasternpd-filtered.txt-shallow-20200711-031344-e4esr-urls.txt 207549 download
urls-archive.max.fan-twitter-@northeasternpd-filtered.txt-shallow-20200711-031344-e4esr.json 343 download   job
urls-archive.max.fan-twitter-@norwellpd-filtered.txt-shallow-20200711-031343-8536z-00000.warc.gz 316614066 download   job
urls-archive.max.fan-twitter-@norwellpd-filtered.txt-shallow-20200711-031343-8536z-00000.warc.os.cdx.gz 307378 download
urls-archive.max.fan-twitter-@norwellpd-filtered.txt-shallow-20200711-031343-8536z-meta.warc.gz 167307 download   job
urls-archive.max.fan-twitter-@norwellpd-filtered.txt-shallow-20200711-031343-8536z-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@norwellpd-filtered.txt-shallow-20200711-031343-8536z-urls.txt 108583 download
urls-archive.max.fan-twitter-@norwellpd-filtered.txt-shallow-20200711-031343-8536z.json 333 download   job
urls-archive.max.fan-twitter-@nottinghampdnh-filtered.txt-shallow-20200711-031220-dk134-00000.warc.gz 14191749 download   job
urls-archive.max.fan-twitter-@nottinghampdnh-filtered.txt-shallow-20200711-031220-dk134-00000.warc.os.cdx.gz 17777 download
urls-archive.max.fan-twitter-@nottinghampdnh-filtered.txt-shallow-20200711-031220-dk134-meta.warc.gz 13841 download   job
urls-archive.max.fan-twitter-@nottinghampdnh-filtered.txt-shallow-20200711-031220-dk134-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nottinghampdnh-filtered.txt-shallow-20200711-031220-dk134-urls.txt 10099 download
urls-archive.max.fan-twitter-@nottinghampdnh-filtered.txt-shallow-20200711-031220-dk134.json 343 download   job
urls-archive.max.fan-twitter-@nyspolice-filtered.txt-shallow-20200711-030937-9a6f6-00000.warc.gz 907113116 download   job
urls-archive.max.fan-twitter-@nyspolice-filtered.txt-shallow-20200711-030937-9a6f6-00000.warc.os.cdx.gz 1353876 download
urls-archive.max.fan-twitter-@nyspolice-filtered.txt-shallow-20200711-030937-9a6f6-urls.txt 546051 download
urls-archive.max.fan-twitter-@nyspolice-filtered.txt-shallow-20200711-030937-9a6f6.json 333 download   job
urls-archive.max.fan-twitter-@nytimes-filtered.txt-shallow-20200710-213818-4f3nw-00000.warc.gz 5368723548 download   job
urls-archive.max.fan-twitter-@nytimes-filtered.txt-shallow-20200710-213818-4f3nw-00000.warc.os.cdx.gz 4264989 download
urls-archive.max.fan-twitter-@pittsfieldnhpd-filtered.txt-shallow-20200711-025201-4zirv-00000.warc.gz 44311719 download   job
urls-archive.max.fan-twitter-@pittsfieldnhpd-filtered.txt-shallow-20200711-025201-4zirv-00000.warc.os.cdx.gz 44173 download
urls-archive.max.fan-twitter-@pittsfieldnhpd-filtered.txt-shallow-20200711-025201-4zirv-meta.warc.gz 27359 download   job
urls-archive.max.fan-twitter-@pittsfieldnhpd-filtered.txt-shallow-20200711-025201-4zirv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@pittsfieldnhpd-filtered.txt-shallow-20200711-025201-4zirv-urls.txt 44343 download
urls-archive.max.fan-twitter-@pittsfieldnhpd-filtered.txt-shallow-20200711-025201-4zirv.json 343 download   job
urls-archive.max.fan-twitter-@rccampuspolice-filtered.txt-shallow-20200711-024447-a4sly-meta.warc.gz 71274 download   job
urls-archive.max.fan-twitter-@rccampuspolice-filtered.txt-shallow-20200711-024447-a4sly-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@rccampuspolice-filtered.txt-shallow-20200711-024447-a4sly.json 343 download   job
urls-archive.max.fan-twitter-@rconnollyHPD46-filtered.txt-shallow-20200711-024447-51nv6-meta.warc.gz 26993 download   job
urls-archive.max.fan-twitter-@rconnollyHPD46-filtered.txt-shallow-20200711-024447-51nv6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@rconnollyHPD46-filtered.txt-shallow-20200711-024447-51nv6-urls.txt 14625 download
urls-archive.max.fan-twitter-@rconnollyHPD46-filtered.txt-shallow-20200711-024447-51nv6.json 343 download   job
urls-archive.max.fan-twitter-@reverepolice-filtered.txt-shallow-20200711-023612-3swhz-00000.warc.gz 139288422 download   job
urls-archive.max.fan-twitter-@reverepolice-filtered.txt-shallow-20200711-023612-3swhz-00000.warc.os.cdx.gz 204850 download
urls-archive.max.fan-twitter-@reverepolice-filtered.txt-shallow-20200711-023612-3swhz-meta.warc.gz 113664 download   job
urls-archive.max.fan-twitter-@reverepolice-filtered.txt-shallow-20200711-023612-3swhz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@reverepolice-filtered.txt-shallow-20200711-023612-3swhz-urls.txt 67192 download
urls-archive.max.fan-twitter-@reverepolice-filtered.txt-shallow-20200711-023612-3swhz.json 339 download   job
urls-archive.max.fan-twitter-@rjPAPD-filtered.txt-shallow-20200711-023610-ar6e8-00000.warc.gz 75402161 download   job
urls-archive.max.fan-twitter-@rjPAPD-filtered.txt-shallow-20200711-023610-ar6e8-00000.warc.os.cdx.gz 99617 download
urls-archive.max.fan-twitter-@rjPAPD-filtered.txt-shallow-20200711-023610-ar6e8-meta.warc.gz 58043 download   job
urls-archive.max.fan-twitter-@rjPAPD-filtered.txt-shallow-20200711-023610-ar6e8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@rjPAPD-filtered.txt-shallow-20200711-023610-ar6e8-urls.txt 16416 download
urls-archive.max.fan-twitter-@rjPAPD-filtered.txt-shallow-20200711-023610-ar6e8.json 327 download   job
urls-transfer.notkiska.pw-facebook-@MACorrections-shallow-20200711-012057-2ronh-00000.warc.gz 1724324953 download   job
urls-transfer.notkiska.pw-facebook-@MACorrections-shallow-20200711-012057-2ronh-00000.warc.os.cdx.gz 685932 download
urls-transfer.notkiska.pw-facebook-@MACorrections-shallow-20200711-012057-2ronh-meta.warc.gz 442482 download   job
urls-transfer.notkiska.pw-facebook-@MACorrections-shallow-20200711-012057-2ronh-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-facebook-@MACorrections-shallow-20200711-012057-2ronh-urls.txt 97750 download
urls-transfer.notkiska.pw-facebook-@MACorrections-shallow-20200711-012057-2ronh.json 340 download   job
urls-transfer.notkiska.pw-suntuubi.com-subdomains-inf-20200105-191743-9m75g-00199.warc.gz 5383449150 download   job
urls-transfer.notkiska.pw-suntuubi.com-subdomains-inf-20200105-191743-9m75g-00199.warc.os.cdx.gz 2163927 download
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00193.warc.gz 5371061766 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00193.warc.os.cdx.gz 2835816 download
urls-transfer.notkiska.pw-twitter-%23BlackTwitter-shallow-20200710-163004-dpwry-00000.warc.gz 5368756701 download   job
urls-transfer.notkiska.pw-twitter-%23BlackTwitter-shallow-20200710-163004-dpwry-00000.warc.os.cdx.gz 7698223 download
urls-transfer.notkiska.pw-twitter-%23WorldRefugeeDay-shallow-20200605-213315-5wxzx-00067.warc.gz 5424563976 download   job
urls-transfer.notkiska.pw-twitter-%23WorldRefugeeDay-shallow-20200605-213315-5wxzx-00067.warc.os.cdx.gz 6261343 download
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00107.warc.gz 5607530354 download   job
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00107.warc.os.cdx.gz 1905383 download
www.12371.cn-inf-20200709-194054-1lotk-00009.warc.gz 5397122490 download   job
www.12371.cn-inf-20200709-194054-1lotk-00009.warc.os.cdx.gz 2150436 download
www.notcot.com-inf-20200709-213423-116f3-00006.warc.gz 5394734151 download   job
www.notcot.com-inf-20200709-213423-116f3-00006.warc.os.cdx.gz 2382877 download
www.peakprosperity.com-shallow-20200711-033441-cx4o6-00000.warc.gz 6243327 download   job
www.peakprosperity.com-shallow-20200711-033441-cx4o6-00000.warc.os.cdx.gz 30501 download
www.peakprosperity.com-shallow-20200711-033441-cx4o6-meta.warc.gz 21118 download   job
www.peakprosperity.com-shallow-20200711-033441-cx4o6-meta.warc.os.cdx.gz 47 download
www.peakprosperity.com-shallow-20200711-033441-cx4o6.json 293 download   job
www.raspberrypi.org-inf-20200707-192424-bv6p7-00024.warc.gz 6003464320 download   job
www.raspberrypi.org-inf-20200707-192424-bv6p7-00024.warc.os.cdx.gz 2057538 download
www.seniorsnews.com.au-inf-20200528-062104-cuuvc-00107.warc.gz 5371711739 download   job
www.seniorsnews.com.au-inf-20200528-062104-cuuvc-00107.warc.os.cdx.gz 4780572 download