Item archiveteam_archivebot_go_20200712230003

View on Internet Archive

Filename Size
archiveteam_archivebot_go_20200712230003.cdx.gz 143804538 download
archiveteam_archivebot_go_20200712230003.cdx.idx 118231 download
archiveteam_archivebot_go_20200712230003_files.xml 0 download
archiveteam_archivebot_go_20200712230003_meta.sqlite 889856 download
archiveteam_archivebot_go_20200712230003_meta.xml 969 download
forums.bohemia.net-inf-20200603-013635-egbvu-00096.warc.gz 5368720453 download   job
forums.bohemia.net-inf-20200603-013635-egbvu-00096.warc.os.cdx.gz 7242315 download
freerepublic.com-inf-20200627-122612-3g9x9-aborted.json 252 download   job
up.frubar.net-shallow-20200712-220001-63t2a.json 260 download   job
urls-archive.max.fan-twitter-@ACNURLagoAgrio-filtered.txt-shallow-20200712-224714-7lhya-00000.warc.gz 4743813 download   job
urls-archive.max.fan-twitter-@ACNURLagoAgrio-filtered.txt-shallow-20200712-224714-7lhya-00000.warc.os.cdx.gz 7966 download
urls-archive.max.fan-twitter-@AGAshleyMoody-filtered.txt-shallow-20200712-224401-7w7s1-00000.warc.gz 11334678 download   job
urls-archive.max.fan-twitter-@AGAshleyMoody-filtered.txt-shallow-20200712-224401-7w7s1-00000.warc.os.cdx.gz 29184 download
urls-archive.max.fan-twitter-@AGLynch-filtered.txt-shallow-20200712-224358-daoem-00000.warc.gz 27513344 download   job
urls-archive.max.fan-twitter-@AGLynch-filtered.txt-shallow-20200712-224358-daoem-00000.warc.os.cdx.gz 107973 download
urls-archive.max.fan-twitter-@ANDORRA_UN-filtered.txt-shallow-20200712-215901-94eg3-00000.warc.gz 2091977 download   job
urls-archive.max.fan-twitter-@ANDORRA_UN-filtered.txt-shallow-20200712-215901-94eg3-00000.warc.os.cdx.gz 6553 download
urls-archive.max.fan-twitter-@ANDORRA_UN-filtered.txt-shallow-20200712-215901-94eg3-meta.warc.gz 7591 download   job
urls-archive.max.fan-twitter-@ANDORRA_UN-filtered.txt-shallow-20200712-215901-94eg3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ANDORRA_UN-filtered.txt-shallow-20200712-215901-94eg3-urls.txt 232 download
urls-archive.max.fan-twitter-@ANDORRA_UN-filtered.txt-shallow-20200712-215901-94eg3.json 335 download   job
urls-archive.max.fan-twitter-@AP-filtered.txt-shallow-20200712-213537-qc7i9-00000.warc.gz 125232745 download   job
urls-archive.max.fan-twitter-@AP-filtered.txt-shallow-20200712-213537-qc7i9-00000.warc.os.cdx.gz 584055 download
urls-archive.max.fan-twitter-@AP-filtered.txt-shallow-20200712-213537-qc7i9-meta.warc.gz 306135 download   job
urls-archive.max.fan-twitter-@AP-filtered.txt-shallow-20200712-213537-qc7i9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AP-filtered.txt-shallow-20200712-213537-qc7i9-urls.txt 27100 download
urls-archive.max.fan-twitter-@AP-filtered.txt-shallow-20200712-213537-qc7i9.json 319 download   job
urls-archive.max.fan-twitter-@APDiploWriter-filtered.txt-shallow-20200712-214252-csrka-00000.warc.gz 14286174 download   job
urls-archive.max.fan-twitter-@APDiploWriter-filtered.txt-shallow-20200712-214252-csrka-00000.warc.os.cdx.gz 67531 download
urls-archive.max.fan-twitter-@APDiploWriter-filtered.txt-shallow-20200712-214252-csrka-meta.warc.gz 40380 download   job
urls-archive.max.fan-twitter-@APDiploWriter-filtered.txt-shallow-20200712-214252-csrka-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@APDiploWriter-filtered.txt-shallow-20200712-214252-csrka-urls.txt 9021 download
urls-archive.max.fan-twitter-@APDiploWriter-filtered.txt-shallow-20200712-214252-csrka.json 341 download   job
urls-archive.max.fan-twitter-@APDonKing-filtered.txt-shallow-20200712-214251-4ybi6-00000.warc.gz 11850323 download   job
urls-archive.max.fan-twitter-@APDonKing-filtered.txt-shallow-20200712-214251-4ybi6-00000.warc.os.cdx.gz 16633 download
urls-archive.max.fan-twitter-@APDonKing-filtered.txt-shallow-20200712-214251-4ybi6-meta.warc.gz 13256 download   job
urls-archive.max.fan-twitter-@APDonKing-filtered.txt-shallow-20200712-214251-4ybi6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@APDonKing-filtered.txt-shallow-20200712-214251-4ybi6-urls.txt 4113 download
urls-archive.max.fan-twitter-@APDonKing-filtered.txt-shallow-20200712-214251-4ybi6.json 333 download   job
urls-archive.max.fan-twitter-@APUkraine-filtered.txt-shallow-20200712-213525-10fy1-00000.warc.gz 28176401 download   job
urls-archive.max.fan-twitter-@APUkraine-filtered.txt-shallow-20200712-213525-10fy1-00000.warc.os.cdx.gz 53866 download
urls-archive.max.fan-twitter-@APUkraine-filtered.txt-shallow-20200712-213525-10fy1-meta.warc.gz 32964 download   job
urls-archive.max.fan-twitter-@APUkraine-filtered.txt-shallow-20200712-213525-10fy1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@APUkraine-filtered.txt-shallow-20200712-213525-10fy1-urls.txt 3477 download
urls-archive.max.fan-twitter-@APUkraine-filtered.txt-shallow-20200712-213525-10fy1.json 333 download   job
urls-archive.max.fan-twitter-@APWally-filtered.txt-shallow-20200712-213105-46yts-00000.warc.gz 7141246 download   job
urls-archive.max.fan-twitter-@APWally-filtered.txt-shallow-20200712-213105-46yts-00000.warc.os.cdx.gz 11524 download
urls-archive.max.fan-twitter-@APWally-filtered.txt-shallow-20200712-213105-46yts-meta.warc.gz 10407 download   job
urls-archive.max.fan-twitter-@APWally-filtered.txt-shallow-20200712-213105-46yts-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@APWally-filtered.txt-shallow-20200712-213105-46yts-urls.txt 5024 download
urls-archive.max.fan-twitter-@APWally-filtered.txt-shallow-20200712-213105-46yts.json 329 download   job
urls-archive.max.fan-twitter-@AP_Travel-filtered.txt-shallow-20200712-213529-6sxfn-meta.warc.gz 951057 download   job
urls-archive.max.fan-twitter-@AP_Travel-filtered.txt-shallow-20200712-213529-6sxfn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AP_Travel-filtered.txt-shallow-20200712-213529-6sxfn-urls.txt 847703 download
urls-archive.max.fan-twitter-@AP_Travel-filtered.txt-shallow-20200712-213529-6sxfn.json 333 download   job
urls-archive.max.fan-twitter-@ASteiner-filtered.txt-shallow-20200712-211613-e2c2y-00000.warc.gz 11183714 download   job
urls-archive.max.fan-twitter-@ASteiner-filtered.txt-shallow-20200712-211613-e2c2y-00000.warc.os.cdx.gz 39373 download
urls-archive.max.fan-twitter-@ASteiner-filtered.txt-shallow-20200712-211613-e2c2y-meta.warc.gz 25109 download   job
urls-archive.max.fan-twitter-@ASteiner-filtered.txt-shallow-20200712-211613-e2c2y-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ASteiner-filtered.txt-shallow-20200712-211613-e2c2y-urls.txt 3415 download
urls-archive.max.fan-twitter-@ASteiner-filtered.txt-shallow-20200712-211613-e2c2y.json 331 download   job
urls-archive.max.fan-twitter-@AVilliers2017-filtered.txt-shallow-20200712-210855-1we0e-00000.warc.gz 10712018 download   job
urls-archive.max.fan-twitter-@AVilliers2017-filtered.txt-shallow-20200712-210855-1we0e-00000.warc.os.cdx.gz 13647 download
urls-archive.max.fan-twitter-@AVilliers2017-filtered.txt-shallow-20200712-210855-1we0e-meta.warc.gz 11777 download   job
urls-archive.max.fan-twitter-@AVilliers2017-filtered.txt-shallow-20200712-210855-1we0e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AVilliers2017-filtered.txt-shallow-20200712-210855-1we0e-urls.txt 7020 download
urls-archive.max.fan-twitter-@AVilliers2017-filtered.txt-shallow-20200712-210855-1we0e.json 341 download   job
urls-archive.max.fan-twitter-@AidejuridiqueON-filtered.txt-shallow-20200712-224345-4rncv-meta.warc.gz 6313 download   job
urls-archive.max.fan-twitter-@AidejuridiqueON-filtered.txt-shallow-20200712-224345-4rncv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AlMissionUN-filtered.txt-shallow-20200712-223717-eix4g-urls.txt 33899 download
urls-archive.max.fan-twitter-@AlMissionUN-filtered.txt-shallow-20200712-223717-eix4g.json 337 download   job
urls-archive.max.fan-twitter-@AlanKahKitLeong-filtered.txt-shallow-20200712-224331-3eh3r-00000.warc.gz 244443257 download   job
urls-archive.max.fan-twitter-@AlanKahKitLeong-filtered.txt-shallow-20200712-224331-3eh3r-00000.warc.os.cdx.gz 327786 download
urls-archive.max.fan-twitter-@AlannaHartzokD9-filtered.txt-shallow-20200712-224301-dpzh5-meta.warc.gz 7388 download   job
urls-archive.max.fan-twitter-@AlannaHartzokD9-filtered.txt-shallow-20200712-224301-dpzh5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AlbankAldawli-filtered.txt-shallow-20200712-224301-d58mh-meta.warc.gz 32129 download   job
urls-archive.max.fan-twitter-@AlbankAldawli-filtered.txt-shallow-20200712-224301-d58mh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Alemaniaparati-filtered.txt-shallow-20200712-224244-3c9jz-meta.warc.gz 20537 download   job
urls-archive.max.fan-twitter-@Alemaniaparati-filtered.txt-shallow-20200712-224244-3c9jz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AlertTampa-filtered.txt-shallow-20200712-224244-2c41x-meta.warc.gz 7400 download   job
urls-archive.max.fan-twitter-@AlertTampa-filtered.txt-shallow-20200712-224244-2c41x-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AlgeriaUN-filtered.txt-shallow-20200712-224132-3he3g-00000.warc.gz 83686807 download   job
urls-archive.max.fan-twitter-@AlgeriaUN-filtered.txt-shallow-20200712-224132-3he3g-00000.warc.os.cdx.gz 111849 download
urls-archive.max.fan-twitter-@AlgeriaUN-filtered.txt-shallow-20200712-224132-3he3g-meta.warc.gz 64380 download   job
urls-archive.max.fan-twitter-@AlgeriaUN-filtered.txt-shallow-20200712-224132-3he3g-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AliHoshow-filtered.txt-shallow-20200712-223717-c4itl-urls.txt 912 download
urls-archive.max.fan-twitter-@AliceThourot-filtered.txt-shallow-20200712-223800-5ik8t-00000.warc.gz 58821064 download   job
urls-archive.max.fan-twitter-@AliceThourot-filtered.txt-shallow-20200712-223800-5ik8t-00000.warc.os.cdx.gz 82621 download
urls-archive.max.fan-twitter-@AlvinYeungnk-filtered.txt-shallow-20200712-223713-6b0rl-meta.warc.gz 45606 download   job
urls-archive.max.fan-twitter-@AlvinYeungnk-filtered.txt-shallow-20200712-223713-6b0rl-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AlyssaMGoodman-filtered.txt-shallow-20200712-223211-46cde.json 343 download   job
urls-archive.max.fan-twitter-@AmandaB1201-filtered.txt-shallow-20200712-223208-dw6c0-meta.warc.gz 100783 download   job
urls-archive.max.fan-twitter-@AmandaB1201-filtered.txt-shallow-20200712-223208-dw6c0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AmbKawar-filtered.txt-shallow-20200712-223145-1vne6-meta.warc.gz 140505 download   job
urls-archive.max.fan-twitter-@AmbKawar-filtered.txt-shallow-20200712-223145-1vne6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AmbKawar-filtered.txt-shallow-20200712-223145-1vne6-urls.txt 49343 download
urls-archive.max.fan-twitter-@AmbMKamau-filtered.txt-shallow-20200712-222248-bswyq-meta.warc.gz 334374 download   job
urls-archive.max.fan-twitter-@AmbMKamau-filtered.txt-shallow-20200712-222248-bswyq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AmbMKamau-filtered.txt-shallow-20200712-222248-bswyq.json 333 download   job
urls-archive.max.fan-twitter-@Ami_Sterling-filtered.txt-shallow-20200712-222043-6pa29.json 339 download   job
urls-archive.max.fan-twitter-@AminaJMohammed-filtered.txt-shallow-20200712-222244-7harr-meta.warc.gz 289604 download   job
urls-archive.max.fan-twitter-@AminaJMohammed-filtered.txt-shallow-20200712-222244-7harr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AminaJMohammed-filtered.txt-shallow-20200712-222244-7harr-urls.txt 40672 download
urls-archive.max.fan-twitter-@Andrej_Kiska-filtered.txt-shallow-20200712-215855-5vwhk-00000.warc.gz 32786207 download   job
urls-archive.max.fan-twitter-@Andrej_Kiska-filtered.txt-shallow-20200712-215855-5vwhk-00000.warc.os.cdx.gz 114898 download
urls-archive.max.fan-twitter-@AndrewDasNYT-filtered.txt-shallow-20200712-215614-7z3gn-00000.warc.gz 16824371 download   job
urls-archive.max.fan-twitter-@AndrewDasNYT-filtered.txt-shallow-20200712-215614-7z3gn-00000.warc.os.cdx.gz 52879 download
urls-archive.max.fan-twitter-@AndrewDasNYT-filtered.txt-shallow-20200712-215614-7z3gn-meta.warc.gz 32902 download   job
urls-archive.max.fan-twitter-@AndrewDasNYT-filtered.txt-shallow-20200712-215614-7z3gn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AndrewDasNYT-filtered.txt-shallow-20200712-215614-7z3gn-urls.txt 7310 download
urls-archive.max.fan-twitter-@AndrewDasNYT-filtered.txt-shallow-20200712-215614-7z3gn.json 339 download   job
urls-archive.max.fan-twitter-@Ann4Congress-filtered.txt-shallow-20200712-215144-3unk5-00000.warc.gz 7263449 download   job
urls-archive.max.fan-twitter-@Ann4Congress-filtered.txt-shallow-20200712-215144-3unk5-00000.warc.os.cdx.gz 13057 download
urls-archive.max.fan-twitter-@Ann4Congress-filtered.txt-shallow-20200712-215144-3unk5-meta.warc.gz 11260 download   job
urls-archive.max.fan-twitter-@Ann4Congress-filtered.txt-shallow-20200712-215144-3unk5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Ann4Congress-filtered.txt-shallow-20200712-215144-3unk5-urls.txt 5310 download
urls-archive.max.fan-twitter-@Ann4Congress-filtered.txt-shallow-20200712-215144-3unk5.json 339 download   job
urls-archive.max.fan-twitter-@AnnaKNelson_DC-filtered.txt-shallow-20200712-215004-asi9j-00000.warc.gz 27501316 download   job
urls-archive.max.fan-twitter-@AnnaKNelson_DC-filtered.txt-shallow-20200712-215004-asi9j-00000.warc.os.cdx.gz 39566 download
urls-archive.max.fan-twitter-@AnnaKNelson_DC-filtered.txt-shallow-20200712-215004-asi9j-meta.warc.gz 25653 download   job
urls-archive.max.fan-twitter-@AnnaKNelson_DC-filtered.txt-shallow-20200712-215004-asi9j-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AnnaKNelson_DC-filtered.txt-shallow-20200712-215004-asi9j-urls.txt 12398 download
urls-archive.max.fan-twitter-@AnnaKNelson_DC-filtered.txt-shallow-20200712-215004-asi9j.json 343 download   job
urls-archive.max.fan-twitter-@AnnaKSteele-filtered.txt-shallow-20200712-214924-585kd-00000.warc.gz 15460176 download   job
urls-archive.max.fan-twitter-@AnnaKSteele-filtered.txt-shallow-20200712-214924-585kd-00000.warc.os.cdx.gz 23055 download
urls-archive.max.fan-twitter-@AnnaKSteele-filtered.txt-shallow-20200712-214924-585kd-meta.warc.gz 16939 download   job
urls-archive.max.fan-twitter-@AnnaKSteele-filtered.txt-shallow-20200712-214924-585kd-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AnnaKSteele-filtered.txt-shallow-20200712-214924-585kd-urls.txt 12365 download
urls-archive.max.fan-twitter-@AnnaKSteele-filtered.txt-shallow-20200712-214924-585kd.json 337 download   job
urls-archive.max.fan-twitter-@AnnickGirardin-filtered.txt-shallow-20200712-214920-epyeg-urls.txt 205270 download
urls-archive.max.fan-twitter-@Ari_NYT-filtered.txt-shallow-20200712-213100-19mup-00000.warc.gz 40528408 download   job
urls-archive.max.fan-twitter-@Ari_NYT-filtered.txt-shallow-20200712-213100-19mup-00000.warc.os.cdx.gz 64381 download
urls-archive.max.fan-twitter-@Ari_NYT-filtered.txt-shallow-20200712-213100-19mup-meta.warc.gz 39566 download   job
urls-archive.max.fan-twitter-@Ari_NYT-filtered.txt-shallow-20200712-213100-19mup-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Ari_NYT-filtered.txt-shallow-20200712-213100-19mup-urls.txt 24989 download
urls-archive.max.fan-twitter-@Ari_NYT-filtered.txt-shallow-20200712-213100-19mup.json 329 download   job
urls-archive.max.fan-twitter-@AsaforAR-filtered.txt-shallow-20200712-212556-83k02-00000.warc.gz 90903869 download   job
urls-archive.max.fan-twitter-@AsaforAR-filtered.txt-shallow-20200712-212556-83k02-00000.warc.os.cdx.gz 130456 download
urls-archive.max.fan-twitter-@AsaforAR-filtered.txt-shallow-20200712-212556-83k02-meta.warc.gz 73968 download   job
urls-archive.max.fan-twitter-@AsaforAR-filtered.txt-shallow-20200712-212556-83k02-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsaforAR-filtered.txt-shallow-20200712-212556-83k02-urls.txt 17193 download
urls-archive.max.fan-twitter-@AsaforAR-filtered.txt-shallow-20200712-212556-83k02.json 331 download   job
urls-archive.max.fan-twitter-@AsmAguiarCurry-filtered.txt-shallow-20200712-212526-eehfq-00000.warc.gz 186366365 download   job
urls-archive.max.fan-twitter-@AsmAguiarCurry-filtered.txt-shallow-20200712-212526-eehfq-00000.warc.os.cdx.gz 146561 download
urls-archive.max.fan-twitter-@AsmAguiarCurry-filtered.txt-shallow-20200712-212526-eehfq-meta.warc.gz 81731 download   job
urls-archive.max.fan-twitter-@AsmAguiarCurry-filtered.txt-shallow-20200712-212526-eehfq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsmAguiarCurry-filtered.txt-shallow-20200712-212526-eehfq-urls.txt 40962 download
urls-archive.max.fan-twitter-@AsmAguiarCurry-filtered.txt-shallow-20200712-212526-eehfq.json 343 download   job
urls-archive.max.fan-twitter-@AsmFrazier-filtered.txt-shallow-20200712-212507-jpbev-00000.warc.gz 5129244 download   job
urls-archive.max.fan-twitter-@AsmFrazier-filtered.txt-shallow-20200712-212507-jpbev-00000.warc.os.cdx.gz 12156 download
urls-archive.max.fan-twitter-@AsmFrazier-filtered.txt-shallow-20200712-212507-jpbev-meta.warc.gz 10699 download   job
urls-archive.max.fan-twitter-@AsmFrazier-filtered.txt-shallow-20200712-212507-jpbev-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsmFrazier-filtered.txt-shallow-20200712-212507-jpbev-urls.txt 1276 download
urls-archive.max.fan-twitter-@AsmFrazier-filtered.txt-shallow-20200712-212507-jpbev.json 335 download   job
urls-archive.max.fan-twitter-@AsmGrayson-filtered.txt-shallow-20200712-212447-80qyf-00000.warc.gz 159547854 download   job
urls-archive.max.fan-twitter-@AsmGrayson-filtered.txt-shallow-20200712-212447-80qyf-00000.warc.os.cdx.gz 170152 download
urls-archive.max.fan-twitter-@AsmGrayson-filtered.txt-shallow-20200712-212447-80qyf-meta.warc.gz 93787 download   job
urls-archive.max.fan-twitter-@AsmGrayson-filtered.txt-shallow-20200712-212447-80qyf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsmGrayson-filtered.txt-shallow-20200712-212447-80qyf-urls.txt 48945 download
urls-archive.max.fan-twitter-@AsmGrayson-filtered.txt-shallow-20200712-212447-80qyf.json 335 download   job
urls-archive.max.fan-twitter-@AsmJesseGabriel-filtered.txt-shallow-20200712-212444-6giyq-00000.warc.gz 79768175 download   job
urls-archive.max.fan-twitter-@AsmJesseGabriel-filtered.txt-shallow-20200712-212444-6giyq-00000.warc.os.cdx.gz 99272 download
urls-archive.max.fan-twitter-@AsmJesseGabriel-filtered.txt-shallow-20200712-212444-6giyq-meta.warc.gz 57055 download   job
urls-archive.max.fan-twitter-@AsmJesseGabriel-filtered.txt-shallow-20200712-212444-6giyq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsmJesseGabriel-filtered.txt-shallow-20200712-212444-6giyq-urls.txt 20412 download
urls-archive.max.fan-twitter-@AsmJesseGabriel-filtered.txt-shallow-20200712-212444-6giyq.json 345 download   job
urls-archive.max.fan-twitter-@AsmMarcLevine-filtered.txt-shallow-20200712-212137-48qne-00000.warc.gz 384838109 download   job
urls-archive.max.fan-twitter-@AsmMarcLevine-filtered.txt-shallow-20200712-212137-48qne-00000.warc.os.cdx.gz 326488 download
urls-archive.max.fan-twitter-@AsmMarcLevine-filtered.txt-shallow-20200712-212137-48qne-meta.warc.gz 176976 download   job
urls-archive.max.fan-twitter-@AsmMarcLevine-filtered.txt-shallow-20200712-212137-48qne-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsmMarcLevine-filtered.txt-shallow-20200712-212137-48qne-urls.txt 111804 download
urls-archive.max.fan-twitter-@AsmMarcLevine-filtered.txt-shallow-20200712-212137-48qne.json 341 download   job
urls-archive.max.fan-twitter-@AsmMoniqueLimon-filtered.txt-shallow-20200712-212136-57nro-00000.warc.gz 251485863 download   job
urls-archive.max.fan-twitter-@AsmMoniqueLimon-filtered.txt-shallow-20200712-212136-57nro-00000.warc.os.cdx.gz 268609 download
urls-archive.max.fan-twitter-@AsmMoniqueLimon-filtered.txt-shallow-20200712-212136-57nro-meta.warc.gz 147095 download   job
urls-archive.max.fan-twitter-@AsmMoniqueLimon-filtered.txt-shallow-20200712-212136-57nro-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsmMoniqueLimon-filtered.txt-shallow-20200712-212136-57nro-urls.txt 65631 download
urls-archive.max.fan-twitter-@AsmMoniqueLimon-filtered.txt-shallow-20200712-212136-57nro.json 345 download   job
urls-archive.max.fan-twitter-@AsmRichardBloom-filtered.txt-shallow-20200712-211633-dlf4p-00000.warc.gz 292899275 download   job
urls-archive.max.fan-twitter-@AsmRichardBloom-filtered.txt-shallow-20200712-211633-dlf4p-00000.warc.os.cdx.gz 377269 download
urls-archive.max.fan-twitter-@AsmRichardBloom-filtered.txt-shallow-20200712-211633-dlf4p-meta.warc.gz 204367 download   job
urls-archive.max.fan-twitter-@AsmRichardBloom-filtered.txt-shallow-20200712-211633-dlf4p-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AsmRichardBloom-filtered.txt-shallow-20200712-211633-dlf4p-urls.txt 115730 download
urls-archive.max.fan-twitter-@AsmRichardBloom-filtered.txt-shallow-20200712-211633-dlf4p.json 345 download   job
urls-archive.max.fan-twitter-@Atayeshe-filtered.txt-shallow-20200712-211613-cd7ly-00000.warc.gz 627528540 download   job
urls-archive.max.fan-twitter-@Atayeshe-filtered.txt-shallow-20200712-211613-cd7ly-00000.warc.os.cdx.gz 972387 download
urls-archive.max.fan-twitter-@Atayeshe-filtered.txt-shallow-20200712-211613-cd7ly-meta.warc.gz 513181 download   job
urls-archive.max.fan-twitter-@Atayeshe-filtered.txt-shallow-20200712-211613-cd7ly-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Atayeshe-filtered.txt-shallow-20200712-211613-cd7ly-urls.txt 189214 download
urls-archive.max.fan-twitter-@Atayeshe-filtered.txt-shallow-20200712-211613-cd7ly.json 331 download   job
urls-archive.max.fan-twitter-@AthinaPrassa-filtered.txt-shallow-20200712-211613-lhed6-00000.warc.gz 1403419 download   job
urls-archive.max.fan-twitter-@AthinaPrassa-filtered.txt-shallow-20200712-211613-lhed6-00000.warc.os.cdx.gz 4824 download
urls-archive.max.fan-twitter-@AthinaPrassa-filtered.txt-shallow-20200712-211613-lhed6-meta.warc.gz 6544 download   job
urls-archive.max.fan-twitter-@AthinaPrassa-filtered.txt-shallow-20200712-211613-lhed6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AthinaPrassa-filtered.txt-shallow-20200712-211613-lhed6-urls.txt 531 download
urls-archive.max.fan-twitter-@AthinaPrassa-filtered.txt-shallow-20200712-211613-lhed6.json 339 download   job
urls-archive.max.fan-twitter-@AusAmbDK-filtered.txt-shallow-20200712-211446-urm8i-00000.warc.gz 234161788 download   job
urls-archive.max.fan-twitter-@AusAmbDK-filtered.txt-shallow-20200712-211446-urm8i-00000.warc.os.cdx.gz 273424 download
urls-archive.max.fan-twitter-@AusAmbDK-filtered.txt-shallow-20200712-211446-urm8i-meta.warc.gz 150069 download   job
urls-archive.max.fan-twitter-@AusAmbDK-filtered.txt-shallow-20200712-211446-urm8i-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AusAmbDK-filtered.txt-shallow-20200712-211446-urm8i-urls.txt 80635 download
urls-archive.max.fan-twitter-@AusAmbDK-filtered.txt-shallow-20200712-211446-urm8i.json 331 download   job
urls-archive.max.fan-twitter-@Austin_TechJobs-filtered.txt-shallow-20200712-211442-bs2du-00000.warc.gz 1072749657 download   job
urls-archive.max.fan-twitter-@Austin_TechJobs-filtered.txt-shallow-20200712-211442-bs2du-00000.warc.os.cdx.gz 1005383 download
urls-archive.max.fan-twitter-@Austin_TechJobs-filtered.txt-shallow-20200712-211442-bs2du.json 345 download   job
urls-archive.max.fan-twitter-@AustriaUN-filtered.txt-shallow-20200712-210901-7whtw-00000.warc.gz 3946494 download   job
urls-archive.max.fan-twitter-@AustriaUN-filtered.txt-shallow-20200712-210901-7whtw-00000.warc.os.cdx.gz 10645 download
urls-archive.max.fan-twitter-@AustriaUN-filtered.txt-shallow-20200712-210901-7whtw-meta.warc.gz 9910 download   job
urls-archive.max.fan-twitter-@AustriaUN-filtered.txt-shallow-20200712-210901-7whtw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@AustriaUN-filtered.txt-shallow-20200712-210901-7whtw-urls.txt 570 download
urls-archive.max.fan-twitter-@AustriaUN-filtered.txt-shallow-20200712-210901-7whtw.json 333 download   job
urls-archive.max.fan-twitter-@BAG_OFSP_UFSP-filtered.txt-shallow-20200712-210458-6zpgy-00000.warc.gz 314235014 download   job
urls-archive.max.fan-twitter-@BAG_OFSP_UFSP-filtered.txt-shallow-20200712-210458-6zpgy-00000.warc.os.cdx.gz 557842 download
urls-archive.max.fan-twitter-@BAG_OFSP_UFSP-filtered.txt-shallow-20200712-210458-6zpgy-meta.warc.gz 297083 download   job
urls-archive.max.fan-twitter-@BAG_OFSP_UFSP-filtered.txt-shallow-20200712-210458-6zpgy-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BAG_OFSP_UFSP-filtered.txt-shallow-20200712-210458-6zpgy-urls.txt 103681 download
urls-archive.max.fan-twitter-@BAG_OFSP_UFSP-filtered.txt-shallow-20200712-210458-6zpgy.json 341 download   job
urls-archive.max.fan-twitter-@BGriveaux-filtered.txt-shallow-20200712-203356-6vb0z-00000.warc.gz 808201914 download   job
urls-archive.max.fan-twitter-@BGriveaux-filtered.txt-shallow-20200712-203356-6vb0z-00000.warc.os.cdx.gz 1583753 download
urls-archive.max.fan-twitter-@BGriveaux-filtered.txt-shallow-20200712-203356-6vb0z-meta.warc.gz 838649 download   job
urls-archive.max.fan-twitter-@BGriveaux-filtered.txt-shallow-20200712-203356-6vb0z-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BGriveaux-filtered.txt-shallow-20200712-203356-6vb0z-urls.txt 185503 download
urls-archive.max.fan-twitter-@BGriveaux-filtered.txt-shallow-20200712-203356-6vb0z.json 333 download   job
urls-archive.max.fan-twitter-@BH_officiel-filtered.txt-shallow-20200712-203354-8vzra-00000.warc.gz 12900900 download   job
urls-archive.max.fan-twitter-@BH_officiel-filtered.txt-shallow-20200712-203354-8vzra-00000.warc.os.cdx.gz 39088 download
urls-archive.max.fan-twitter-@BH_officiel-filtered.txt-shallow-20200712-203354-8vzra-meta.warc.gz 25272 download   job
urls-archive.max.fan-twitter-@BH_officiel-filtered.txt-shallow-20200712-203354-8vzra-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BH_officiel-filtered.txt-shallow-20200712-203354-8vzra-urls.txt 6581 download
urls-archive.max.fan-twitter-@BH_officiel-filtered.txt-shallow-20200712-203354-8vzra.json 337 download   job
urls-archive.max.fan-twitter-@B_BessotBallot-filtered.txt-shallow-20200712-205552-dl94o-00000.warc.gz 144270479 download   job
urls-archive.max.fan-twitter-@B_BessotBallot-filtered.txt-shallow-20200712-205552-dl94o-00000.warc.os.cdx.gz 166766 download
urls-archive.max.fan-twitter-@B_BessotBallot-filtered.txt-shallow-20200712-205552-dl94o-meta.warc.gz 92000 download   job
urls-archive.max.fan-twitter-@B_BessotBallot-filtered.txt-shallow-20200712-205552-dl94o-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@B_BessotBallot-filtered.txt-shallow-20200712-205552-dl94o-urls.txt 36289 download
urls-archive.max.fan-twitter-@B_BessotBallot-filtered.txt-shallow-20200712-205552-dl94o.json 343 download   job
urls-archive.max.fan-twitter-@BaltCityHall-filtered.txt-shallow-20200712-210258-2b8j4-00000.warc.gz 292315801 download   job
urls-archive.max.fan-twitter-@BaltCityHall-filtered.txt-shallow-20200712-210258-2b8j4-00000.warc.os.cdx.gz 263701 download
urls-archive.max.fan-twitter-@BaltCityHall-filtered.txt-shallow-20200712-210258-2b8j4-meta.warc.gz 143826 download   job
urls-archive.max.fan-twitter-@BaltCityHall-filtered.txt-shallow-20200712-210258-2b8j4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BaltCityHall-filtered.txt-shallow-20200712-210258-2b8j4-urls.txt 77392 download
urls-archive.max.fan-twitter-@BaltCityHall-filtered.txt-shallow-20200712-210258-2b8j4.json 339 download   job
urls-archive.max.fan-twitter-@BankingGOP-filtered.txt-shallow-20200712-210259-aayfx-00000.warc.gz 24687713 download   job
urls-archive.max.fan-twitter-@BankingGOP-filtered.txt-shallow-20200712-210259-aayfx-00000.warc.os.cdx.gz 54627 download
urls-archive.max.fan-twitter-@BankingGOP-filtered.txt-shallow-20200712-210259-aayfx-meta.warc.gz 34043 download   job
urls-archive.max.fan-twitter-@BankingGOP-filtered.txt-shallow-20200712-210259-aayfx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BankingGOP-filtered.txt-shallow-20200712-210259-aayfx-urls.txt 17480 download
urls-archive.max.fan-twitter-@BankingGOP-filtered.txt-shallow-20200712-210259-aayfx.json 335 download   job
urls-archive.max.fan-twitter-@BardishKW-filtered.txt-shallow-20200712-210257-b8153-00000.warc.gz 2554771423 download   job
urls-archive.max.fan-twitter-@BardishKW-filtered.txt-shallow-20200712-210257-b8153-00000.warc.os.cdx.gz 1963016 download
urls-archive.max.fan-twitter-@BardishKW-filtered.txt-shallow-20200712-210257-b8153-meta.warc.gz 1044847 download   job
urls-archive.max.fan-twitter-@BardishKW-filtered.txt-shallow-20200712-210257-b8153-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BardishKW-filtered.txt-shallow-20200712-210257-b8153-urls.txt 364541 download
urls-archive.max.fan-twitter-@BardishKW-filtered.txt-shallow-20200712-210257-b8153.json 333 download   job
urls-archive.max.fan-twitter-@Baric_Lab-filtered.txt-shallow-20200712-210123-4q77f-00000.warc.gz 9781719 download   job
urls-archive.max.fan-twitter-@Baric_Lab-filtered.txt-shallow-20200712-210123-4q77f-00000.warc.os.cdx.gz 28998 download
urls-archive.max.fan-twitter-@Baric_Lab-filtered.txt-shallow-20200712-210123-4q77f-meta.warc.gz 19827 download   job
urls-archive.max.fan-twitter-@Baric_Lab-filtered.txt-shallow-20200712-210123-4q77f-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Baric_Lab-filtered.txt-shallow-20200712-210123-4q77f-urls.txt 3524 download
urls-archive.max.fan-twitter-@Baric_Lab-filtered.txt-shallow-20200712-210123-4q77f.json 333 download   job
urls-archive.max.fan-twitter-@BarrowPresident-filtered.txt-shallow-20200712-210116-b1qgg-00000.warc.gz 61716765 download   job
urls-archive.max.fan-twitter-@BarrowPresident-filtered.txt-shallow-20200712-210116-b1qgg-00000.warc.os.cdx.gz 137025 download
urls-archive.max.fan-twitter-@BarrowPresident-filtered.txt-shallow-20200712-210116-b1qgg-meta.warc.gz 76644 download   job
urls-archive.max.fan-twitter-@BarrowPresident-filtered.txt-shallow-20200712-210116-b1qgg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BarrowPresident-filtered.txt-shallow-20200712-210116-b1qgg-urls.txt 15120 download
urls-archive.max.fan-twitter-@BarrowPresident-filtered.txt-shallow-20200712-210116-b1qgg.json 345 download   job
urls-archive.max.fan-twitter-@BasharatPeer-filtered.txt-shallow-20200712-205809-bog4n-00000.warc.gz 164105556 download   job
urls-archive.max.fan-twitter-@BasharatPeer-filtered.txt-shallow-20200712-205809-bog4n-00000.warc.os.cdx.gz 541009 download
urls-archive.max.fan-twitter-@BasharatPeer-filtered.txt-shallow-20200712-205809-bog4n-meta.warc.gz 286392 download   job
urls-archive.max.fan-twitter-@BasharatPeer-filtered.txt-shallow-20200712-205809-bog4n-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BasharatPeer-filtered.txt-shallow-20200712-205809-bog4n-urls.txt 98296 download
urls-archive.max.fan-twitter-@BasharatPeer-filtered.txt-shallow-20200712-205809-bog4n.json 339 download   job
urls-archive.max.fan-twitter-@BauerKahan-filtered.txt-shallow-20200712-205551-avla1-00000.warc.gz 25981131 download   job
urls-archive.max.fan-twitter-@BauerKahan-filtered.txt-shallow-20200712-205551-avla1-00000.warc.os.cdx.gz 48009 download
urls-archive.max.fan-twitter-@BauerKahan-filtered.txt-shallow-20200712-205551-avla1-meta.warc.gz 30131 download   job
urls-archive.max.fan-twitter-@BauerKahan-filtered.txt-shallow-20200712-205551-avla1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BauerKahan-filtered.txt-shallow-20200712-205551-avla1-urls.txt 9338 download
urls-archive.max.fan-twitter-@BauerKahan-filtered.txt-shallow-20200712-205551-avla1.json 335 download   job
urls-archive.max.fan-twitter-@Beatrice_Dupuy-filtered.txt-shallow-20200712-205351-bbv77-00000.warc.gz 157085133 download   job
urls-archive.max.fan-twitter-@Beatrice_Dupuy-filtered.txt-shallow-20200712-205351-bbv77-00000.warc.os.cdx.gz 192065 download
urls-archive.max.fan-twitter-@Beatrice_Dupuy-filtered.txt-shallow-20200712-205351-bbv77-meta.warc.gz 106281 download   job
urls-archive.max.fan-twitter-@Beatrice_Dupuy-filtered.txt-shallow-20200712-205351-bbv77-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Beatrice_Dupuy-filtered.txt-shallow-20200712-205351-bbv77-urls.txt 80889 download
urls-archive.max.fan-twitter-@Beatrice_Dupuy-filtered.txt-shallow-20200712-205351-bbv77.json 343 download   job
urls-archive.max.fan-twitter-@BedelSaget-filtered.txt-shallow-20200712-205350-4w501-00000.warc.gz 15341667 download   job
urls-archive.max.fan-twitter-@BedelSaget-filtered.txt-shallow-20200712-205350-4w501-00000.warc.os.cdx.gz 29249 download
urls-archive.max.fan-twitter-@BedelSaget-filtered.txt-shallow-20200712-205350-4w501-meta.warc.gz 20177 download   job
urls-archive.max.fan-twitter-@BedelSaget-filtered.txt-shallow-20200712-205350-4w501-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BedelSaget-filtered.txt-shallow-20200712-205350-4w501-urls.txt 10529 download
urls-archive.max.fan-twitter-@BedelSaget-filtered.txt-shallow-20200712-205350-4w501.json 335 download   job
urls-archive.max.fan-twitter-@BejiCEOfficial-filtered.txt-shallow-20200712-205059-aw9jo-00000.warc.gz 123552674 download   job
urls-archive.max.fan-twitter-@BejiCEOfficial-filtered.txt-shallow-20200712-205059-aw9jo-00000.warc.os.cdx.gz 229744 download
urls-archive.max.fan-twitter-@BejiCEOfficial-filtered.txt-shallow-20200712-205059-aw9jo-meta.warc.gz 127971 download   job
urls-archive.max.fan-twitter-@BejiCEOfficial-filtered.txt-shallow-20200712-205059-aw9jo-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BejiCEOfficial-filtered.txt-shallow-20200712-205059-aw9jo-urls.txt 72407 download
urls-archive.max.fan-twitter-@BejiCEOfficial-filtered.txt-shallow-20200712-205059-aw9jo.json 343 download   job
urls-archive.max.fan-twitter-@BelgiumUN-filtered.txt-shallow-20200712-205059-53l7q-00000.warc.gz 7568700 download   job
urls-archive.max.fan-twitter-@BelgiumUN-filtered.txt-shallow-20200712-205059-53l7q-00000.warc.os.cdx.gz 20587 download
urls-archive.max.fan-twitter-@BelgiumUN-filtered.txt-shallow-20200712-205059-53l7q-meta.warc.gz 15354 download   job
urls-archive.max.fan-twitter-@BelgiumUN-filtered.txt-shallow-20200712-205059-53l7q-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BelgiumUN-filtered.txt-shallow-20200712-205059-53l7q-urls.txt 1767 download
urls-archive.max.fan-twitter-@BelgiumUN-filtered.txt-shallow-20200712-205059-53l7q.json 333 download   job
urls-archive.max.fan-twitter-@BenAllenCA-filtered.txt-shallow-20200712-204942-94wlt-00000.warc.gz 4212239 download   job
urls-archive.max.fan-twitter-@BenAllenCA-filtered.txt-shallow-20200712-204942-94wlt-00000.warc.os.cdx.gz 13768 download
urls-archive.max.fan-twitter-@BenAllenCA-filtered.txt-shallow-20200712-204942-94wlt-meta.warc.gz 11639 download   job
urls-archive.max.fan-twitter-@BenAllenCA-filtered.txt-shallow-20200712-204942-94wlt-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BenAllenCA-filtered.txt-shallow-20200712-204942-94wlt-urls.txt 1681 download
urls-archive.max.fan-twitter-@BenAllenCA-filtered.txt-shallow-20200712-204942-94wlt.json 335 download   job
urls-archive.max.fan-twitter-@BenWeiserNYT-filtered.txt-shallow-20200712-204902-1s2rf-00000.warc.gz 374308402 download   job
urls-archive.max.fan-twitter-@BenWeiserNYT-filtered.txt-shallow-20200712-204902-1s2rf-00000.warc.os.cdx.gz 924054 download
urls-archive.max.fan-twitter-@BenWeiserNYT-filtered.txt-shallow-20200712-204902-1s2rf-meta.warc.gz 486096 download   job
urls-archive.max.fan-twitter-@BenWeiserNYT-filtered.txt-shallow-20200712-204902-1s2rf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BenWeiserNYT-filtered.txt-shallow-20200712-204902-1s2rf-urls.txt 317849 download
urls-archive.max.fan-twitter-@BenWeiserNYT-filtered.txt-shallow-20200712-204902-1s2rf.json 339 download   job
urls-archive.max.fan-twitter-@BenjaminDooley-filtered.txt-shallow-20200712-204941-3g3pq-00000.warc.gz 614565252 download   job
urls-archive.max.fan-twitter-@BenjaminDooley-filtered.txt-shallow-20200712-204941-3g3pq-00000.warc.os.cdx.gz 719151 download
urls-archive.max.fan-twitter-@BenjaminDooley-filtered.txt-shallow-20200712-204941-3g3pq-meta.warc.gz 384657 download   job
urls-archive.max.fan-twitter-@BenjaminDooley-filtered.txt-shallow-20200712-204941-3g3pq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BenjaminDooley-filtered.txt-shallow-20200712-204941-3g3pq-urls.txt 448762 download
urls-archive.max.fan-twitter-@BenjaminDooley-filtered.txt-shallow-20200712-204941-3g3pq.json 343 download   job
urls-archive.max.fan-twitter-@BennySnyderAP-filtered.txt-shallow-20200712-204902-9i0vc-00000.warc.gz 1182948 download   job
urls-archive.max.fan-twitter-@BennySnyderAP-filtered.txt-shallow-20200712-204902-9i0vc-00000.warc.os.cdx.gz 4368 download
urls-archive.max.fan-twitter-@BennySnyderAP-filtered.txt-shallow-20200712-204902-9i0vc-meta.warc.gz 6321 download   job
urls-archive.max.fan-twitter-@BennySnyderAP-filtered.txt-shallow-20200712-204902-9i0vc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BennySnyderAP-filtered.txt-shallow-20200712-204902-9i0vc-urls.txt 180 download
urls-archive.max.fan-twitter-@BennySnyderAP-filtered.txt-shallow-20200712-204902-9i0vc.json 341 download   job
urls-archive.max.fan-twitter-@Berlin_TechJob-filtered.txt-shallow-20200712-204859-43hfg-00000.warc.gz 993403891 download   job
urls-archive.max.fan-twitter-@Berlin_TechJob-filtered.txt-shallow-20200712-204859-43hfg-00000.warc.os.cdx.gz 937532 download
urls-archive.max.fan-twitter-@Berlin_TechJob-filtered.txt-shallow-20200712-204859-43hfg-meta.warc.gz 490732 download   job
urls-archive.max.fan-twitter-@Berlin_TechJob-filtered.txt-shallow-20200712-204859-43hfg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Berlin_TechJob-filtered.txt-shallow-20200712-204859-43hfg-urls.txt 1008208 download
urls-archive.max.fan-twitter-@Berlin_TechJob-filtered.txt-shallow-20200712-204859-43hfg.json 343 download   job
urls-archive.max.fan-twitter-@BernatArmangue-filtered.txt-shallow-20200712-204439-az3gt-00000.warc.gz 59342559 download   job
urls-archive.max.fan-twitter-@BernatArmangue-filtered.txt-shallow-20200712-204439-az3gt-00000.warc.os.cdx.gz 96781 download
urls-archive.max.fan-twitter-@BernatArmangue-filtered.txt-shallow-20200712-204439-az3gt-meta.warc.gz 55902 download   job
urls-archive.max.fan-twitter-@BernatArmangue-filtered.txt-shallow-20200712-204439-az3gt-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BernatArmangue-filtered.txt-shallow-20200712-204439-az3gt-urls.txt 47185 download
urls-archive.max.fan-twitter-@BernatArmangue-filtered.txt-shallow-20200712-204439-az3gt.json 343 download   job
urls-archive.max.fan-twitter-@BernieSanders-filtered.txt-shallow-20200712-204439-7fsjh-00000.warc.gz 97215348 download   job
urls-archive.max.fan-twitter-@BernieSanders-filtered.txt-shallow-20200712-204439-7fsjh-00000.warc.os.cdx.gz 302705 download
urls-archive.max.fan-twitter-@BernieSanders-filtered.txt-shallow-20200712-204439-7fsjh-meta.warc.gz 163328 download   job
urls-archive.max.fan-twitter-@BernieSanders-filtered.txt-shallow-20200712-204439-7fsjh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BernieSanders-filtered.txt-shallow-20200712-204439-7fsjh-urls.txt 16104 download
urls-archive.max.fan-twitter-@BernieSanders-filtered.txt-shallow-20200712-204439-7fsjh.json 341 download   job
urls-archive.max.fan-twitter-@BetoORourke-filtered.txt-shallow-20200712-204436-2osy5-00000.warc.gz 176786682 download   job
urls-archive.max.fan-twitter-@BetoORourke-filtered.txt-shallow-20200712-204436-2osy5-00000.warc.os.cdx.gz 561586 download
urls-archive.max.fan-twitter-@BetoORourke-filtered.txt-shallow-20200712-204436-2osy5-meta.warc.gz 298966 download   job
urls-archive.max.fan-twitter-@BetoORourke-filtered.txt-shallow-20200712-204436-2osy5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BetoORourke-filtered.txt-shallow-20200712-204436-2osy5-urls.txt 34397 download
urls-archive.max.fan-twitter-@BetoORourke-filtered.txt-shallow-20200712-204436-2osy5.json 337 download   job
urls-archive.max.fan-twitter-@BetsyDeVosED-filtered.txt-shallow-20200712-203356-lmfo2-00000.warc.gz 302847873 download   job
urls-archive.max.fan-twitter-@BetsyDeVosED-filtered.txt-shallow-20200712-203356-lmfo2-00000.warc.os.cdx.gz 794699 download
urls-archive.max.fan-twitter-@BetsyDeVosED-filtered.txt-shallow-20200712-203356-lmfo2-meta.warc.gz 422670 download   job
urls-archive.max.fan-twitter-@BetsyDeVosED-filtered.txt-shallow-20200712-203356-lmfo2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BetsyDeVosED-filtered.txt-shallow-20200712-203356-lmfo2-urls.txt 65715 download
urls-archive.max.fan-twitter-@BetsyDeVosED-filtered.txt-shallow-20200712-203356-lmfo2.json 339 download   job
urls-archive.max.fan-twitter-@BhutanUN-filtered.txt-shallow-20200712-203219-887z6-00000.warc.gz 14389783 download   job
urls-archive.max.fan-twitter-@BhutanUN-filtered.txt-shallow-20200712-203219-887z6-00000.warc.os.cdx.gz 30627 download
urls-archive.max.fan-twitter-@BhutanUN-filtered.txt-shallow-20200712-203219-887z6-meta.warc.gz 20531 download   job
urls-archive.max.fan-twitter-@BhutanUN-filtered.txt-shallow-20200712-203219-887z6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BhutanUN-filtered.txt-shallow-20200712-203219-887z6-urls.txt 2800 download
urls-archive.max.fan-twitter-@BhutanUN-filtered.txt-shallow-20200712-203219-887z6.json 331 download   job
urls-archive.max.fan-twitter-@BigBendNPS-filtered.txt-shallow-20200712-203219-djpra-00000.warc.gz 163429842 download   job
urls-archive.max.fan-twitter-@BigBendNPS-filtered.txt-shallow-20200712-203219-djpra-00000.warc.os.cdx.gz 220885 download
urls-archive.max.fan-twitter-@BigBendNPS-filtered.txt-shallow-20200712-203219-djpra-meta.warc.gz 122166 download   job
urls-archive.max.fan-twitter-@BigBendNPS-filtered.txt-shallow-20200712-203219-djpra-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BigBendNPS-filtered.txt-shallow-20200712-203219-djpra-urls.txt 44791 download
urls-archive.max.fan-twitter-@BigBendNPS-filtered.txt-shallow-20200712-203219-djpra.json 335 download   job
urls-archive.max.fan-twitter-@BillBarrowAP-filtered.txt-shallow-20200712-202914-8yr66-00000.warc.gz 552637312 download   job
urls-archive.max.fan-twitter-@BillBarrowAP-filtered.txt-shallow-20200712-202914-8yr66-00000.warc.os.cdx.gz 774378 download
urls-archive.max.fan-twitter-@BillBarrowAP-filtered.txt-shallow-20200712-202914-8yr66-meta.warc.gz 409825 download   job
urls-archive.max.fan-twitter-@BillBarrowAP-filtered.txt-shallow-20200712-202914-8yr66-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BillBarrowAP-filtered.txt-shallow-20200712-202914-8yr66-urls.txt 452518 download
urls-archive.max.fan-twitter-@BillBarrowAP-filtered.txt-shallow-20200712-202914-8yr66.json 339 download   job
urls-archive.max.fan-twitter-@BillBlair-filtered.txt-shallow-20200712-202506-c11r5-00000.warc.gz 337933434 download   job
urls-archive.max.fan-twitter-@BillBlair-filtered.txt-shallow-20200712-202506-c11r5-00000.warc.os.cdx.gz 567091 download
urls-archive.max.fan-twitter-@BillBlair-filtered.txt-shallow-20200712-202506-c11r5-meta.warc.gz 305825 download   job
urls-archive.max.fan-twitter-@BillBlair-filtered.txt-shallow-20200712-202506-c11r5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BillBlair-filtered.txt-shallow-20200712-202506-c11r5-urls.txt 76948 download
urls-archive.max.fan-twitter-@BillBlair-filtered.txt-shallow-20200712-202506-c11r5.json 333 download   job
urls-archive.max.fan-twitter-@BillGalvin4MA-filtered.txt-shallow-20200712-202235-ajwte-00000.warc.gz 26805389 download   job
urls-archive.max.fan-twitter-@BillGalvin4MA-filtered.txt-shallow-20200712-202235-ajwte-00000.warc.os.cdx.gz 56453 download
urls-archive.max.fan-twitter-@BillGalvin4MA-filtered.txt-shallow-20200712-202235-ajwte-meta.warc.gz 34377 download   job
urls-archive.max.fan-twitter-@BillGalvin4MA-filtered.txt-shallow-20200712-202235-ajwte-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BillGalvin4MA-filtered.txt-shallow-20200712-202235-ajwte-urls.txt 12383 download
urls-archive.max.fan-twitter-@BillGalvin4MA-filtered.txt-shallow-20200712-202235-ajwte.json 341 download   job
urls-archive.max.fan-twitter-@BillLeeTN-filtered.txt-shallow-20200712-201941-63p51-00000.warc.gz 361767909 download   job
urls-archive.max.fan-twitter-@BillLeeTN-filtered.txt-shallow-20200712-201941-63p51-00000.warc.os.cdx.gz 415968 download
urls-archive.max.fan-twitter-@BillLeeTN-filtered.txt-shallow-20200712-201941-63p51-meta.warc.gz 223786 download   job
urls-archive.max.fan-twitter-@BillLeeTN-filtered.txt-shallow-20200712-201941-63p51-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BillLeeTN-filtered.txt-shallow-20200712-201941-63p51-urls.txt 78973 download
urls-archive.max.fan-twitter-@BillLeeTN-filtered.txt-shallow-20200712-201941-63p51.json 333 download   job
urls-archive.max.fan-twitter-@Bill_Morneau-filtered.txt-shallow-20200712-201550-ewjxg-00000.warc.gz 846008522 download   job
urls-archive.max.fan-twitter-@Bill_Morneau-filtered.txt-shallow-20200712-201550-ewjxg-00000.warc.os.cdx.gz 1395144 download
urls-archive.max.fan-twitter-@Bill_Morneau-filtered.txt-shallow-20200712-201550-ewjxg-meta.warc.gz 742250 download   job
urls-archive.max.fan-twitter-@Bill_Morneau-filtered.txt-shallow-20200712-201550-ewjxg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Bill_Morneau-filtered.txt-shallow-20200712-201550-ewjxg-urls.txt 251764 download
urls-archive.max.fan-twitter-@Bill_Morneau-filtered.txt-shallow-20200712-201550-ewjxg.json 339 download   job
urls-archive.max.fan-twitter-@BilldeBlasio-filtered.txt-shallow-20200712-202238-bisiy-00000.warc.gz 435821528 download   job
urls-archive.max.fan-twitter-@BilldeBlasio-filtered.txt-shallow-20200712-202238-bisiy-00000.warc.os.cdx.gz 1178738 download
urls-archive.max.fan-twitter-@BilldeBlasio-filtered.txt-shallow-20200712-202238-bisiy-meta.warc.gz 626735 download   job
urls-archive.max.fan-twitter-@BilldeBlasio-filtered.txt-shallow-20200712-202238-bisiy-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BilldeBlasio-filtered.txt-shallow-20200712-202238-bisiy-urls.txt 106827 download
urls-archive.max.fan-twitter-@BilldeBlasio-filtered.txt-shallow-20200712-202238-bisiy.json 339 download   job
urls-archive.max.fan-twitter-@BogdanAurescu-filtered.txt-shallow-20200712-201247-blfre-00000.warc.gz 93869628 download   job
urls-archive.max.fan-twitter-@BogdanAurescu-filtered.txt-shallow-20200712-201247-blfre-00000.warc.os.cdx.gz 138260 download
urls-archive.max.fan-twitter-@BogdanAurescu-filtered.txt-shallow-20200712-201247-blfre-meta.warc.gz 77867 download   job
urls-archive.max.fan-twitter-@BogdanAurescu-filtered.txt-shallow-20200712-201247-blfre-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BogdanAurescu-filtered.txt-shallow-20200712-201247-blfre-urls.txt 38441 download
urls-archive.max.fan-twitter-@BogdanAurescu-filtered.txt-shallow-20200712-201247-blfre.json 341 download   job
urls-archive.max.fan-twitter-@BoliviaUNNY-filtered.txt-shallow-20200712-201216-dv9nr-00000.warc.gz 328764438 download   job
urls-archive.max.fan-twitter-@BoliviaUNNY-filtered.txt-shallow-20200712-201216-dv9nr-00000.warc.os.cdx.gz 334641 download
urls-archive.max.fan-twitter-@BoliviaUNNY-filtered.txt-shallow-20200712-201216-dv9nr-meta.warc.gz 178968 download   job
urls-archive.max.fan-twitter-@BoliviaUNNY-filtered.txt-shallow-20200712-201216-dv9nr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BoliviaUNNY-filtered.txt-shallow-20200712-201216-dv9nr-urls.txt 93932 download
urls-archive.max.fan-twitter-@BoliviaUNNY-filtered.txt-shallow-20200712-201216-dv9nr.json 337 download   job
urls-archive.max.fan-twitter-@BorderlessEdu-filtered.txt-shallow-20200712-200838-brtph-00000.warc.gz 52973008 download   job
urls-archive.max.fan-twitter-@BorderlessEdu-filtered.txt-shallow-20200712-200838-brtph-00000.warc.os.cdx.gz 60085 download
urls-archive.max.fan-twitter-@BorderlessEdu-filtered.txt-shallow-20200712-200838-brtph-meta.warc.gz 36878 download   job
urls-archive.max.fan-twitter-@BorderlessEdu-filtered.txt-shallow-20200712-200838-brtph-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BorderlessEdu-filtered.txt-shallow-20200712-200838-brtph-urls.txt 39709 download
urls-archive.max.fan-twitter-@BorderlessEdu-filtered.txt-shallow-20200712-200838-brtph.json 341 download   job
urls-archive.max.fan-twitter-@BosyUncle-filtered.txt-shallow-20200712-200727-92z89-00000.warc.gz 913116392 download   job
urls-archive.max.fan-twitter-@BosyUncle-filtered.txt-shallow-20200712-200727-92z89-00000.warc.os.cdx.gz 839900 download
urls-archive.max.fan-twitter-@BosyUncle-filtered.txt-shallow-20200712-200727-92z89-meta.warc.gz 444681 download   job
urls-archive.max.fan-twitter-@BosyUncle-filtered.txt-shallow-20200712-200727-92z89-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BosyUncle-filtered.txt-shallow-20200712-200727-92z89-urls.txt 804850 download
urls-archive.max.fan-twitter-@BosyUncle-filtered.txt-shallow-20200712-200727-92z89.json 333 download   job
urls-archive.max.fan-twitter-@BradEickholt-filtered.txt-shallow-20200712-200341-bidx9-00000.warc.gz 34939009 download   job
urls-archive.max.fan-twitter-@BradEickholt-filtered.txt-shallow-20200712-200341-bidx9-00000.warc.os.cdx.gz 49113 download
urls-archive.max.fan-twitter-@BradEickholt-filtered.txt-shallow-20200712-200341-bidx9-meta.warc.gz 30940 download   job
urls-archive.max.fan-twitter-@BradEickholt-filtered.txt-shallow-20200712-200341-bidx9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BradEickholt-filtered.txt-shallow-20200712-200341-bidx9-urls.txt 20328 download
urls-archive.max.fan-twitter-@BradEickholt-filtered.txt-shallow-20200712-200341-bidx9.json 339 download   job
urls-archive.max.fan-twitter-@BrentNYT-filtered.txt-shallow-20200712-200110-94bi2-00000.warc.gz 1830621395 download   job
urls-archive.max.fan-twitter-@BrentNYT-filtered.txt-shallow-20200712-200110-94bi2-00000.warc.os.cdx.gz 3248614 download
urls-archive.max.fan-twitter-@BrentNYT-filtered.txt-shallow-20200712-200110-94bi2-meta.warc.gz 1695868 download   job
urls-archive.max.fan-twitter-@BrentNYT-filtered.txt-shallow-20200712-200110-94bi2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrentNYT-filtered.txt-shallow-20200712-200110-94bi2-urls.txt 836538 download
urls-archive.max.fan-twitter-@BrentNYT-filtered.txt-shallow-20200712-200110-94bi2.json 331 download   job
urls-archive.max.fan-twitter-@BrianDahleCA-filtered.txt-shallow-20200712-200107-7iykx-00000.warc.gz 7429782 download   job
urls-archive.max.fan-twitter-@BrianDahleCA-filtered.txt-shallow-20200712-200107-7iykx-00000.warc.os.cdx.gz 17806 download
urls-archive.max.fan-twitter-@BrianDahleCA-filtered.txt-shallow-20200712-200107-7iykx-meta.warc.gz 13879 download   job
urls-archive.max.fan-twitter-@BrianDahleCA-filtered.txt-shallow-20200712-200107-7iykx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrianDahleCA-filtered.txt-shallow-20200712-200107-7iykx-urls.txt 2760 download
urls-archive.max.fan-twitter-@BrianDahleCA-filtered.txt-shallow-20200712-200107-7iykx.json 339 download   job
urls-archive.max.fan-twitter-@BrianLeungKP-filtered.txt-shallow-20200712-200106-chcpd-00000.warc.gz 23613215 download   job
urls-archive.max.fan-twitter-@BrianLeungKP-filtered.txt-shallow-20200712-200106-chcpd-00000.warc.os.cdx.gz 75912 download
urls-archive.max.fan-twitter-@BrianLeungKP-filtered.txt-shallow-20200712-200106-chcpd-meta.warc.gz 44644 download   job
urls-archive.max.fan-twitter-@BrianLeungKP-filtered.txt-shallow-20200712-200106-chcpd-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrianLeungKP-filtered.txt-shallow-20200712-200106-chcpd-urls.txt 8220 download
urls-archive.max.fan-twitter-@BrianLeungKP-filtered.txt-shallow-20200712-200106-chcpd.json 339 download   job
urls-archive.max.fan-twitter-@Brian_Rohan-filtered.txt-shallow-20200712-200106-922yn-00000.warc.gz 106244896 download   job
urls-archive.max.fan-twitter-@Brian_Rohan-filtered.txt-shallow-20200712-200106-922yn-00000.warc.os.cdx.gz 166098 download
urls-archive.max.fan-twitter-@Brian_Rohan-filtered.txt-shallow-20200712-200106-922yn-meta.warc.gz 92744 download   job
urls-archive.max.fan-twitter-@Brian_Rohan-filtered.txt-shallow-20200712-200106-922yn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Brian_Rohan-filtered.txt-shallow-20200712-200106-922yn-urls.txt 78953 download
urls-archive.max.fan-twitter-@Brian_Rohan-filtered.txt-shallow-20200712-200106-922yn.json 337 download   job
urls-archive.max.fan-twitter-@BrunoLeMaire-filtered.txt-shallow-20200712-195435-ei8xs-00000.warc.gz 1617308336 download   job
urls-archive.max.fan-twitter-@BrunoLeMaire-filtered.txt-shallow-20200712-195435-ei8xs-00000.warc.os.cdx.gz 3252552 download
urls-archive.max.fan-twitter-@BrunoLeMaire-filtered.txt-shallow-20200712-195435-ei8xs-meta.warc.gz 1720843 download   job
urls-archive.max.fan-twitter-@BrunoLeMaire-filtered.txt-shallow-20200712-195435-ei8xs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BrunoLeMaire-filtered.txt-shallow-20200712-195435-ei8xs-urls.txt 544412 download
urls-archive.max.fan-twitter-@BrunoLeMaire-filtered.txt-shallow-20200712-195435-ei8xs.json 339 download   job
urls-archive.max.fan-twitter-@BryceDHoward-filtered.txt-shallow-20200712-195425-2nwwz-00000.warc.gz 471455518 download   job
urls-archive.max.fan-twitter-@BryceDHoward-filtered.txt-shallow-20200712-195425-2nwwz-00000.warc.os.cdx.gz 993360 download
urls-archive.max.fan-twitter-@BryceDHoward-filtered.txt-shallow-20200712-195425-2nwwz-meta.warc.gz 523284 download   job
urls-archive.max.fan-twitter-@BryceDHoward-filtered.txt-shallow-20200712-195425-2nwwz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@BryceDHoward-filtered.txt-shallow-20200712-195425-2nwwz-urls.txt 112832 download
urls-archive.max.fan-twitter-@BryceDHoward-filtered.txt-shallow-20200712-195425-2nwwz.json 339 download   job
urls-archive.max.fan-twitter-@BurundiForeign-filtered.txt-shallow-20200712-195421-1cxp1.json 343 download   job
urls-archive.max.fan-twitter-@ButteSheriff-filtered.txt-shallow-20200712-195134-68f3e-00000.warc.gz 443828126 download   job
urls-archive.max.fan-twitter-@ButteSheriff-filtered.txt-shallow-20200712-195134-68f3e-00000.warc.os.cdx.gz 514138 download
urls-archive.max.fan-twitter-@ButteSheriff-filtered.txt-shallow-20200712-195134-68f3e-meta.warc.gz 273580 download   job
urls-archive.max.fan-twitter-@ButteSheriff-filtered.txt-shallow-20200712-195134-68f3e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ButteSheriff-filtered.txt-shallow-20200712-195134-68f3e-urls.txt 132571 download
urls-archive.max.fan-twitter-@ButteSheriff-filtered.txt-shallow-20200712-195134-68f3e.json 339 download   job
urls-archive.max.fan-twitter-@ByAntonioG-filtered.txt-shallow-20200712-195132-diuj5-00000.warc.gz 406116144 download   job
urls-archive.max.fan-twitter-@ByAntonioG-filtered.txt-shallow-20200712-195132-diuj5-00000.warc.os.cdx.gz 529282 download
urls-archive.max.fan-twitter-@ByAntonioG-filtered.txt-shallow-20200712-195132-diuj5-meta.warc.gz 283355 download   job
urls-archive.max.fan-twitter-@ByAntonioG-filtered.txt-shallow-20200712-195132-diuj5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ByAntonioG-filtered.txt-shallow-20200712-195132-diuj5-urls.txt 370485 download
urls-archive.max.fan-twitter-@ByAntonioG-filtered.txt-shallow-20200712-195132-diuj5.json 335 download   job
urls-archive.max.fan-twitter-@ByMattStevens-filtered.txt-shallow-20200712-194711-3dv3u-00000.warc.gz 407043702 download   job
urls-archive.max.fan-twitter-@ByMattStevens-filtered.txt-shallow-20200712-194711-3dv3u-00000.warc.os.cdx.gz 664829 download
urls-archive.max.fan-twitter-@ByMattStevens-filtered.txt-shallow-20200712-194711-3dv3u-meta.warc.gz 354745 download   job
urls-archive.max.fan-twitter-@ByMattStevens-filtered.txt-shallow-20200712-194711-3dv3u-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ByMattStevens-filtered.txt-shallow-20200712-194711-3dv3u-urls.txt 273623 download
urls-archive.max.fan-twitter-@ByMattStevens-filtered.txt-shallow-20200712-194711-3dv3u.json 341 download   job
urls-archive.max.fan-twitter-@CCNUCC-filtered.txt-shallow-20200712-191912-3awcj-00000.warc.gz 1744778628 download   job
urls-archive.max.fan-twitter-@CCNUCC-filtered.txt-shallow-20200712-191912-3awcj-00000.warc.os.cdx.gz 2119375 download
urls-archive.max.fan-twitter-@CCNUCC-filtered.txt-shallow-20200712-191912-3awcj-meta.warc.gz 1121991 download   job
urls-archive.max.fan-twitter-@CCNUCC-filtered.txt-shallow-20200712-191912-3awcj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CCNUCC-filtered.txt-shallow-20200712-191912-3awcj-urls.txt 485034 download
urls-archive.max.fan-twitter-@CCNUCC-filtered.txt-shallow-20200712-191912-3awcj.json 327 download   job
urls-archive.max.fan-twitter-@CDCDirector-filtered.txt-shallow-20200712-191911-5exwm-00000.warc.gz 1527940871 download   job
urls-archive.max.fan-twitter-@CDCDirector-filtered.txt-shallow-20200712-191911-5exwm-00000.warc.os.cdx.gz 4077379 download
urls-archive.max.fan-twitter-@CDCDirector-filtered.txt-shallow-20200712-191911-5exwm-meta.warc.gz 2150042 download   job
urls-archive.max.fan-twitter-@CDCDirector-filtered.txt-shallow-20200712-191911-5exwm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CDCDirector-filtered.txt-shallow-20200712-191911-5exwm-urls.txt 767513 download
urls-archive.max.fan-twitter-@CDCDirector-filtered.txt-shallow-20200712-191911-5exwm.json 337 download   job
urls-archive.max.fan-twitter-@CFEmx-filtered.txt-shallow-20200712-190746-d71j9-00000.warc.gz 5368716223 download   job
urls-archive.max.fan-twitter-@CFEmx-filtered.txt-shallow-20200712-190746-d71j9-00000.warc.os.cdx.gz 4466114 download
urls-archive.max.fan-twitter-@CHP_SantaBarb-filtered.txt-shallow-20200712-185854-8va56.json 341 download   job
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me-00000.warc.gz 5368787131 download   job
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me-00000.warc.os.cdx.gz 5856557 download
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me-00001.warc.gz 1536962313 download   job
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me-00001.warc.os.cdx.gz 1760989 download
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me-meta.warc.gz 4011453 download   job
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me-urls.txt 1746858 download
urls-archive.max.fan-twitter-@CNPC_MX-filtered.txt-shallow-20200712-174945-8f8me.json 329 download   job
urls-archive.max.fan-twitter-@CT_Bergstrom-filtered.txt-shallow-20200712-172436-3b6n6-meta.warc.gz 1864659 download   job
urls-archive.max.fan-twitter-@CT_Bergstrom-filtered.txt-shallow-20200712-172436-3b6n6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Cal_OES-filtered.txt-shallow-20200712-193930-qq9nl-00000.warc.gz 1071008006 download   job
urls-archive.max.fan-twitter-@Cal_OES-filtered.txt-shallow-20200712-193930-qq9nl-00000.warc.os.cdx.gz 1659843 download
urls-archive.max.fan-twitter-@Cal_OES-filtered.txt-shallow-20200712-193930-qq9nl-meta.warc.gz 884670 download   job
urls-archive.max.fan-twitter-@Cal_OES-filtered.txt-shallow-20200712-193930-qq9nl-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Cal_OES-filtered.txt-shallow-20200712-193930-qq9nl-urls.txt 441093 download
urls-archive.max.fan-twitter-@Cal_OES-filtered.txt-shallow-20200712-193930-qq9nl.json 329 download   job
urls-archive.max.fan-twitter-@CarlaKJohnson-filtered.txt-shallow-20200712-193114-1sl3y-00000.warc.gz 512728285 download   job
urls-archive.max.fan-twitter-@CarlaKJohnson-filtered.txt-shallow-20200712-193114-1sl3y-00000.warc.os.cdx.gz 732444 download
urls-archive.max.fan-twitter-@CarlaKJohnson-filtered.txt-shallow-20200712-193114-1sl3y-meta.warc.gz 388640 download   job
urls-archive.max.fan-twitter-@CarlaKJohnson-filtered.txt-shallow-20200712-193114-1sl3y-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CarlaKJohnson-filtered.txt-shallow-20200712-193114-1sl3y-urls.txt 442776 download
urls-archive.max.fan-twitter-@CarlaKJohnson-filtered.txt-shallow-20200712-193114-1sl3y.json 341 download   job
urls-archive.max.fan-twitter-@CarlosAlvQ-filtered.txt-shallow-20200712-193114-g5rzj-00000.warc.gz 1254821681 download   job
urls-archive.max.fan-twitter-@CarlosAlvQ-filtered.txt-shallow-20200712-193114-g5rzj-00000.warc.os.cdx.gz 2389219 download
urls-archive.max.fan-twitter-@CarlosAlvQ-filtered.txt-shallow-20200712-193114-g5rzj-meta.warc.gz 1261903 download   job
urls-archive.max.fan-twitter-@CarlosAlvQ-filtered.txt-shallow-20200712-193114-g5rzj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CarlosAlvQ-filtered.txt-shallow-20200712-193114-g5rzj-urls.txt 378534 download
urls-archive.max.fan-twitter-@CarlosAlvQ-filtered.txt-shallow-20200712-193114-g5rzj.json 335 download   job
urls-archive.max.fan-twitter-@CasaReal-filtered.txt-shallow-20200712-192802-328tl-00000.warc.gz 3592854820 download   job
urls-archive.max.fan-twitter-@CasaReal-filtered.txt-shallow-20200712-192802-328tl-00000.warc.os.cdx.gz 3651269 download
urls-archive.max.fan-twitter-@CasaReal-filtered.txt-shallow-20200712-192802-328tl-meta.warc.gz 1883961 download   job
urls-archive.max.fan-twitter-@CasaReal-filtered.txt-shallow-20200712-192802-328tl-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@CasaReal-filtered.txt-shallow-20200712-192802-328tl-urls.txt 333111 download
urls-archive.max.fan-twitter-@CasaReal-filtered.txt-shallow-20200712-192802-328tl.json 331 download   job
urls-archive.max.fan-twitter-@adeelnyt-filtered.txt-shallow-20200712-224453-2jnp4-00000.warc.gz 39646564 download   job
urls-archive.max.fan-twitter-@adeelnyt-filtered.txt-shallow-20200712-224453-2jnp4-00000.warc.os.cdx.gz 85152 download
urls-archive.max.fan-twitter-@adriennecarter-filtered.txt-shallow-20200712-224421-2xv4k-urls.txt 87731 download
urls-archive.max.fan-twitter-@adriennecarter-filtered.txt-shallow-20200712-224421-2xv4k.json 343 download   job
urls-archive.max.fan-twitter-@amyklobuchar-filtered.txt-shallow-20200712-222023-cs0eb-meta.warc.gz 325897 download   job
urls-archive.max.fan-twitter-@amyklobuchar-filtered.txt-shallow-20200712-222023-cs0eb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@amyyqin-filtered.txt-shallow-20200712-221626-272n0-urls.txt 179890 download
urls-archive.max.fan-twitter-@amyyqin-filtered.txt-shallow-20200712-221626-272n0.json 329 download   job
urls-archive.max.fan-twitter-@andersen_inger-filtered.txt-shallow-20200712-221625-1bq5d-urls.txt 141697 download
urls-archive.max.fan-twitter-@andreahagelgans-filtered.txt-shallow-20200712-215856-ckp73-meta.warc.gz 212695 download   job
urls-archive.max.fan-twitter-@andreahagelgans-filtered.txt-shallow-20200712-215856-ckp73-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@annakmall-filtered.txt-shallow-20200712-215007-afa3d-00000.warc.gz 23352159 download   job
urls-archive.max.fan-twitter-@annakmall-filtered.txt-shallow-20200712-215007-afa3d-00000.warc.os.cdx.gz 30447 download
urls-archive.max.fan-twitter-@annakmall-filtered.txt-shallow-20200712-215007-afa3d-meta.warc.gz 20495 download   job
urls-archive.max.fan-twitter-@annakmall-filtered.txt-shallow-20200712-215007-afa3d-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@annakmall-filtered.txt-shallow-20200712-215007-afa3d-urls.txt 9638 download
urls-archive.max.fan-twitter-@annakmall-filtered.txt-shallow-20200712-215007-afa3d.json 333 download   job
urls-archive.max.fan-twitter-@anniecorreal-filtered.txt-shallow-20200712-214918-718dq-00000.warc.gz 235482106 download   job
urls-archive.max.fan-twitter-@anniecorreal-filtered.txt-shallow-20200712-214918-718dq-00000.warc.os.cdx.gz 517709 download
urls-archive.max.fan-twitter-@antonioguterres-filtered.txt-shallow-20200712-214313-d67r9-00000.warc.gz 687243361 download   job
urls-archive.max.fan-twitter-@antonioguterres-filtered.txt-shallow-20200712-214313-d67r9-00000.warc.os.cdx.gz 1503933 download
urls-archive.max.fan-twitter-@aplamanning-filtered.txt-shallow-20200712-214226-ah50u-00000.warc.gz 986949 download   job
urls-archive.max.fan-twitter-@aplamanning-filtered.txt-shallow-20200712-214226-ah50u-00000.warc.os.cdx.gz 4406 download
urls-archive.max.fan-twitter-@aplamanning-filtered.txt-shallow-20200712-214226-ah50u-meta.warc.gz 6351 download   job
urls-archive.max.fan-twitter-@aplamanning-filtered.txt-shallow-20200712-214226-ah50u-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@aplamanning-filtered.txt-shallow-20200712-214226-ah50u-urls.txt 174 download
urls-archive.max.fan-twitter-@aplamanning-filtered.txt-shallow-20200712-214226-ah50u.json 337 download   job
urls-archive.max.fan-twitter-@aptonygutierrez-filtered.txt-shallow-20200712-213530-ccq1n-00000.warc.gz 234548750 download   job
urls-archive.max.fan-twitter-@aptonygutierrez-filtered.txt-shallow-20200712-213530-ccq1n-00000.warc.os.cdx.gz 200748 download
urls-archive.max.fan-twitter-@aptonygutierrez-filtered.txt-shallow-20200712-213530-ccq1n-meta.warc.gz 110442 download   job
urls-archive.max.fan-twitter-@aptonygutierrez-filtered.txt-shallow-20200712-213530-ccq1n-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@aptonygutierrez-filtered.txt-shallow-20200712-213530-ccq1n-urls.txt 139519 download
urls-archive.max.fan-twitter-@aptonygutierrez-filtered.txt-shallow-20200712-213530-ccq1n.json 345 download   job
urls-archive.max.fan-twitter-@apwillweissert-filtered.txt-shallow-20200712-213104-19vnn-00000.warc.gz 102858881 download   job
urls-archive.max.fan-twitter-@apwillweissert-filtered.txt-shallow-20200712-213104-19vnn-00000.warc.os.cdx.gz 151763 download
urls-archive.max.fan-twitter-@apwillweissert-filtered.txt-shallow-20200712-213104-19vnn-meta.warc.gz 84909 download   job
urls-archive.max.fan-twitter-@apwillweissert-filtered.txt-shallow-20200712-213104-19vnn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@apwillweissert-filtered.txt-shallow-20200712-213104-19vnn-urls.txt 91815 download
urls-archive.max.fan-twitter-@apwillweissert-filtered.txt-shallow-20200712-213104-19vnn.json 343 download   job
urls-archive.max.fan-twitter-@arlegalaid-filtered.txt-shallow-20200712-213059-7hope-00000.warc.gz 70752080 download   job
urls-archive.max.fan-twitter-@arlegalaid-filtered.txt-shallow-20200712-213059-7hope-00000.warc.os.cdx.gz 81921 download
urls-archive.max.fan-twitter-@arlegalaid-filtered.txt-shallow-20200712-213059-7hope-meta.warc.gz 48333 download   job
urls-archive.max.fan-twitter-@arlegalaid-filtered.txt-shallow-20200712-213059-7hope-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@arlegalaid-filtered.txt-shallow-20200712-213059-7hope-urls.txt 61266 download
urls-archive.max.fan-twitter-@arlegalaid-filtered.txt-shallow-20200712-213059-7hope.json 335 download   job
urls-archive.max.fan-twitter-@arniestapleton-filtered.txt-shallow-20200712-212556-dm302-00000.warc.gz 304466485 download   job
urls-archive.max.fan-twitter-@arniestapleton-filtered.txt-shallow-20200712-212556-dm302-00000.warc.os.cdx.gz 352620 download
urls-archive.max.fan-twitter-@arniestapleton-filtered.txt-shallow-20200712-212556-dm302-meta.warc.gz 190379 download   job
urls-archive.max.fan-twitter-@arniestapleton-filtered.txt-shallow-20200712-212556-dm302-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@arniestapleton-filtered.txt-shallow-20200712-212556-dm302-urls.txt 232710 download
urls-archive.max.fan-twitter-@arniestapleton-filtered.txt-shallow-20200712-212556-dm302.json 343 download   job
urls-archive.max.fan-twitter-@austinramzy-filtered.txt-shallow-20200712-211446-9meem-urls.txt 690631 download
urls-archive.max.fan-twitter-@avivainvestors-filtered.txt-shallow-20200712-210853-tko1h-00000.warc.gz 315054869 download   job
urls-archive.max.fan-twitter-@avivainvestors-filtered.txt-shallow-20200712-210853-tko1h-00000.warc.os.cdx.gz 298491 download
urls-archive.max.fan-twitter-@avivainvestors-filtered.txt-shallow-20200712-210853-tko1h-meta.warc.gz 160411 download   job
urls-archive.max.fan-twitter-@avivainvestors-filtered.txt-shallow-20200712-210853-tko1h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@avivainvestors-filtered.txt-shallow-20200712-210853-tko1h-urls.txt 115271 download
urls-archive.max.fan-twitter-@avivainvestors-filtered.txt-shallow-20200712-210853-tko1h.json 343 download   job
urls-archive.max.fan-twitter-@azagier-filtered.txt-shallow-20200712-210524-7bi8t-00000.warc.gz 104904837 download   job
urls-archive.max.fan-twitter-@azagier-filtered.txt-shallow-20200712-210524-7bi8t-00000.warc.os.cdx.gz 120493 download
urls-archive.max.fan-twitter-@azagier-filtered.txt-shallow-20200712-210524-7bi8t-meta.warc.gz 69048 download   job
urls-archive.max.fan-twitter-@azagier-filtered.txt-shallow-20200712-210524-7bi8t-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@azagier-filtered.txt-shallow-20200712-210524-7bi8t-urls.txt 77465 download
urls-archive.max.fan-twitter-@azagier-filtered.txt-shallow-20200712-210524-7bi8t.json 329 download   job
urls-archive.max.fan-twitter-@azamsahmed-filtered.txt-shallow-20200712-210522-ebskh-00000.warc.gz 47680942 download   job
urls-archive.max.fan-twitter-@azamsahmed-filtered.txt-shallow-20200712-210522-ebskh-00000.warc.os.cdx.gz 175961 download
urls-archive.max.fan-twitter-@azamsahmed-filtered.txt-shallow-20200712-210522-ebskh-meta.warc.gz 97482 download   job
urls-archive.max.fan-twitter-@azamsahmed-filtered.txt-shallow-20200712-210522-ebskh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@azamsahmed-filtered.txt-shallow-20200712-210522-ebskh-urls.txt 25404 download
urls-archive.max.fan-twitter-@azamsahmed-filtered.txt-shallow-20200712-210522-ebskh.json 335 download   job
urls-archive.max.fan-twitter-@be_gezondheid-filtered.txt-shallow-20200712-205100-4j25u-00000.warc.gz 238580083 download   job
urls-archive.max.fan-twitter-@be_gezondheid-filtered.txt-shallow-20200712-205100-4j25u-00000.warc.os.cdx.gz 450720 download
urls-archive.max.fan-twitter-@be_gezondheid-filtered.txt-shallow-20200712-205100-4j25u-meta.warc.gz 242049 download   job
urls-archive.max.fan-twitter-@be_gezondheid-filtered.txt-shallow-20200712-205100-4j25u-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@be_gezondheid-filtered.txt-shallow-20200712-205100-4j25u-urls.txt 163514 download
urls-archive.max.fan-twitter-@be_gezondheid-filtered.txt-shallow-20200712-205100-4j25u.json 341 download   job
urls-archive.max.fan-twitter-@betsyblaney-filtered.txt-shallow-20200712-204412-8by8o-00000.warc.gz 280347303 download   job
urls-archive.max.fan-twitter-@betsyblaney-filtered.txt-shallow-20200712-204412-8by8o-00000.warc.os.cdx.gz 486052 download
urls-archive.max.fan-twitter-@betsyblaney-filtered.txt-shallow-20200712-204412-8by8o-meta.warc.gz 260253 download   job
urls-archive.max.fan-twitter-@betsyblaney-filtered.txt-shallow-20200712-204412-8by8o-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@betsyblaney-filtered.txt-shallow-20200712-204412-8by8o-urls.txt 182645 download
urls-archive.max.fan-twitter-@betsyblaney-filtered.txt-shallow-20200712-204412-8by8o.json 337 download   job
urls-archive.max.fan-twitter-@bhstiftelsen-filtered.txt-shallow-20200712-203351-6jr0a-00000.warc.gz 100075869 download   job
urls-archive.max.fan-twitter-@bhstiftelsen-filtered.txt-shallow-20200712-203351-6jr0a-00000.warc.os.cdx.gz 94324 download
urls-archive.max.fan-twitter-@bhstiftelsen-filtered.txt-shallow-20200712-203351-6jr0a-meta.warc.gz 53411 download   job
urls-archive.max.fan-twitter-@bhstiftelsen-filtered.txt-shallow-20200712-203351-6jr0a-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bhstiftelsen-filtered.txt-shallow-20200712-203351-6jr0a-urls.txt 98678 download
urls-archive.max.fan-twitter-@bhstiftelsen-filtered.txt-shallow-20200712-203351-6jr0a.json 339 download   job
urls-archive.max.fan-twitter-@bilalhusseinz-filtered.txt-shallow-20200712-202916-41riw-00000.warc.gz 27881389 download   job
urls-archive.max.fan-twitter-@bilalhusseinz-filtered.txt-shallow-20200712-202916-41riw-00000.warc.os.cdx.gz 32987 download
urls-archive.max.fan-twitter-@bilalhusseinz-filtered.txt-shallow-20200712-202916-41riw-meta.warc.gz 22332 download   job
urls-archive.max.fan-twitter-@bilalhusseinz-filtered.txt-shallow-20200712-202916-41riw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bilalhusseinz-filtered.txt-shallow-20200712-202916-41riw-urls.txt 26646 download
urls-archive.max.fan-twitter-@bilalhusseinz-filtered.txt-shallow-20200712-202916-41riw.json 341 download   job
urls-archive.max.fan-twitter-@billkole-filtered.txt-shallow-20200712-201952-38iss-00000.warc.gz 1135255608 download   job
urls-archive.max.fan-twitter-@billkole-filtered.txt-shallow-20200712-201952-38iss-00000.warc.os.cdx.gz 1397528 download
urls-archive.max.fan-twitter-@billkole-filtered.txt-shallow-20200712-201952-38iss-meta.warc.gz 741351 download   job
urls-archive.max.fan-twitter-@billkole-filtered.txt-shallow-20200712-201952-38iss-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@billkole-filtered.txt-shallow-20200712-201952-38iss-urls.txt 752163 download
urls-archive.max.fan-twitter-@billkole-filtered.txt-shallow-20200712-201952-38iss.json 331 download   job
urls-archive.max.fan-twitter-@billmpennington-filtered.txt-shallow-20200712-201550-3c5l2-00000.warc.gz 110121122 download   job
urls-archive.max.fan-twitter-@billmpennington-filtered.txt-shallow-20200712-201550-3c5l2-00000.warc.os.cdx.gz 212736 download
urls-archive.max.fan-twitter-@billmpennington-filtered.txt-shallow-20200712-201550-3c5l2-meta.warc.gz 117947 download   job
urls-archive.max.fan-twitter-@billmpennington-filtered.txt-shallow-20200712-201550-3c5l2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@billmpennington-filtered.txt-shallow-20200712-201550-3c5l2-urls.txt 89695 download
urls-archive.max.fan-twitter-@billmpennington-filtered.txt-shallow-20200712-201550-3c5l2.json 345 download   job
urls-archive.max.fan-twitter-@bokatlindell-filtered.txt-shallow-20200712-201241-aonnn-00000.warc.gz 59019534 download   job
urls-archive.max.fan-twitter-@bokatlindell-filtered.txt-shallow-20200712-201241-aonnn-00000.warc.os.cdx.gz 97454 download
urls-archive.max.fan-twitter-@bokatlindell-filtered.txt-shallow-20200712-201241-aonnn-meta.warc.gz 56389 download   job
urls-archive.max.fan-twitter-@bokatlindell-filtered.txt-shallow-20200712-201241-aonnn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bokatlindell-filtered.txt-shallow-20200712-201241-aonnn-urls.txt 37055 download
urls-archive.max.fan-twitter-@bokatlindell-filtered.txt-shallow-20200712-201241-aonnn.json 339 download   job
urls-archive.max.fan-twitter-@borenbears-filtered.txt-shallow-20200712-200838-6o1jz-00000.warc.gz 689048034 download   job
urls-archive.max.fan-twitter-@borenbears-filtered.txt-shallow-20200712-200838-6o1jz-00000.warc.os.cdx.gz 1393924 download
urls-archive.max.fan-twitter-@borenbears-filtered.txt-shallow-20200712-200838-6o1jz-meta.warc.gz 745720 download   job
urls-archive.max.fan-twitter-@borenbears-filtered.txt-shallow-20200712-200838-6o1jz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@borenbears-filtered.txt-shallow-20200712-200838-6o1jz-urls.txt 409896 download
urls-archive.max.fan-twitter-@borenbears-filtered.txt-shallow-20200712-200838-6o1jz.json 335 download   job
urls-archive.max.fan-twitter-@bostonNHP-filtered.txt-shallow-20200712-200733-dzjhl-00000.warc.gz 486393547 download   job
urls-archive.max.fan-twitter-@bostonNHP-filtered.txt-shallow-20200712-200733-dzjhl-00000.warc.os.cdx.gz 580628 download
urls-archive.max.fan-twitter-@bostonNHP-filtered.txt-shallow-20200712-200733-dzjhl-meta.warc.gz 306264 download   job
urls-archive.max.fan-twitter-@bostonNHP-filtered.txt-shallow-20200712-200733-dzjhl-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bostonNHP-filtered.txt-shallow-20200712-200733-dzjhl-urls.txt 145353 download
urls-archive.max.fan-twitter-@bostonNHP-filtered.txt-shallow-20200712-200733-dzjhl.json 333 download   job
urls-archive.max.fan-twitter-@boston_techjobs-filtered.txt-shallow-20200712-200731-64wor-00000.warc.gz 1048941158 download   job
urls-archive.max.fan-twitter-@boston_techjobs-filtered.txt-shallow-20200712-200731-64wor-00000.warc.os.cdx.gz 975963 download
urls-archive.max.fan-twitter-@boston_techjobs-filtered.txt-shallow-20200712-200731-64wor-meta.warc.gz 510095 download   job
urls-archive.max.fan-twitter-@boston_techjobs-filtered.txt-shallow-20200712-200731-64wor-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@boston_techjobs-filtered.txt-shallow-20200712-200731-64wor-urls.txt 1089278 download
urls-archive.max.fan-twitter-@boston_techjobs-filtered.txt-shallow-20200712-200731-64wor.json 345 download   job
urls-archive.max.fan-twitter-@bru_vincent-filtered.txt-shallow-20200712-195426-8u3gw-00000.warc.gz 201064352 download   job
urls-archive.max.fan-twitter-@bru_vincent-filtered.txt-shallow-20200712-195426-8u3gw-00000.warc.os.cdx.gz 201404 download
urls-archive.max.fan-twitter-@bru_vincent-filtered.txt-shallow-20200712-195426-8u3gw-meta.warc.gz 110127 download   job
urls-archive.max.fan-twitter-@bru_vincent-filtered.txt-shallow-20200712-195426-8u3gw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bru_vincent-filtered.txt-shallow-20200712-195426-8u3gw-urls.txt 50684 download
urls-archive.max.fan-twitter-@bru_vincent-filtered.txt-shallow-20200712-195426-8u3gw.json 337 download   job
urls-archive.max.fan-twitter-@brunepoirson-filtered.txt-shallow-20200712-200049-an9bs-00000.warc.gz 594198789 download   job
urls-archive.max.fan-twitter-@brunepoirson-filtered.txt-shallow-20200712-200049-an9bs-00000.warc.os.cdx.gz 1220562 download
urls-archive.max.fan-twitter-@brunepoirson-filtered.txt-shallow-20200712-200049-an9bs-meta.warc.gz 648998 download   job
urls-archive.max.fan-twitter-@brunepoirson-filtered.txt-shallow-20200712-200049-an9bs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@brunepoirson-filtered.txt-shallow-20200712-200049-an9bs-urls.txt 137478 download
urls-archive.max.fan-twitter-@brunepoirson-filtered.txt-shallow-20200712-200049-an9bs.json 339 download   job
urls-archive.max.fan-twitter-@bykaren-filtered.txt-shallow-20200712-194727-ev3xf-00000.warc.gz 517781085 download   job
urls-archive.max.fan-twitter-@bykaren-filtered.txt-shallow-20200712-194727-ev3xf-00000.warc.os.cdx.gz 810662 download
urls-archive.max.fan-twitter-@bykaren-filtered.txt-shallow-20200712-194727-ev3xf-meta.warc.gz 434800 download   job
urls-archive.max.fan-twitter-@bykaren-filtered.txt-shallow-20200712-194727-ev3xf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@bykaren-filtered.txt-shallow-20200712-194727-ev3xf-urls.txt 352792 download
urls-archive.max.fan-twitter-@bykaren-filtered.txt-shallow-20200712-194727-ev3xf.json 329 download   job
urls-archive.max.fan-twitter-@carynrousseau-filtered.txt-shallow-20200712-192804-bmgt0-urls.txt 95888 download
urls-archive.max.fan-twitter-@cepal_onu-filtered.txt-shallow-20200712-190838-23wm8-00000.warc.gz 2046132564 download   job
urls-archive.max.fan-twitter-@cepal_onu-filtered.txt-shallow-20200712-190838-23wm8-00000.warc.os.cdx.gz 4314840 download
urls-archive.max.fan-twitter-@cepal_onu-filtered.txt-shallow-20200712-190838-23wm8-meta.warc.gz 2238326 download   job
urls-archive.max.fan-twitter-@cepal_onu-filtered.txt-shallow-20200712-190838-23wm8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@cepal_onu-filtered.txt-shallow-20200712-190838-23wm8-urls.txt 829734 download
urls-archive.max.fan-twitter-@cepal_onu-filtered.txt-shallow-20200712-190838-23wm8.json 333 download   job
urls-archive.max.fan-twitter-@christophclarey-filtered.txt-shallow-20200712-183120-9f9r4-00000.warc.gz 2980063467 download   job
urls-archive.max.fan-twitter-@christophclarey-filtered.txt-shallow-20200712-183120-9f9r4-00000.warc.os.cdx.gz 5613593 download
urls-archive.max.fan-twitter-@christophclarey-filtered.txt-shallow-20200712-183120-9f9r4-meta.warc.gz 2980481 download   job
urls-archive.max.fan-twitter-@christophclarey-filtered.txt-shallow-20200712-183120-9f9r4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@christophclarey-filtered.txt-shallow-20200712-183120-9f9r4-urls.txt 2236466 download
urls-archive.max.fan-twitter-@christophclarey-filtered.txt-shallow-20200712-183120-9f9r4.json 345 download   job
urls-archive.max.fan-twitter-@cio_ebooks-filtered.txt-shallow-20200712-182319-ctryw-00001.warc.gz 2440763814 download   job
urls-archive.max.fan-twitter-@cio_ebooks-filtered.txt-shallow-20200712-182319-ctryw-00001.warc.os.cdx.gz 1479244 download
urls-archive.max.fan-twitter-@cio_ebooks-filtered.txt-shallow-20200712-182319-ctryw.json 335 download   job
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc-00000.warc.gz 5368714965 download   job
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc-00000.warc.os.cdx.gz 5207465 download
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc-00001.warc.gz 460526571 download   job
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc-00001.warc.os.cdx.gz 478132 download
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc-meta.warc.gz 2992206 download   job
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc-urls.txt 1754383 download
urls-archive.max.fan-twitter-@conagua_mx-filtered.txt-shallow-20200712-174810-64bkc.json 335 download   job
urls-archive.max.fan-twitter-@danpfeiffer-filtered.txt-shallow-20200712-170916-yi6jg-00000.warc.gz 2126325338 download   job
urls-archive.max.fan-twitter-@danpfeiffer-filtered.txt-shallow-20200712-170916-yi6jg-00000.warc.os.cdx.gz 8370872 download
urls-archive.max.fan-twitter-@danpfeiffer-filtered.txt-shallow-20200712-170916-yi6jg-meta.warc.gz 4418226 download   job
urls-archive.max.fan-twitter-@danpfeiffer-filtered.txt-shallow-20200712-170916-yi6jg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@danpfeiffer-filtered.txt-shallow-20200712-170916-yi6jg-urls.txt 962084 download
urls-archive.max.fan-twitter-@danpfeiffer-filtered.txt-shallow-20200712-170916-yi6jg.json 337 download   job
urls-archive.max.fan-twitter-@dealbook-filtered.txt-shallow-20200712-170331-9bdou-00000.warc.gz 3779722051 download   job
urls-archive.max.fan-twitter-@dealbook-filtered.txt-shallow-20200712-170331-9bdou-00000.warc.os.cdx.gz 5078284 download
urls-archive.max.fan-twitter-@dealbook-filtered.txt-shallow-20200712-170331-9bdou-urls.txt 3119515 download
urls-archive.max.fan-twitter-@elizaorlins-filtered.txt-shallow-20200712-160409-a8hxc.json 337 download   job
urls-transfer.notkiska.pw-suntuubi.com-subdomains-inf-20200105-191743-9m75g-00201.warc.gz 5368913570 download   job
urls-transfer.notkiska.pw-suntuubi.com-subdomains-inf-20200105-191743-9m75g-00201.warc.os.cdx.gz 2452150 download
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00212.warc.gz 5394470283 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00212.warc.os.cdx.gz 1462941 download
urls-transfer.notkiska.pw-twitter-%23HongKong-shallow-20191011-144913-dze3i-00266.warc.gz 5369845601 download   job
urls-transfer.notkiska.pw-twitter-%23HongKong-shallow-20191011-144913-dze3i-00266.warc.os.cdx.gz 1989176 download
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00131.warc.gz 5368822745 download   job
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00131.warc.os.cdx.gz 2184337 download
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00005.warc.gz 5369189189 download   job
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00005.warc.os.cdx.gz 937893 download
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00006.warc.gz 5368852664 download   job
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00006.warc.os.cdx.gz 942335 download
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00007.warc.gz 5372346493 download   job
urls-transfer.notkiska.pw-twitter-@bonappetit-shallow-20200712-003605-9ajtk-00007.warc.os.cdx.gz 898165 download
www.bukarest.balassiintezet.hu-inf-20200707-090517-doafk-00004.warc.gz 2305729445 download   job
www.bukarest.balassiintezet.hu-inf-20200707-090517-doafk-00004.warc.os.cdx.gz 1028267 download
www.bukarest.balassiintezet.hu-inf-20200707-090517-doafk-meta.warc.gz 9963256 download   job
www.bukarest.balassiintezet.hu-inf-20200707-090517-doafk-meta.warc.os.cdx.gz 47 download
www.bukarest.balassiintezet.hu-inf-20200707-090517-doafk.json 255 download   job
www.mudcrutch.com-inf-20200710-231811-ablr0-00008.warc.gz 5373202516 download   job
www.mudcrutch.com-inf-20200710-231811-ablr0-00008.warc.os.cdx.gz 3109864 download
www.notcot.com-inf-20200709-213423-116f3-00022.warc.gz 5369436058 download   job
www.notcot.com-inf-20200709-213423-116f3-00022.warc.os.cdx.gz 3682626 download
www.refinery29.com-inf-20191002-211042-3symg-00659.warc.gz 5412324976 download   job
www.refinery29.com-inf-20191002-211042-3symg-00659.warc.os.cdx.gz 680332 download
www.refinery29.com-inf-20191002-211042-3symg-00660.warc.gz 5508121994 download   job
www.refinery29.com-inf-20191002-211042-3symg-00660.warc.os.cdx.gz 37693 download
www.seniorsnews.com.au-inf-20200528-062104-cuuvc-00113.warc.gz 5420988084 download   job
www.seniorsnews.com.au-inf-20200528-062104-cuuvc-00113.warc.os.cdx.gz 7226595 download
www.taringa.net-inf-20190927-205127-2a0h7-00702.warc.gz 5370486884 download   job
www.taringa.net-inf-20190927-205127-2a0h7-00702.warc.os.cdx.gz 3046158 download
www.turiver.com-inf-20200629-212723-6d3re-00029.warc.gz 5387800942 download   job
www.turiver.com-inf-20200629-212723-6d3re-00029.warc.os.cdx.gz 4162587 download
yepan.tistory.com-inf-20200711-025221-cq5rp-00008.warc.gz 5377008925 download   job
yepan.tistory.com-inf-20200711-025221-cq5rp-00008.warc.os.cdx.gz 3890906 download