Item archiveteam_archivebot_go_20200711000004

View on Internet Archive

Filename Size
3ds3-mys.taiko-ch.net-inf-20200710-212310-720ky-meta.warc.gz 125894 download   job
3ds3-mys.taiko-ch.net-inf-20200710-212310-720ky-meta.warc.os.cdx.gz 47 download
archiveteam_archivebot_go_20200711000004.cdx.gz 112905915 download
archiveteam_archivebot_go_20200711000004.cdx.idx 92850 download
archiveteam_archivebot_go_20200711000004_files.xml 0 download
archiveteam_archivebot_go_20200711000004_meta.sqlite 620544 download
archiveteam_archivebot_go_20200711000004_meta.xml 969 download
cdn1.ruarxive.org-inf-20200602-221412-82e21-00596.warc.gz 5371043592 download   job
cdn1.ruarxive.org-inf-20200602-221412-82e21-00596.warc.os.cdx.gz 11605 download
dslm.12371.cn-inf-20200710-200023-3bvgk-00000.warc.gz 1625449299 download   job
dslm.12371.cn-inf-20200710-200023-3bvgk-00000.warc.os.cdx.gz 1038078 download
dslm.12371.cn-inf-20200710-200023-3bvgk.json 242 download   job
fieldcraftsurvival.com-inf-20200710-224308-bjn00-00000.warc.gz 5370585265 download   job
fieldcraftsurvival.com-inf-20200710-224308-bjn00-00000.warc.os.cdx.gz 386879 download
fieldcraftsurvival.com-inf-20200710-224308-bjn00-00001.warc.gz 63706719 download   job
fieldcraftsurvival.com-inf-20200710-224308-bjn00-00001.warc.os.cdx.gz 23220 download
fieldcraftsurvival.com-inf-20200710-224308-bjn00-meta.warc.gz 294200 download   job
fieldcraftsurvival.com-inf-20200710-224308-bjn00-meta.warc.os.cdx.gz 47 download
fieldcraftsurvival.com-inf-20200710-224308-bjn00.json 252 download   job
forums.dayz.com-inf-20200603-015540-2wyve-00041.warc.gz 989871254 download   job
forums.dayz.com-inf-20200603-015540-2wyve-00041.warc.os.cdx.gz 1258163 download
forums.dayz.com-inf-20200603-015540-2wyve-meta.warc.gz 630013207 download   job
forums.dayz.com-inf-20200603-015540-2wyve-meta.warc.os.cdx.gz 47 download
forums.dayz.com-inf-20200603-015540-2wyve.json 240 download   job
forums.nextgames.com-inf-20200709-160247-15pvo-00004.warc.gz 5369253589 download   job
forums.nextgames.com-inf-20200709-160247-15pvo-00004.warc.os.cdx.gz 2771569 download
magen.whu.edu.cn-inf-20200626-142701-6m81j-00042.warc.gz 5527187754 download   job
magen.whu.edu.cn-inf-20200626-142701-6m81j-00042.warc.os.cdx.gz 1162 download
magen.whu.edu.cn-inf-20200626-142701-6m81j-00043.warc.gz 5627538980 download   job
magen.whu.edu.cn-inf-20200626-142701-6m81j-00043.warc.os.cdx.gz 1330 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00080.warc.gz 6059012233 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00080.warc.os.cdx.gz 8466 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00081.warc.gz 5371011959 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00081.warc.os.cdx.gz 65410 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00082.warc.gz 5524926820 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00082.warc.os.cdx.gz 171445 download
overlandtraining.com-inf-20200710-224958-e9jr3-00000.warc.gz 387197224 download   job
overlandtraining.com-inf-20200710-224958-e9jr3-00000.warc.os.cdx.gz 315629 download
overlandtraining.com-inf-20200710-224958-e9jr3-meta.warc.gz 186043 download   job
overlandtraining.com-inf-20200710-224958-e9jr3-meta.warc.os.cdx.gz 47 download
overlandtraining.com-inf-20200710-224958-e9jr3.json 250 download   job
podcasts.apple.com-shallow-20200710-224340-2q4so-00000.warc.gz 4688855759 download   job
podcasts.apple.com-shallow-20200710-224340-2q4so-00000.warc.os.cdx.gz 66950 download
podcasts.apple.com-shallow-20200710-224340-2q4so-meta.warc.gz 47434 download   job
podcasts.apple.com-shallow-20200710-224340-2q4so-meta.warc.os.cdx.gz 47 download
podcasts.apple.com-shallow-20200710-224340-2q4so.json 295 download   job
twitter.com-shallow-20200710-235047-6f53r-00000.warc.gz 1973187 download   job
twitter.com-shallow-20200710-235047-6f53r-00000.warc.os.cdx.gz 5709 download
twitter.com-shallow-20200710-235047-6f53r-meta.warc.gz 7035 download   job
twitter.com-shallow-20200710-235047-6f53r-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20200710-235047-6f53r.json 282 download   job
urls-archive.max.fan-twitter-@NYPD94Pct-filtered.txt-shallow-20200710-233514-bjksb-00000.warc.gz 428956362 download   job
urls-archive.max.fan-twitter-@NYPD94Pct-filtered.txt-shallow-20200710-233514-bjksb-00000.warc.os.cdx.gz 365945 download
urls-archive.max.fan-twitter-@NYPD94Pct-filtered.txt-shallow-20200710-233514-bjksb-meta.warc.gz 195982 download   job
urls-archive.max.fan-twitter-@NYPD94Pct-filtered.txt-shallow-20200710-233514-bjksb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPD94Pct-filtered.txt-shallow-20200710-233514-bjksb-urls.txt 89693 download
urls-archive.max.fan-twitter-@NYPD94Pct-filtered.txt-shallow-20200710-233514-bjksb.json 333 download   job
urls-archive.max.fan-twitter-@NYPDBklynNorth-filtered.txt-shallow-20200710-233512-2mjn6-00000.warc.gz 412116265 download   job
urls-archive.max.fan-twitter-@NYPDBklynNorth-filtered.txt-shallow-20200710-233512-2mjn6-00000.warc.os.cdx.gz 483122 download
urls-archive.max.fan-twitter-@NYPDBklynNorth-filtered.txt-shallow-20200710-233512-2mjn6-meta.warc.gz 259937 download   job
urls-archive.max.fan-twitter-@NYPDBklynNorth-filtered.txt-shallow-20200710-233512-2mjn6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDBklynNorth-filtered.txt-shallow-20200710-233512-2mjn6-urls.txt 96320 download
urls-archive.max.fan-twitter-@NYPDBklynNorth-filtered.txt-shallow-20200710-233512-2mjn6.json 343 download   job
urls-archive.max.fan-twitter-@NYPDBklynSouth-filtered.txt-shallow-20200710-233114-dnakw-00000.warc.gz 58987205 download   job
urls-archive.max.fan-twitter-@NYPDBklynSouth-filtered.txt-shallow-20200710-233114-dnakw-00000.warc.os.cdx.gz 65677 download
urls-archive.max.fan-twitter-@NYPDBklynSouth-filtered.txt-shallow-20200710-233114-dnakw-meta.warc.gz 39285 download   job
urls-archive.max.fan-twitter-@NYPDBklynSouth-filtered.txt-shallow-20200710-233114-dnakw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDBklynSouth-filtered.txt-shallow-20200710-233114-dnakw-urls.txt 10476 download
urls-archive.max.fan-twitter-@NYPDBklynSouth-filtered.txt-shallow-20200710-233114-dnakw.json 343 download   job
urls-archive.max.fan-twitter-@NYPDCadets-filtered.txt-shallow-20200710-233113-47kh4-00000.warc.gz 318437750 download   job
urls-archive.max.fan-twitter-@NYPDCadets-filtered.txt-shallow-20200710-233113-47kh4-00000.warc.os.cdx.gz 274954 download
urls-archive.max.fan-twitter-@NYPDCadets-filtered.txt-shallow-20200710-233113-47kh4-meta.warc.gz 149033 download   job
urls-archive.max.fan-twitter-@NYPDCadets-filtered.txt-shallow-20200710-233113-47kh4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDCadets-filtered.txt-shallow-20200710-233113-47kh4-urls.txt 68881 download
urls-archive.max.fan-twitter-@NYPDCadets-filtered.txt-shallow-20200710-233113-47kh4.json 335 download   job
urls-archive.max.fan-twitter-@NYPDCentralPark-filtered.txt-shallow-20200710-233001-5pym2-00000.warc.gz 467249027 download   job
urls-archive.max.fan-twitter-@NYPDCentralPark-filtered.txt-shallow-20200710-233001-5pym2-00000.warc.os.cdx.gz 576130 download
urls-archive.max.fan-twitter-@NYPDCentralPark-filtered.txt-shallow-20200710-233001-5pym2-meta.warc.gz 307136 download   job
urls-archive.max.fan-twitter-@NYPDCentralPark-filtered.txt-shallow-20200710-233001-5pym2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDCentralPark-filtered.txt-shallow-20200710-233001-5pym2-urls.txt 112371 download
urls-archive.max.fan-twitter-@NYPDCentralPark-filtered.txt-shallow-20200710-233001-5pym2.json 345 download   job
urls-archive.max.fan-twitter-@NYPDChiefPatrol-filtered.txt-shallow-20200710-232803-7gj0y-meta.warc.gz 360042 download   job
urls-archive.max.fan-twitter-@NYPDChiefPatrol-filtered.txt-shallow-20200710-232803-7gj0y-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDChiefPatrol-filtered.txt-shallow-20200710-232803-7gj0y-urls.txt 106909 download
urls-archive.max.fan-twitter-@NYPDChiefPatrol-filtered.txt-shallow-20200710-232803-7gj0y.json 345 download   job
urls-archive.max.fan-twitter-@NYPDDCA-filtered.txt-shallow-20200710-232506-41tdq-00000.warc.gz 475788456 download   job
urls-archive.max.fan-twitter-@NYPDDCA-filtered.txt-shallow-20200710-232506-41tdq-00000.warc.os.cdx.gz 437056 download
urls-archive.max.fan-twitter-@NYPDDCA-filtered.txt-shallow-20200710-232506-41tdq-meta.warc.gz 235339 download   job
urls-archive.max.fan-twitter-@NYPDDCA-filtered.txt-shallow-20200710-232506-41tdq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDDCA-filtered.txt-shallow-20200710-232506-41tdq-urls.txt 89526 download
urls-archive.max.fan-twitter-@NYPDDCA-filtered.txt-shallow-20200710-232506-41tdq.json 329 download   job
urls-archive.max.fan-twitter-@NYPDDCPI-filtered.txt-shallow-20200710-232342-cxwt2-00000.warc.gz 65170155 download   job
urls-archive.max.fan-twitter-@NYPDDCPI-filtered.txt-shallow-20200710-232342-cxwt2-00000.warc.os.cdx.gz 139745 download
urls-archive.max.fan-twitter-@NYPDDCPI-filtered.txt-shallow-20200710-232342-cxwt2-meta.warc.gz 78618 download   job
urls-archive.max.fan-twitter-@NYPDDCPI-filtered.txt-shallow-20200710-232342-cxwt2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDDCPI-filtered.txt-shallow-20200710-232342-cxwt2-urls.txt 25349 download
urls-archive.max.fan-twitter-@NYPDDV-filtered.txt-shallow-20200710-232132-dva37-00000.warc.gz 295060742 download   job
urls-archive.max.fan-twitter-@NYPDDV-filtered.txt-shallow-20200710-232132-dva37-00000.warc.os.cdx.gz 232502 download
urls-archive.max.fan-twitter-@NYPDDV-filtered.txt-shallow-20200710-232132-dva37-meta.warc.gz 125483 download   job
urls-archive.max.fan-twitter-@NYPDDV-filtered.txt-shallow-20200710-232132-dva37-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDDV-filtered.txt-shallow-20200710-232132-dva37-urls.txt 49033 download
urls-archive.max.fan-twitter-@NYPDDV-filtered.txt-shallow-20200710-232132-dva37.json 327 download   job
urls-archive.max.fan-twitter-@NYPDEquity-filtered.txt-shallow-20200710-231819-9mj7y-00000.warc.gz 54865686 download   job
urls-archive.max.fan-twitter-@NYPDEquity-filtered.txt-shallow-20200710-231819-9mj7y-00000.warc.os.cdx.gz 71578 download
urls-archive.max.fan-twitter-@NYPDEquity-filtered.txt-shallow-20200710-231819-9mj7y-meta.warc.gz 42163 download   job
urls-archive.max.fan-twitter-@NYPDEquity-filtered.txt-shallow-20200710-231819-9mj7y-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDEquity-filtered.txt-shallow-20200710-231819-9mj7y-urls.txt 10858 download
urls-archive.max.fan-twitter-@NYPDEquity-filtered.txt-shallow-20200710-231819-9mj7y.json 335 download   job
urls-archive.max.fan-twitter-@NYPDFIRSTDEP-filtered.txt-shallow-20200710-231319-978xd-00000.warc.gz 503342517 download   job
urls-archive.max.fan-twitter-@NYPDFIRSTDEP-filtered.txt-shallow-20200710-231319-978xd-00000.warc.os.cdx.gz 498650 download
urls-archive.max.fan-twitter-@NYPDFIRSTDEP-filtered.txt-shallow-20200710-231319-978xd-meta.warc.gz 265275 download   job
urls-archive.max.fan-twitter-@NYPDFIRSTDEP-filtered.txt-shallow-20200710-231319-978xd-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDFIRSTDEP-filtered.txt-shallow-20200710-231319-978xd-urls.txt 125615 download
urls-archive.max.fan-twitter-@NYPDFIRSTDEP-filtered.txt-shallow-20200710-231319-978xd.json 339 download   job
urls-archive.max.fan-twitter-@NYPDFacilities-filtered.txt-shallow-20200710-231320-6cc8b-00000.warc.gz 102833474 download   job
urls-archive.max.fan-twitter-@NYPDFacilities-filtered.txt-shallow-20200710-231320-6cc8b-00000.warc.os.cdx.gz 83721 download
urls-archive.max.fan-twitter-@NYPDFacilities-filtered.txt-shallow-20200710-231320-6cc8b-meta.warc.gz 48506 download   job
urls-archive.max.fan-twitter-@NYPDFacilities-filtered.txt-shallow-20200710-231320-6cc8b-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDFacilities-filtered.txt-shallow-20200710-231320-6cc8b-urls.txt 21302 download
urls-archive.max.fan-twitter-@NYPDFacilities-filtered.txt-shallow-20200710-231320-6cc8b.json 343 download   job
urls-archive.max.fan-twitter-@NYPDHateCrimes-filtered.txt-shallow-20200710-231004-cx64z-00000.warc.gz 24363715 download   job
urls-archive.max.fan-twitter-@NYPDHateCrimes-filtered.txt-shallow-20200710-231004-cx64z-00000.warc.os.cdx.gz 71256 download
urls-archive.max.fan-twitter-@NYPDHateCrimes-filtered.txt-shallow-20200710-231004-cx64z-meta.warc.gz 42319 download   job
urls-archive.max.fan-twitter-@NYPDHateCrimes-filtered.txt-shallow-20200710-231004-cx64z-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDHateCrimes-filtered.txt-shallow-20200710-231004-cx64z-urls.txt 7440 download
urls-archive.max.fan-twitter-@NYPDHateCrimes-filtered.txt-shallow-20200710-231004-cx64z.json 343 download   job
urls-archive.max.fan-twitter-@NYPDHighway-filtered.txt-shallow-20200710-230940-cmyay-00000.warc.gz 152128249 download   job
urls-archive.max.fan-twitter-@NYPDHighway-filtered.txt-shallow-20200710-230940-cmyay-00000.warc.os.cdx.gz 320758 download
urls-archive.max.fan-twitter-@NYPDHighway-filtered.txt-shallow-20200710-230940-cmyay-meta.warc.gz 175774 download   job
urls-archive.max.fan-twitter-@NYPDHighway-filtered.txt-shallow-20200710-230940-cmyay-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDHighway-filtered.txt-shallow-20200710-230940-cmyay-urls.txt 49425 download
urls-archive.max.fan-twitter-@NYPDHighway-filtered.txt-shallow-20200710-230940-cmyay.json 337 download   job
urls-archive.max.fan-twitter-@NYPDHousing-filtered.txt-shallow-20200710-230936-1nno3-00000.warc.gz 291230274 download   job
urls-archive.max.fan-twitter-@NYPDHousing-filtered.txt-shallow-20200710-230936-1nno3-00000.warc.os.cdx.gz 282904 download
urls-archive.max.fan-twitter-@NYPDHousing-filtered.txt-shallow-20200710-230936-1nno3-meta.warc.gz 153727 download   job
urls-archive.max.fan-twitter-@NYPDHousing-filtered.txt-shallow-20200710-230936-1nno3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDHousing-filtered.txt-shallow-20200710-230936-1nno3-urls.txt 58422 download
urls-archive.max.fan-twitter-@NYPDHousing-filtered.txt-shallow-20200710-230936-1nno3.json 337 download   job
urls-archive.max.fan-twitter-@NYPDInMemoriam-filtered.txt-shallow-20200710-230532-2xrm9-00000.warc.gz 479058739 download   job
urls-archive.max.fan-twitter-@NYPDInMemoriam-filtered.txt-shallow-20200710-230532-2xrm9-00000.warc.os.cdx.gz 663722 download
urls-archive.max.fan-twitter-@NYPDInMemoriam-filtered.txt-shallow-20200710-230532-2xrm9-meta.warc.gz 352806 download   job
urls-archive.max.fan-twitter-@NYPDInMemoriam-filtered.txt-shallow-20200710-230532-2xrm9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDInMemoriam-filtered.txt-shallow-20200710-230532-2xrm9-urls.txt 219120 download
urls-archive.max.fan-twitter-@NYPDInMemoriam-filtered.txt-shallow-20200710-230532-2xrm9.json 343 download   job
urls-archive.max.fan-twitter-@NYPDMTN-filtered.txt-shallow-20200710-230531-9fwg4-00000.warc.gz 361595857 download   job
urls-archive.max.fan-twitter-@NYPDMTN-filtered.txt-shallow-20200710-230531-9fwg4-00000.warc.os.cdx.gz 503522 download
urls-archive.max.fan-twitter-@NYPDMTN-filtered.txt-shallow-20200710-230531-9fwg4-meta.warc.gz 270348 download   job
urls-archive.max.fan-twitter-@NYPDMTN-filtered.txt-shallow-20200710-230531-9fwg4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDMTN-filtered.txt-shallow-20200710-230531-9fwg4-urls.txt 106128 download
urls-archive.max.fan-twitter-@NYPDMTN-filtered.txt-shallow-20200710-230531-9fwg4.json 329 download   job
urls-archive.max.fan-twitter-@NYPDMTS-filtered.txt-shallow-20200710-230408-5a9v6-00000.warc.gz 486842978 download   job
urls-archive.max.fan-twitter-@NYPDMTS-filtered.txt-shallow-20200710-230408-5a9v6-00000.warc.os.cdx.gz 553113 download
urls-archive.max.fan-twitter-@NYPDMTS-filtered.txt-shallow-20200710-230408-5a9v6-meta.warc.gz 296495 download   job
urls-archive.max.fan-twitter-@NYPDMTS-filtered.txt-shallow-20200710-230408-5a9v6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDMTS-filtered.txt-shallow-20200710-230408-5a9v6-urls.txt 121091 download
urls-archive.max.fan-twitter-@NYPDMTS-filtered.txt-shallow-20200710-230408-5a9v6.json 329 download   job
urls-archive.max.fan-twitter-@NYPDNieves-filtered.txt-shallow-20200710-230341-8stn5-00000.warc.gz 228369456 download   job
urls-archive.max.fan-twitter-@NYPDNieves-filtered.txt-shallow-20200710-230341-8stn5-00000.warc.os.cdx.gz 320939 download
urls-archive.max.fan-twitter-@NYPDNieves-filtered.txt-shallow-20200710-230341-8stn5-meta.warc.gz 175307 download   job
urls-archive.max.fan-twitter-@NYPDNieves-filtered.txt-shallow-20200710-230341-8stn5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDNieves-filtered.txt-shallow-20200710-230341-8stn5-urls.txt 84094 download
urls-archive.max.fan-twitter-@NYPDNieves-filtered.txt-shallow-20200710-230341-8stn5.json 335 download   job
urls-archive.max.fan-twitter-@NYPDPBBronx-filtered.txt-shallow-20200710-230341-7xu50-00000.warc.gz 519503985 download   job
urls-archive.max.fan-twitter-@NYPDPBBronx-filtered.txt-shallow-20200710-230341-7xu50-00000.warc.os.cdx.gz 440103 download
urls-archive.max.fan-twitter-@NYPDPBBronx-filtered.txt-shallow-20200710-230341-7xu50-meta.warc.gz 234988 download   job
urls-archive.max.fan-twitter-@NYPDPBBronx-filtered.txt-shallow-20200710-230341-7xu50-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPBBronx-filtered.txt-shallow-20200710-230341-7xu50-urls.txt 75767 download
urls-archive.max.fan-twitter-@NYPDPBBronx-filtered.txt-shallow-20200710-230341-7xu50.json 337 download   job
urls-archive.max.fan-twitter-@NYPDPBMN-filtered.txt-shallow-20200710-230318-ccbf4-00000.warc.gz 450705593 download   job
urls-archive.max.fan-twitter-@NYPDPBMN-filtered.txt-shallow-20200710-230318-ccbf4-00000.warc.os.cdx.gz 423266 download
urls-archive.max.fan-twitter-@NYPDPBMN-filtered.txt-shallow-20200710-230318-ccbf4-meta.warc.gz 227462 download   job
urls-archive.max.fan-twitter-@NYPDPBMN-filtered.txt-shallow-20200710-230318-ccbf4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPBMN-filtered.txt-shallow-20200710-230318-ccbf4-urls.txt 68069 download
urls-archive.max.fan-twitter-@NYPDPBMN-filtered.txt-shallow-20200710-230318-ccbf4.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPBMS-filtered.txt-shallow-20200710-230315-7s8r7-00000.warc.gz 108473327 download   job
urls-archive.max.fan-twitter-@NYPDPBMS-filtered.txt-shallow-20200710-230315-7s8r7-00000.warc.os.cdx.gz 144119 download
urls-archive.max.fan-twitter-@NYPDPBMS-filtered.txt-shallow-20200710-230315-7s8r7-meta.warc.gz 80724 download   job
urls-archive.max.fan-twitter-@NYPDPBMS-filtered.txt-shallow-20200710-230315-7s8r7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPBMS-filtered.txt-shallow-20200710-230315-7s8r7-urls.txt 23528 download
urls-archive.max.fan-twitter-@NYPDPBMS-filtered.txt-shallow-20200710-230315-7s8r7.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA1-filtered.txt-shallow-20200710-224440-nahw6-00000.warc.gz 520960429 download   job
urls-archive.max.fan-twitter-@NYPDPSA1-filtered.txt-shallow-20200710-224440-nahw6-00000.warc.os.cdx.gz 425158 download
urls-archive.max.fan-twitter-@NYPDPSA1-filtered.txt-shallow-20200710-224440-nahw6-meta.warc.gz 226877 download   job
urls-archive.max.fan-twitter-@NYPDPSA1-filtered.txt-shallow-20200710-224440-nahw6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA1-filtered.txt-shallow-20200710-224440-nahw6-urls.txt 113725 download
urls-archive.max.fan-twitter-@NYPDPSA1-filtered.txt-shallow-20200710-224440-nahw6.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA2-filtered.txt-shallow-20200710-224257-mixa9-00000.warc.gz 342467016 download   job
urls-archive.max.fan-twitter-@NYPDPSA2-filtered.txt-shallow-20200710-224257-mixa9-00000.warc.os.cdx.gz 314629 download
urls-archive.max.fan-twitter-@NYPDPSA2-filtered.txt-shallow-20200710-224257-mixa9-meta.warc.gz 169805 download   job
urls-archive.max.fan-twitter-@NYPDPSA2-filtered.txt-shallow-20200710-224257-mixa9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA2-filtered.txt-shallow-20200710-224257-mixa9-urls.txt 84998 download
urls-archive.max.fan-twitter-@NYPDPSA2-filtered.txt-shallow-20200710-224257-mixa9.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA3-filtered.txt-shallow-20200710-224257-4ehza-00000.warc.gz 294462672 download   job
urls-archive.max.fan-twitter-@NYPDPSA3-filtered.txt-shallow-20200710-224257-4ehza-00000.warc.os.cdx.gz 268836 download
urls-archive.max.fan-twitter-@NYPDPSA3-filtered.txt-shallow-20200710-224257-4ehza-meta.warc.gz 146150 download   job
urls-archive.max.fan-twitter-@NYPDPSA3-filtered.txt-shallow-20200710-224257-4ehza-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA3-filtered.txt-shallow-20200710-224257-4ehza-urls.txt 63195 download
urls-archive.max.fan-twitter-@NYPDPSA3-filtered.txt-shallow-20200710-224257-4ehza.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA4-filtered.txt-shallow-20200710-223446-hfiot-00000.warc.gz 266744314 download   job
urls-archive.max.fan-twitter-@NYPDPSA4-filtered.txt-shallow-20200710-223446-hfiot-00000.warc.os.cdx.gz 248816 download
urls-archive.max.fan-twitter-@NYPDPSA4-filtered.txt-shallow-20200710-223446-hfiot-meta.warc.gz 135264 download   job
urls-archive.max.fan-twitter-@NYPDPSA4-filtered.txt-shallow-20200710-223446-hfiot-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA4-filtered.txt-shallow-20200710-223446-hfiot-urls.txt 66070 download
urls-archive.max.fan-twitter-@NYPDPSA4-filtered.txt-shallow-20200710-223446-hfiot.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA5-filtered.txt-shallow-20200710-223443-blp3r-00000.warc.gz 273473729 download   job
urls-archive.max.fan-twitter-@NYPDPSA5-filtered.txt-shallow-20200710-223443-blp3r-00000.warc.os.cdx.gz 241469 download
urls-archive.max.fan-twitter-@NYPDPSA5-filtered.txt-shallow-20200710-223443-blp3r-meta.warc.gz 131160 download   job
urls-archive.max.fan-twitter-@NYPDPSA5-filtered.txt-shallow-20200710-223443-blp3r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA5-filtered.txt-shallow-20200710-223443-blp3r-urls.txt 52967 download
urls-archive.max.fan-twitter-@NYPDPSA5-filtered.txt-shallow-20200710-223443-blp3r.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA6-filtered.txt-shallow-20200710-223440-ezmjr-00000.warc.gz 350730370 download   job
urls-archive.max.fan-twitter-@NYPDPSA6-filtered.txt-shallow-20200710-223440-ezmjr-00000.warc.os.cdx.gz 335121 download
urls-archive.max.fan-twitter-@NYPDPSA6-filtered.txt-shallow-20200710-223440-ezmjr-meta.warc.gz 179177 download   job
urls-archive.max.fan-twitter-@NYPDPSA6-filtered.txt-shallow-20200710-223440-ezmjr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA6-filtered.txt-shallow-20200710-223440-ezmjr-urls.txt 96175 download
urls-archive.max.fan-twitter-@NYPDPSA6-filtered.txt-shallow-20200710-223440-ezmjr.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA7-filtered.txt-shallow-20200710-223415-16ns4-00000.warc.gz 689298381 download   job
urls-archive.max.fan-twitter-@NYPDPSA7-filtered.txt-shallow-20200710-223415-16ns4-00000.warc.os.cdx.gz 580615 download
urls-archive.max.fan-twitter-@NYPDPSA7-filtered.txt-shallow-20200710-223415-16ns4-meta.warc.gz 309221 download   job
urls-archive.max.fan-twitter-@NYPDPSA7-filtered.txt-shallow-20200710-223415-16ns4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA7-filtered.txt-shallow-20200710-223415-16ns4-urls.txt 142787 download
urls-archive.max.fan-twitter-@NYPDPSA7-filtered.txt-shallow-20200710-223415-16ns4.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA8-filtered.txt-shallow-20200710-221812-2eqdh-00000.warc.gz 789133794 download   job
urls-archive.max.fan-twitter-@NYPDPSA8-filtered.txt-shallow-20200710-221812-2eqdh-00000.warc.os.cdx.gz 616596 download
urls-archive.max.fan-twitter-@NYPDPSA8-filtered.txt-shallow-20200710-221812-2eqdh-meta.warc.gz 326644 download   job
urls-archive.max.fan-twitter-@NYPDPSA8-filtered.txt-shallow-20200710-221812-2eqdh-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA8-filtered.txt-shallow-20200710-221812-2eqdh-urls.txt 138825 download
urls-archive.max.fan-twitter-@NYPDPSA8-filtered.txt-shallow-20200710-221812-2eqdh.json 331 download   job
urls-archive.max.fan-twitter-@NYPDPSA9-filtered.txt-shallow-20200710-221810-rl96k-00000.warc.gz 424006727 download   job
urls-archive.max.fan-twitter-@NYPDPSA9-filtered.txt-shallow-20200710-221810-rl96k-00000.warc.os.cdx.gz 341283 download
urls-archive.max.fan-twitter-@NYPDPSA9-filtered.txt-shallow-20200710-221810-rl96k-meta.warc.gz 182422 download   job
urls-archive.max.fan-twitter-@NYPDPSA9-filtered.txt-shallow-20200710-221810-rl96k-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDPSA9-filtered.txt-shallow-20200710-221810-rl96k-urls.txt 100510 download
urls-archive.max.fan-twitter-@NYPDPSA9-filtered.txt-shallow-20200710-221810-rl96k.json 331 download   job
urls-archive.max.fan-twitter-@NYPDQueensNorth-filtered.txt-shallow-20200710-221805-3pwik-00000.warc.gz 503285903 download   job
urls-archive.max.fan-twitter-@NYPDQueensNorth-filtered.txt-shallow-20200710-221805-3pwik-00000.warc.os.cdx.gz 408718 download
urls-archive.max.fan-twitter-@NYPDQueensNorth-filtered.txt-shallow-20200710-221805-3pwik-meta.warc.gz 218719 download   job
urls-archive.max.fan-twitter-@NYPDQueensNorth-filtered.txt-shallow-20200710-221805-3pwik-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDQueensNorth-filtered.txt-shallow-20200710-221805-3pwik-urls.txt 81893 download
urls-archive.max.fan-twitter-@NYPDQueensNorth-filtered.txt-shallow-20200710-221805-3pwik.json 345 download   job
urls-archive.max.fan-twitter-@NYPDQueensSouth-filtered.txt-shallow-20200710-221755-avqcq-00000.warc.gz 551264498 download   job
urls-archive.max.fan-twitter-@NYPDQueensSouth-filtered.txt-shallow-20200710-221755-avqcq-00000.warc.os.cdx.gz 384109 download
urls-archive.max.fan-twitter-@NYPDQueensSouth-filtered.txt-shallow-20200710-221755-avqcq-meta.warc.gz 204558 download   job
urls-archive.max.fan-twitter-@NYPDQueensSouth-filtered.txt-shallow-20200710-221755-avqcq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDQueensSouth-filtered.txt-shallow-20200710-221755-avqcq-urls.txt 92205 download
urls-archive.max.fan-twitter-@NYPDQueensSouth-filtered.txt-shallow-20200710-221755-avqcq.json 345 download   job
urls-archive.max.fan-twitter-@NYPDSVU-filtered.txt-shallow-20200710-220650-28sd8-00000.warc.gz 100040196 download   job
urls-archive.max.fan-twitter-@NYPDSVU-filtered.txt-shallow-20200710-220650-28sd8-00000.warc.os.cdx.gz 180921 download
urls-archive.max.fan-twitter-@NYPDSVU-filtered.txt-shallow-20200710-220650-28sd8-meta.warc.gz 101261 download   job
urls-archive.max.fan-twitter-@NYPDSVU-filtered.txt-shallow-20200710-220650-28sd8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDSVU-filtered.txt-shallow-20200710-220650-28sd8-urls.txt 35278 download
urls-archive.max.fan-twitter-@NYPDSVU-filtered.txt-shallow-20200710-220650-28sd8.json 329 download   job
urls-archive.max.fan-twitter-@NYPDSchools-filtered.txt-shallow-20200710-221746-cdww9-00000.warc.gz 937182363 download   job
urls-archive.max.fan-twitter-@NYPDSchools-filtered.txt-shallow-20200710-221746-cdww9-00000.warc.os.cdx.gz 703478 download
urls-archive.max.fan-twitter-@NYPDSchools-filtered.txt-shallow-20200710-221746-cdww9-meta.warc.gz 370003 download   job
urls-archive.max.fan-twitter-@NYPDSchools-filtered.txt-shallow-20200710-221746-cdww9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDSchools-filtered.txt-shallow-20200710-221746-cdww9-urls.txt 178450 download
urls-archive.max.fan-twitter-@NYPDSchools-filtered.txt-shallow-20200710-221746-cdww9.json 337 download   job
urls-archive.max.fan-twitter-@NYPDShea-filtered.txt-shallow-20200710-221743-8vm3u-00000.warc.gz 628010829 download   job
urls-archive.max.fan-twitter-@NYPDShea-filtered.txt-shallow-20200710-221743-8vm3u-00000.warc.os.cdx.gz 1116604 download
urls-archive.max.fan-twitter-@NYPDShea-filtered.txt-shallow-20200710-221743-8vm3u-urls.txt 101851 download
urls-archive.max.fan-twitter-@NYPDShea-filtered.txt-shallow-20200710-221743-8vm3u.json 331 download   job
urls-archive.max.fan-twitter-@NYPDSpecialops-filtered.txt-shallow-20200710-220924-1cjz3-00000.warc.gz 904860706 download   job
urls-archive.max.fan-twitter-@NYPDSpecialops-filtered.txt-shallow-20200710-220924-1cjz3-00000.warc.os.cdx.gz 1094889 download
urls-archive.max.fan-twitter-@NYPDSpecialops-filtered.txt-shallow-20200710-220924-1cjz3-meta.warc.gz 577259 download   job
urls-archive.max.fan-twitter-@NYPDSpecialops-filtered.txt-shallow-20200710-220924-1cjz3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDSpecialops-filtered.txt-shallow-20200710-220924-1cjz3-urls.txt 134543 download
urls-archive.max.fan-twitter-@NYPDSpecialops-filtered.txt-shallow-20200710-220924-1cjz3.json 343 download   job
urls-archive.max.fan-twitter-@NYPDSport-filtered.txt-shallow-20200710-220924-8rn0e-00000.warc.gz 148932066 download   job
urls-archive.max.fan-twitter-@NYPDSport-filtered.txt-shallow-20200710-220924-8rn0e-00000.warc.os.cdx.gz 118845 download
urls-archive.max.fan-twitter-@NYPDSport-filtered.txt-shallow-20200710-220924-8rn0e-meta.warc.gz 67331 download   job
urls-archive.max.fan-twitter-@NYPDSport-filtered.txt-shallow-20200710-220924-8rn0e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDSport-filtered.txt-shallow-20200710-220924-8rn0e-urls.txt 19262 download
urls-archive.max.fan-twitter-@NYPDSport-filtered.txt-shallow-20200710-220924-8rn0e.json 333 download   job
urls-archive.max.fan-twitter-@NYPDTimesSquare-filtered.txt-shallow-20200710-220642-91776-00000.warc.gz 99640600 download   job
urls-archive.max.fan-twitter-@NYPDTimesSquare-filtered.txt-shallow-20200710-220642-91776-00000.warc.os.cdx.gz 157485 download
urls-archive.max.fan-twitter-@NYPDTimesSquare-filtered.txt-shallow-20200710-220642-91776-meta.warc.gz 88036 download   job
urls-archive.max.fan-twitter-@NYPDTimesSquare-filtered.txt-shallow-20200710-220642-91776-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDTimesSquare-filtered.txt-shallow-20200710-220642-91776-urls.txt 27693 download
urls-archive.max.fan-twitter-@NYPDTimesSquare-filtered.txt-shallow-20200710-220642-91776.json 345 download   job
urls-archive.max.fan-twitter-@NYPDTips-filtered.txt-shallow-20200710-220423-8fy9a-00000.warc.gz 627249826 download   job
urls-archive.max.fan-twitter-@NYPDTips-filtered.txt-shallow-20200710-220423-8fy9a-00000.warc.os.cdx.gz 925415 download
urls-archive.max.fan-twitter-@NYPDTips-filtered.txt-shallow-20200710-220423-8fy9a-meta.warc.gz 494879 download   job
urls-archive.max.fan-twitter-@NYPDTips-filtered.txt-shallow-20200710-220423-8fy9a-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDTips-filtered.txt-shallow-20200710-220423-8fy9a-urls.txt 239624 download
urls-archive.max.fan-twitter-@NYPDTips-filtered.txt-shallow-20200710-220423-8fy9a.json 331 download   job
urls-archive.max.fan-twitter-@NYPDTraining-filtered.txt-shallow-20200710-220423-4wizn-00000.warc.gz 81778782 download   job
urls-archive.max.fan-twitter-@NYPDTraining-filtered.txt-shallow-20200710-220423-4wizn-00000.warc.os.cdx.gz 91513 download
urls-archive.max.fan-twitter-@NYPDTraining-filtered.txt-shallow-20200710-220423-4wizn-meta.warc.gz 52673 download   job
urls-archive.max.fan-twitter-@NYPDTraining-filtered.txt-shallow-20200710-220423-4wizn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDTraining-filtered.txt-shallow-20200710-220423-4wizn-urls.txt 17925 download
urls-archive.max.fan-twitter-@NYPDTraining-filtered.txt-shallow-20200710-220423-4wizn.json 339 download   job
urls-archive.max.fan-twitter-@NYPDTransit-filtered.txt-shallow-20200710-215915-50fpc-00000.warc.gz 1264889332 download   job
urls-archive.max.fan-twitter-@NYPDTransit-filtered.txt-shallow-20200710-215915-50fpc-00000.warc.os.cdx.gz 1423463 download
urls-archive.max.fan-twitter-@NYPDTransit-filtered.txt-shallow-20200710-215915-50fpc-meta.warc.gz 741815 download   job
urls-archive.max.fan-twitter-@NYPDTransit-filtered.txt-shallow-20200710-215915-50fpc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDTransit-filtered.txt-shallow-20200710-215915-50fpc-urls.txt 340055 download
urls-archive.max.fan-twitter-@NYPDTransit-filtered.txt-shallow-20200710-215915-50fpc.json 337 download   job
urls-archive.max.fan-twitter-@NYPDTransport-filtered.txt-shallow-20200710-215906-4w92j-00000.warc.gz 414111958 download   job
urls-archive.max.fan-twitter-@NYPDTransport-filtered.txt-shallow-20200710-215906-4w92j-00000.warc.os.cdx.gz 405482 download
urls-archive.max.fan-twitter-@NYPDTransport-filtered.txt-shallow-20200710-215906-4w92j-meta.warc.gz 216832 download   job
urls-archive.max.fan-twitter-@NYPDTransport-filtered.txt-shallow-20200710-215906-4w92j-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDTransport-filtered.txt-shallow-20200710-215906-4w92j-urls.txt 81331 download
urls-archive.max.fan-twitter-@NYPDTransport-filtered.txt-shallow-20200710-215906-4w92j.json 341 download   job
urls-archive.max.fan-twitter-@NYPDchaplains-filtered.txt-shallow-20200710-232831-b8cow-00000.warc.gz 442349933 download   job
urls-archive.max.fan-twitter-@NYPDchaplains-filtered.txt-shallow-20200710-232831-b8cow-00000.warc.os.cdx.gz 524204 download
urls-archive.max.fan-twitter-@NYPDchaplains-filtered.txt-shallow-20200710-232831-b8cow-meta.warc.gz 280545 download   job
urls-archive.max.fan-twitter-@NYPDchaplains-filtered.txt-shallow-20200710-232831-b8cow-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDchaplains-filtered.txt-shallow-20200710-232831-b8cow-urls.txt 103749 download
urls-archive.max.fan-twitter-@NYPDchaplains-filtered.txt-shallow-20200710-232831-b8cow.json 341 download   job
urls-archive.max.fan-twitter-@NYPDstatenIslnd-filtered.txt-shallow-20200710-220921-d8fpx-00000.warc.gz 115928071 download   job
urls-archive.max.fan-twitter-@NYPDstatenIslnd-filtered.txt-shallow-20200710-220921-d8fpx-00000.warc.os.cdx.gz 125920 download
urls-archive.max.fan-twitter-@NYPDstatenIslnd-filtered.txt-shallow-20200710-220921-d8fpx-meta.warc.gz 71275 download   job
urls-archive.max.fan-twitter-@NYPDstatenIslnd-filtered.txt-shallow-20200710-220921-d8fpx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYPDstatenIslnd-filtered.txt-shallow-20200710-220921-d8fpx-urls.txt 23758 download
urls-archive.max.fan-twitter-@NYPDstatenIslnd-filtered.txt-shallow-20200710-220921-d8fpx.json 345 download   job
urls-archive.max.fan-twitter-@NYT4thDownBot-filtered.txt-shallow-20200710-215700-5vdbm-00000.warc.gz 281660670 download   job
urls-archive.max.fan-twitter-@NYT4thDownBot-filtered.txt-shallow-20200710-215700-5vdbm-00000.warc.os.cdx.gz 696649 download
urls-archive.max.fan-twitter-@NYT4thDownBot-filtered.txt-shallow-20200710-215700-5vdbm-meta.warc.gz 372352 download   job
urls-archive.max.fan-twitter-@NYT4thDownBot-filtered.txt-shallow-20200710-215700-5vdbm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYT4thDownBot-filtered.txt-shallow-20200710-215700-5vdbm-urls.txt 234840 download
urls-archive.max.fan-twitter-@NYT4thDownBot-filtered.txt-shallow-20200710-215700-5vdbm.json 341 download   job
urls-archive.max.fan-twitter-@NYTArchives-filtered.txt-shallow-20200710-215655-bpgyg-00000.warc.gz 1743996867 download   job
urls-archive.max.fan-twitter-@NYTArchives-filtered.txt-shallow-20200710-215655-bpgyg-00000.warc.os.cdx.gz 3037850 download
urls-archive.max.fan-twitter-@NYTArchives-filtered.txt-shallow-20200710-215655-bpgyg-meta.warc.gz 1564399 download   job
urls-archive.max.fan-twitter-@NYTArchives-filtered.txt-shallow-20200710-215655-bpgyg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYTArchives-filtered.txt-shallow-20200710-215655-bpgyg-urls.txt 409821 download
urls-archive.max.fan-twitter-@NYTArchives-filtered.txt-shallow-20200710-215655-bpgyg.json 337 download   job
urls-archive.max.fan-twitter-@NYTObits-filtered.txt-shallow-20200710-211552-2uhg1-00000.warc.gz 3055564445 download   job
urls-archive.max.fan-twitter-@NYTObits-filtered.txt-shallow-20200710-211552-2uhg1-00000.warc.os.cdx.gz 4583423 download
urls-archive.max.fan-twitter-@NYTObits-filtered.txt-shallow-20200710-211552-2uhg1-meta.warc.gz 2373781 download   job
urls-archive.max.fan-twitter-@NYTObits-filtered.txt-shallow-20200710-211552-2uhg1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYTObits-filtered.txt-shallow-20200710-211552-2uhg1-urls.txt 2293066 download
urls-archive.max.fan-twitter-@NYTimesAtWar-filtered.txt-shallow-20200710-214515-c1a2h-00000.warc.gz 887378706 download   job
urls-archive.max.fan-twitter-@NYTimesAtWar-filtered.txt-shallow-20200710-214515-c1a2h-00000.warc.os.cdx.gz 1843005 download
urls-archive.max.fan-twitter-@NYTimesAtWar-filtered.txt-shallow-20200710-214515-c1a2h-meta.warc.gz 954750 download   job
urls-archive.max.fan-twitter-@NYTimesAtWar-filtered.txt-shallow-20200710-214515-c1a2h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYTimesAtWar-filtered.txt-shallow-20200710-214515-c1a2h-urls.txt 669310 download
urls-archive.max.fan-twitter-@NYTimesAtWar-filtered.txt-shallow-20200710-214515-c1a2h.json 339 download   job
urls-archive.max.fan-twitter-@NYTnickc-filtered.txt-shallow-20200710-211554-88b58-00000.warc.gz 2102457213 download   job
urls-archive.max.fan-twitter-@NYTnickc-filtered.txt-shallow-20200710-211554-88b58-00000.warc.os.cdx.gz 4092752 download
urls-archive.max.fan-twitter-@NYTnickc-filtered.txt-shallow-20200710-211554-88b58-meta.warc.gz 2145808 download   job
urls-archive.max.fan-twitter-@NYTnickc-filtered.txt-shallow-20200710-211554-88b58-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NYTnickc-filtered.txt-shallow-20200710-211554-88b58-urls.txt 1150338 download
urls-archive.max.fan-twitter-@NYTnickc-filtered.txt-shallow-20200710-211554-88b58.json 331 download   job
urls-archive.max.fan-twitter-@NZUN-filtered.txt-shallow-20200710-211315-blgwy-urls.txt 210821 download
urls-archive.max.fan-twitter-@OCHAIraq-filtered.txt-shallow-20200710-210632-du947-meta.warc.gz 428290 download   job
urls-archive.max.fan-twitter-@OCHAIraq-filtered.txt-shallow-20200710-210632-du947-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OCHAPhilippines-filtered.txt-shallow-20200710-210448-2sgq2-meta.warc.gz 187078 download   job
urls-archive.max.fan-twitter-@OCHAPhilippines-filtered.txt-shallow-20200710-210448-2sgq2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OCHAYemen-filtered.txt-shallow-20200710-205445-6hek0-00000.warc.gz 470594500 download   job
urls-archive.max.fan-twitter-@OCHAYemen-filtered.txt-shallow-20200710-205445-6hek0-00000.warc.os.cdx.gz 721199 download
urls-archive.max.fan-twitter-@OCHA_CAR-filtered.txt-shallow-20200710-211105-7xc5t-00000.warc.gz 336304154 download   job
urls-archive.max.fan-twitter-@OCHA_CAR-filtered.txt-shallow-20200710-211105-7xc5t-00000.warc.os.cdx.gz 363797 download
urls-archive.max.fan-twitter-@OCHA_Mali-filtered.txt-shallow-20200710-210632-eqc05-meta.warc.gz 114090 download   job
urls-archive.max.fan-twitter-@OCHA_Mali-filtered.txt-shallow-20200710-210632-eqc05-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OCHA_Mali-filtered.txt-shallow-20200710-210632-eqc05.json 333 download   job
urls-archive.max.fan-twitter-@OHCHR_MENA-filtered.txt-shallow-20200710-205302-en4j1-00000.warc.gz 144158844 download   job
urls-archive.max.fan-twitter-@OHCHR_MENA-filtered.txt-shallow-20200710-205302-en4j1-00000.warc.os.cdx.gz 248723 download
urls-archive.max.fan-twitter-@OHCHR_MENA-filtered.txt-shallow-20200710-205302-en4j1-meta.warc.gz 135842 download   job
urls-archive.max.fan-twitter-@OHCHR_MENA-filtered.txt-shallow-20200710-205302-en4j1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OHCHR_MENA-filtered.txt-shallow-20200710-205302-en4j1-urls.txt 50063 download
urls-archive.max.fan-twitter-@OHCHR_MENA-filtered.txt-shallow-20200710-205302-en4j1.json 335 download   job
urls-archive.max.fan-twitter-@OITinfo-filtered.txt-shallow-20200710-205258-8tnab-00000.warc.gz 345897838 download   job
urls-archive.max.fan-twitter-@OITinfo-filtered.txt-shallow-20200710-205258-8tnab-00000.warc.os.cdx.gz 408898 download
urls-archive.max.fan-twitter-@OITinfo-filtered.txt-shallow-20200710-205258-8tnab-meta.warc.gz 221299 download   job
urls-archive.max.fan-twitter-@OITinfo-filtered.txt-shallow-20200710-205258-8tnab-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OITinfo-filtered.txt-shallow-20200710-205258-8tnab-urls.txt 135029 download
urls-archive.max.fan-twitter-@ONEinAfrica-filtered.txt-shallow-20200710-204736-ez8ql-urls.txt 281866 download
urls-archive.max.fan-twitter-@ONEinEU-filtered.txt-shallow-20200710-203955-2j2rq-urls.txt 451120 download
urls-archive.max.fan-twitter-@ONEinEU-filtered.txt-shallow-20200710-203955-2j2rq.json 329 download   job
urls-archive.max.fan-twitter-@ONEintheUK-filtered.txt-shallow-20200710-203951-71u1q.json 335 download   job
urls-archive.max.fan-twitter-@ONUMX-filtered.txt-shallow-20200710-202746-ad5zg-00000.warc.gz 1183818360 download   job
urls-archive.max.fan-twitter-@ONUMX-filtered.txt-shallow-20200710-202746-ad5zg-00000.warc.os.cdx.gz 2205098 download
urls-archive.max.fan-twitter-@ONUMujeresMX-filtered.txt-shallow-20200710-203332-c7uxz-00000.warc.gz 2283652474 download   job
urls-archive.max.fan-twitter-@ONUMujeresMX-filtered.txt-shallow-20200710-203332-c7uxz-00000.warc.os.cdx.gz 3945910 download
urls-archive.max.fan-twitter-@ONUMujeresMX-filtered.txt-shallow-20200710-203332-c7uxz-meta.warc.gz 2050815 download   job
urls-archive.max.fan-twitter-@ONUMujeresMX-filtered.txt-shallow-20200710-203332-c7uxz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ONUMujeresMX-filtered.txt-shallow-20200710-203332-c7uxz-urls.txt 860006 download
urls-archive.max.fan-twitter-@ONUMujeresMX-filtered.txt-shallow-20200710-203332-c7uxz.json 339 download   job
urls-archive.max.fan-twitter-@ONUVENuevaYork-filtered.txt-shallow-20200710-202130-5xht4-meta.warc.gz 826255 download   job
urls-archive.max.fan-twitter-@ONUVENuevaYork-filtered.txt-shallow-20200710-202130-5xht4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ONUVENuevaYork-filtered.txt-shallow-20200710-202130-5xht4.json 343 download   job
urls-archive.max.fan-twitter-@ONUecuador-filtered.txt-shallow-20200710-203803-f04nt-meta.warc.gz 322330 download   job
urls-archive.max.fan-twitter-@ONUecuador-filtered.txt-shallow-20200710-203803-f04nt-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OlympicNP-filtered.txt-shallow-20200710-204739-7i8j6-meta.warc.gz 164864 download   job
urls-archive.max.fan-twitter-@OlympicNP-filtered.txt-shallow-20200710-204739-7i8j6-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@OlympicNP-filtered.txt-shallow-20200710-204739-7i8j6.json 333 download   job
urls-archive.max.fan-twitter-@PCJalisco-filtered.txt-shallow-20200710-190418-4m0ym-meta.warc.gz 1410317 download   job
urls-archive.max.fan-twitter-@PCJalisco-filtered.txt-shallow-20200710-190418-4m0ym-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PCJalisco-filtered.txt-shallow-20200710-190418-4m0ym.json 333 download   job
urls-archive.max.fan-twitter-@PC_Estatal-filtered.txt-shallow-20200710-190637-6ka4y-00000.warc.gz 5360857637 download   job
urls-archive.max.fan-twitter-@PC_Estatal-filtered.txt-shallow-20200710-190637-6ka4y-00000.warc.os.cdx.gz 3838070 download
urls-archive.max.fan-twitter-@PC_Estatal-filtered.txt-shallow-20200710-190637-6ka4y-meta.warc.gz 1986052 download   job
urls-archive.max.fan-twitter-@PC_Estatal-filtered.txt-shallow-20200710-190637-6ka4y-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@PC_Estatal-filtered.txt-shallow-20200710-190637-6ka4y-urls.txt 1191757 download
urls-archive.max.fan-twitter-@PC_Estatal-filtered.txt-shallow-20200710-190637-6ka4y.json 335 download   job
urls-archive.max.fan-twitter-@Plaid_Cymru-filtered.txt-shallow-20200710-183034-bmqaz-00000.warc.gz 3308463919 download   job
urls-archive.max.fan-twitter-@Plaid_Cymru-filtered.txt-shallow-20200710-183034-bmqaz-00000.warc.os.cdx.gz 3879372 download
urls-archive.max.fan-twitter-@Plaid_Cymru-filtered.txt-shallow-20200710-183034-bmqaz-meta.warc.gz 2081303 download   job
urls-archive.max.fan-twitter-@Plaid_Cymru-filtered.txt-shallow-20200710-183034-bmqaz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Plaid_Cymru-filtered.txt-shallow-20200710-183034-bmqaz-urls.txt 1544406 download
urls-archive.max.fan-twitter-@Plaid_Cymru-filtered.txt-shallow-20200710-183034-bmqaz.json 337 download   job
urls-archive.max.fan-twitter-@nytdavidbrooks-filtered.txt-shallow-20200710-215259-d4rqr-00000.warc.gz 181292051 download   job
urls-archive.max.fan-twitter-@nytdavidbrooks-filtered.txt-shallow-20200710-215259-d4rqr-00000.warc.os.cdx.gz 1015630 download
urls-archive.max.fan-twitter-@nytdavidbrooks-filtered.txt-shallow-20200710-215259-d4rqr-meta.warc.gz 532351 download   job
urls-archive.max.fan-twitter-@nytdavidbrooks-filtered.txt-shallow-20200710-215259-d4rqr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nytdavidbrooks-filtered.txt-shallow-20200710-215259-d4rqr-urls.txt 69801 download
urls-archive.max.fan-twitter-@nytdavidbrooks-filtered.txt-shallow-20200710-215259-d4rqr.json 343 download   job
urls-archive.max.fan-twitter-@nytgraphics-filtered.txt-shallow-20200710-215037-dg1le-00000.warc.gz 991739766 download   job
urls-archive.max.fan-twitter-@nytgraphics-filtered.txt-shallow-20200710-215037-dg1le-00000.warc.os.cdx.gz 2835127 download
urls-archive.max.fan-twitter-@nytgraphics-filtered.txt-shallow-20200710-215037-dg1le-meta.warc.gz 1474272 download   job
urls-archive.max.fan-twitter-@nytgraphics-filtered.txt-shallow-20200710-215037-dg1le-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nytgraphics-filtered.txt-shallow-20200710-215037-dg1le-urls.txt 317290 download
urls-archive.max.fan-twitter-@nytgraphics-filtered.txt-shallow-20200710-215037-dg1le.json 337 download   job
urls-archive.max.fan-twitter-@nytimesphoto-filtered.txt-shallow-20200710-213819-185jr-00000.warc.gz 4786033748 download   job
urls-archive.max.fan-twitter-@nytimesphoto-filtered.txt-shallow-20200710-213819-185jr-00000.warc.os.cdx.gz 7828288 download
urls-archive.max.fan-twitter-@nytimesphoto-filtered.txt-shallow-20200710-213819-185jr-meta.warc.gz 4077487 download   job
urls-archive.max.fan-twitter-@nytimesphoto-filtered.txt-shallow-20200710-213819-185jr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nytimesphoto-filtered.txt-shallow-20200710-213819-185jr-urls.txt 1306190 download
urls-archive.max.fan-twitter-@nytimesphoto-filtered.txt-shallow-20200710-213819-185jr.json 339 download   job
urls-archive.max.fan-twitter-@nytimesvows-filtered.txt-shallow-20200710-213724-346up-00000.warc.gz 2005419528 download   job
urls-archive.max.fan-twitter-@nytimesvows-filtered.txt-shallow-20200710-213724-346up-00000.warc.os.cdx.gz 1974873 download
urls-archive.max.fan-twitter-@nytimesvows-filtered.txt-shallow-20200710-213724-346up-meta.warc.gz 1026432 download   job
urls-archive.max.fan-twitter-@nytimesvows-filtered.txt-shallow-20200710-213724-346up-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nytimesvows-filtered.txt-shallow-20200710-213724-346up-urls.txt 968756 download
urls-archive.max.fan-twitter-@nytimesvows-filtered.txt-shallow-20200710-213724-346up.json 337 download   job
urls-archive.max.fan-twitter-@nytimeswell-filtered.txt-shallow-20200710-213724-3ca4s-00000.warc.gz 1410917276 download   job
urls-archive.max.fan-twitter-@nytimeswell-filtered.txt-shallow-20200710-213724-3ca4s-00000.warc.os.cdx.gz 4137159 download
urls-archive.max.fan-twitter-@nytimeswell-filtered.txt-shallow-20200710-213724-3ca4s-meta.warc.gz 2173056 download   job
urls-archive.max.fan-twitter-@nytimeswell-filtered.txt-shallow-20200710-213724-3ca4s-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nytimeswell-filtered.txt-shallow-20200710-213724-3ca4s-urls.txt 970595 download
urls-archive.max.fan-twitter-@nytimeswell-filtered.txt-shallow-20200710-213724-3ca4s.json 337 download   job
urls-archive.max.fan-twitter-@nytmay-filtered.txt-shallow-20200710-211951-4s21b-00000.warc.gz 147947292 download   job
urls-archive.max.fan-twitter-@nytmay-filtered.txt-shallow-20200710-211951-4s21b-00000.warc.os.cdx.gz 299376 download
urls-archive.max.fan-twitter-@nytmay-filtered.txt-shallow-20200710-211951-4s21b.json 327 download   job
urls-archive.max.fan-twitter-@nytstevek-filtered.txt-shallow-20200710-211434-a8yvl.json 333 download   job
urls-archive.max.fan-twitter-@nytvideo-filtered.txt-shallow-20200710-211432-4757s-00000.warc.gz 2574697016 download   job
urls-archive.max.fan-twitter-@nytvideo-filtered.txt-shallow-20200710-211432-4757s-00000.warc.os.cdx.gz 3768117 download
urls-archive.max.fan-twitter-@nytvideo-filtered.txt-shallow-20200710-211432-4757s-meta.warc.gz 1984433 download   job
urls-archive.max.fan-twitter-@nytvideo-filtered.txt-shallow-20200710-211432-4757s-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@nytvideo-filtered.txt-shallow-20200710-211432-4757s-urls.txt 851860 download
urls-archive.max.fan-twitter-@nytvideo-filtered.txt-shallow-20200710-211432-4757s.json 331 download   job
urls-archive.max.fan-twitter-@obalilty-filtered.txt-shallow-20200710-211313-atyst.json 331 download   job
urls-archive.max.fan-twitter-@ochagulf-filtered.txt-shallow-20200710-210634-7ng0f-00000.warc.gz 248378899 download   job
urls-archive.max.fan-twitter-@ochagulf-filtered.txt-shallow-20200710-210634-7ng0f-00000.warc.os.cdx.gz 87595 download
urls-archive.max.fan-twitter-@ochagulf-filtered.txt-shallow-20200710-210634-7ng0f-meta.warc.gz 51017 download   job
urls-archive.max.fan-twitter-@ochagulf-filtered.txt-shallow-20200710-210634-7ng0f-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ochamyanmar-filtered.txt-shallow-20200710-210629-eweue.json 337 download   job
urls-archive.max.fan-twitter-@ochapolicy-filtered.txt-shallow-20200710-210444-bq61x-meta.warc.gz 88358 download   job
urls-archive.max.fan-twitter-@ochapolicy-filtered.txt-shallow-20200710-210444-bq61x-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ocharomena-filtered.txt-shallow-20200710-205447-7idw3-urls.txt 57466 download
urls-archive.max.fan-twitter-@onumujeresEcu-filtered.txt-shallow-20200710-203336-114ty-urls.txt 289374 download
urls-archive.max.fan-twitter-@onumujeresEcu-filtered.txt-shallow-20200710-203336-114ty.json 341 download   job
urls-archive.max.fan-twitter-@panphil-filtered.txt-shallow-20200710-201227-batbp-meta.warc.gz 565210 download   job
urls-archive.max.fan-twitter-@panphil-filtered.txt-shallow-20200710-201227-batbp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@panphil-filtered.txt-shallow-20200710-201227-batbp.json 329 download   job
urls-archive.max.fan-twitter-@parlamentoUE-filtered.txt-shallow-20200710-200339-3owdu.json 339 download   job
urls-archive.max.fan-twitter-@pxwhittle-filtered.txt-shallow-20200710-173527-928jb-00000.warc.gz 5132417341 download   job
urls-archive.max.fan-twitter-@pxwhittle-filtered.txt-shallow-20200710-173527-928jb-00000.warc.os.cdx.gz 5501093 download
urls-archive.max.fan-twitter-@realDonaldTrump-filtered.txt-shallow-20200710-171253-7bo9b-00001.warc.gz 4123427144 download   job
urls-archive.max.fan-twitter-@realDonaldTrump-filtered.txt-shallow-20200710-171253-7bo9b-00001.warc.os.cdx.gz 18606543 download
urls-archive.max.fan-twitter-@realDonaldTrump-filtered.txt-shallow-20200710-171253-7bo9b-meta.warc.gz 14419922 download   job
urls-archive.max.fan-twitter-@realDonaldTrump-filtered.txt-shallow-20200710-171253-7bo9b-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@realDonaldTrump-filtered.txt-shallow-20200710-171253-7bo9b-urls.txt 2703800 download
urls-archive.max.fan-twitter-@realDonaldTrump-filtered.txt-shallow-20200710-171253-7bo9b.json 345 download   job
urls-transfer.notkiska.pw-facebook-@fieldcraftsurvival-shallow-20200710-225005-as74e-00000.warc.gz 5396632415 download   job
urls-transfer.notkiska.pw-facebook-@fieldcraftsurvival-shallow-20200710-225005-as74e-00000.warc.os.cdx.gz 352910 download
urls-transfer.notkiska.pw-facebook-@overlandtrainingusa-shallow-20200710-224922-1j3jf-00000.warc.gz 115164030 download   job
urls-transfer.notkiska.pw-facebook-@overlandtrainingusa-shallow-20200710-224922-1j3jf-00000.warc.os.cdx.gz 80468 download
urls-transfer.notkiska.pw-facebook-@overlandtrainingusa-shallow-20200710-224922-1j3jf-meta.warc.gz 49919 download   job
urls-transfer.notkiska.pw-facebook-@overlandtrainingusa-shallow-20200710-224922-1j3jf-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-facebook-@overlandtrainingusa-shallow-20200710-224922-1j3jf-urls.txt 1722 download
urls-transfer.notkiska.pw-facebook-@overlandtrainingusa-shallow-20200710-224922-1j3jf.json 352 download   job
urls-transfer.notkiska.pw-twitter-%23HongKong-shallow-20191011-144913-dze3i-00262.warc.gz 5368824375 download   job
urls-transfer.notkiska.pw-twitter-%23HongKong-shallow-20191011-144913-dze3i-00262.warc.os.cdx.gz 1021577 download
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00105.warc.gz 5931581688 download   job
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00105.warc.os.cdx.gz 665897 download
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5-00018.warc.gz 5372388927 download   job
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5-00018.warc.os.cdx.gz 4723183 download
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5-00019.warc.gz 2246120276 download   job
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5-00019.warc.os.cdx.gz 1492868 download
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5-meta.warc.gz 30841925 download   job
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5-urls.txt 4448773 download
urls-transfer.notkiska.pw-twitter-%23schoolsreopening-shallow-20200709-165902-2kyn5.json 348 download   job
urls-transfer.notkiska.pw-twitter-@UnicornPlushy-shallow-20200710-165141-dm2x5-00001.warc.gz 2778050734 download   job
urls-transfer.notkiska.pw-twitter-@UnicornPlushy-shallow-20200710-165141-dm2x5-00001.warc.os.cdx.gz 1319361 download
urls-transfer.notkiska.pw-twitter-@UnicornPlushy-shallow-20200710-165141-dm2x5-meta.warc.gz 3066017 download   job
urls-transfer.notkiska.pw-twitter-@UnicornPlushy-shallow-20200710-165141-dm2x5-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@UnicornPlushy-shallow-20200710-165141-dm2x5-urls.txt 1527390 download
urls-transfer.notkiska.pw-twitter-@UnicornPlushy-shallow-20200710-165141-dm2x5.json 338 download   job
urls-transfer.notkiska.pw-twitter-@boomerangfu-shallow-20200710-233510-1pd0r-00000.warc.gz 161567174 download   job
urls-transfer.notkiska.pw-twitter-@boomerangfu-shallow-20200710-233510-1pd0r-00000.warc.os.cdx.gz 282778 download
urls-transfer.notkiska.pw-twitter-@boomerangfu-shallow-20200710-233510-1pd0r-meta.warc.gz 168519 download   job
urls-transfer.notkiska.pw-twitter-@boomerangfu-shallow-20200710-233510-1pd0r-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@boomerangfu-shallow-20200710-233510-1pd0r-urls.txt 17959 download
urls-transfer.notkiska.pw-twitter-@boomerangfu-shallow-20200710-233510-1pd0r.json 334 download   job
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm-00001.warc.gz 5401970825 download   job
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm-00001.warc.os.cdx.gz 177737 download
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm-00002.warc.gz 501393372 download   job
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm-00002.warc.os.cdx.gz 5504 download
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm-meta.warc.gz 212035 download   job
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm-urls.txt 33152 download
urls-transfer.notkiska.pw-twitter-@fieldcrafttweet-shallow-20200710-224418-2rklm.json 342 download   job
urls-transfer.notkiska.pw-twitter-@interactolabs-shallow-20200710-210829-5t92t-00000.warc.gz 49205175 download   job
urls-transfer.notkiska.pw-twitter-@interactolabs-shallow-20200710-210829-5t92t-00000.warc.os.cdx.gz 118096 download
urls-transfer.notkiska.pw-twitter-@ovrlndtraining-shallow-20200710-224858-55wcl-00000.warc.gz 994180 download   job
urls-transfer.notkiska.pw-twitter-@ovrlndtraining-shallow-20200710-224858-55wcl-00000.warc.os.cdx.gz 4072 download
urls-transfer.notkiska.pw-twitter-@ovrlndtraining-shallow-20200710-224858-55wcl-meta.warc.gz 6140 download   job
urls-transfer.notkiska.pw-twitter-@ovrlndtraining-shallow-20200710-224858-55wcl-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@ovrlndtraining-shallow-20200710-224858-55wcl-urls.txt 143 download
urls-transfer.notkiska.pw-twitter-@ovrlndtraining-shallow-20200710-224858-55wcl.json 340 download   job
urls-transfer.notkiska.pw-vkontakte-poshlaya_molly-shallow-20200710-225017-86ifc-00000.warc.gz 311078276 download   job
urls-transfer.notkiska.pw-vkontakte-poshlaya_molly-shallow-20200710-225017-86ifc-00000.warc.os.cdx.gz 1038312 download
urls-transfer.notkiska.pw-vkontakte-poshlaya_molly-shallow-20200710-225017-86ifc-meta.warc.gz 621432 download   job
urls-transfer.notkiska.pw-vkontakte-poshlaya_molly-shallow-20200710-225017-86ifc-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-vkontakte-poshlaya_molly-shallow-20200710-225017-86ifc-urls.txt 4858 download
urls-transfer.notkiska.pw-vkontakte-poshlaya_molly-shallow-20200710-225017-86ifc.json 342 download   job
waronguns.blogspot.com-inf-20200603-132815-5fv0d-00076.warc.gz 5368839207 download   job
waronguns.blogspot.com-inf-20200603-132815-5fv0d-00076.warc.os.cdx.gz 3561791 download
wii5.taiko-ch.net-inf-20200710-212143-em9wc.json 246 download   job
wiiu.taiko-ch.net-inf-20200710-212048-987o1-00000.warc.gz 59519367 download   job
wiiu.taiko-ch.net-inf-20200710-212048-987o1-00000.warc.os.cdx.gz 40012 download
wiiu.taiko-ch.net-inf-20200710-212048-987o1-meta.warc.gz 25976 download   job
wiiu.taiko-ch.net-inf-20200710-212048-987o1-meta.warc.os.cdx.gz 47 download
wiiu3.taiko-ch.net-inf-20200710-212130-8ufvz-00000.warc.gz 46337443 download   job
wiiu3.taiko-ch.net-inf-20200710-212130-8ufvz-00000.warc.os.cdx.gz 51525 download
www.boomerangfu.com-inf-20200710-233441-7k64i-00000.warc.gz 45900389 download   job
www.boomerangfu.com-inf-20200710-233441-7k64i-00000.warc.os.cdx.gz 86191 download
www.boomerangfu.com-inf-20200710-233441-7k64i-meta.warc.gz 53005 download   job
www.boomerangfu.com-inf-20200710-233441-7k64i-meta.warc.os.cdx.gz 47 download
www.notcot.com-inf-20200709-213423-116f3-00005.warc.gz 5369567279 download   job
www.notcot.com-inf-20200709-213423-116f3-00005.warc.os.cdx.gz 2728802 download