Item archiveteam_archivebot_go_20200712040004

View on Internet Archive

Filename Size
archiveteam_archivebot_go_20200712040004.cdx.gz 99721978 download
archiveteam_archivebot_go_20200712040004.cdx.idx 81681 download
archiveteam_archivebot_go_20200712040004_files.xml 0 download
archiveteam_archivebot_go_20200712040004_meta.sqlite 578560 download
archiveteam_archivebot_go_20200712040004_meta.xml 969 download
cliqz.com-inf-20200501-194732-82yzf-00248.warc.gz 5368759204 download   job
cliqz.com-inf-20200501-194732-82yzf-00248.warc.os.cdx.gz 1331274 download
daimon-games.blogspot.com-inf-20200711-210330-42kb4-00000.warc.gz 1165488121 download   job
daimon-games.blogspot.com-inf-20200711-210330-42kb4-00000.warc.os.cdx.gz 958217 download
daimon-games.blogspot.com-inf-20200711-210330-42kb4-meta.warc.gz 618273 download   job
daimon-games.blogspot.com-inf-20200711-210330-42kb4-meta.warc.os.cdx.gz 47 download
daimon-games.blogspot.com-inf-20200711-210330-42kb4.json 250 download   job
dyverscampaign.blogspot.com-inf-20200711-230125-6y293-00000.warc.gz 5484562851 download   job
dyverscampaign.blogspot.com-inf-20200711-230125-6y293-00000.warc.os.cdx.gz 3429487 download
forgottenrunes.blogspot.com-inf-20200711-230443-atjrw-00000.warc.gz 3760680041 download   job
forgottenrunes.blogspot.com-inf-20200711-230443-atjrw-00000.warc.os.cdx.gz 3083393 download
forgottenrunes.blogspot.com-inf-20200711-230443-atjrw-meta.warc.gz 1915498 download   job
forgottenrunes.blogspot.com-inf-20200711-230443-atjrw-meta.warc.os.cdx.gz 47 download
frothsofdnd.blogspot.com-inf-20200711-230513-4s710-00000.warc.gz 5368783681 download   job
frothsofdnd.blogspot.com-inf-20200711-230513-4s710-00000.warc.os.cdx.gz 3775945 download
history/files/urls-transfer.notkiska.pw-twitter-%23BlackTwitter-shallow-20200710-163004-dpwry-00002.warc.gz.~1~ 5368770222 download
igemathome.org-inf-20200712-001946-3wyb1-meta.warc.gz 582366 download   job
igemathome.org-inf-20200712-001946-3wyb1-meta.warc.os.cdx.gz 47 download
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00000.warc.gz 5399365980 download   job
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00000.warc.os.cdx.gz 611281 download
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00001.warc.gz 5369541060 download   job
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00001.warc.os.cdx.gz 110434 download
likebeingreadtofromdictionaries.blogspot.com-inf-20200712-003854-44oxt-00000.warc.gz 1793716115 download   job
likebeingreadtofromdictionaries.blogspot.com-inf-20200712-003854-44oxt-00000.warc.os.cdx.gz 1682129 download
likebeingreadtofromdictionaries.blogspot.com-inf-20200712-003854-44oxt-meta.warc.gz 1093369 download   job
likebeingreadtofromdictionaries.blogspot.com-inf-20200712-003854-44oxt-meta.warc.os.cdx.gz 47 download
likebeingreadtofromdictionaries.blogspot.com-inf-20200712-003854-44oxt.json 269 download   job
lotbieth.blogspot.com-inf-20200712-003944-5bzel-00000.warc.gz 3902513939 download   job
lotbieth.blogspot.com-inf-20200712-003944-5bzel-00000.warc.os.cdx.gz 2249208 download
lotbieth.blogspot.com-inf-20200712-003944-5bzel-meta.warc.gz 1541077 download   job
lotbieth.blogspot.com-inf-20200712-003944-5bzel-meta.warc.os.cdx.gz 47 download
lotbieth.blogspot.com-inf-20200712-003944-5bzel.json 246 download   job
luc.devroye.org-inf-20200629-195003-6kmq5-00050.warc.gz 5369952327 download   job
luc.devroye.org-inf-20200629-195003-6kmq5-00050.warc.os.cdx.gz 4205386 download
old.reddit.com-inf-20200711-222159-93eeb-00000.warc.gz 5368761469 download   job
old.reddit.com-inf-20200711-222159-93eeb-00000.warc.os.cdx.gz 2860198 download
old.reddit.com-inf-20200711-222159-93eeb-00001.warc.gz 8168656972 download   job
old.reddit.com-inf-20200711-222159-93eeb-00001.warc.os.cdx.gz 267321 download
old.reddit.com-inf-20200712-011850-9p1ex-00000.warc.gz 2215887432 download   job
old.reddit.com-inf-20200712-011850-9p1ex-00000.warc.os.cdx.gz 1331484 download
old.reddit.com-inf-20200712-011850-9p1ex-meta.warc.gz 1039709 download   job
old.reddit.com-inf-20200712-011850-9p1ex-meta.warc.os.cdx.gz 47 download
old.reddit.com-inf-20200712-011850-9p1ex.json 259 download   job
old.reddit.com-inf-20200712-011859-hbn6z-00000.warc.gz 1555748909 download   job
old.reddit.com-inf-20200712-011859-hbn6z-00000.warc.os.cdx.gz 716803 download
old.reddit.com-inf-20200712-011859-hbn6z-meta.warc.gz 598397 download   job
old.reddit.com-inf-20200712-011859-hbn6z-meta.warc.os.cdx.gz 47 download
old.reddit.com-inf-20200712-011859-hbn6z.json 254 download   job
player.fm-inf-20200501-233943-6recr-00684.warc.gz 5380466728 download   job
player.fm-inf-20200501-233943-6recr-00684.warc.os.cdx.gz 835633 download
research.davecoss.com-inf-20200712-022404-11mlx-00000.warc.gz 8060952 download   job
research.davecoss.com-inf-20200712-022404-11mlx-00000.warc.os.cdx.gz 17757 download
research.davecoss.com-inf-20200712-022404-11mlx-meta.warc.gz 14546 download   job
research.davecoss.com-inf-20200712-022404-11mlx-meta.warc.os.cdx.gz 47 download
research.davecoss.com-inf-20200712-022404-11mlx.json 245 download   job
roundup-tracker.org-inf-20200712-020434-czw1n-00000.warc.gz 58917226 download   job
roundup-tracker.org-inf-20200712-020434-czw1n-00000.warc.os.cdx.gz 125417 download
roundup-tracker.org-inf-20200712-020434-czw1n-meta.warc.gz 80768 download   job
roundup-tracker.org-inf-20200712-020434-czw1n-meta.warc.os.cdx.gz 47 download
roundup-tracker.org-inf-20200712-020434-czw1n.json 243 download   job
so.12371.cn-inf-20200711-224034-dlg59-00000.warc.gz 5368789993 download   job
so.12371.cn-inf-20200711-224034-dlg59-00000.warc.os.cdx.gz 5190416 download
so.12371.cn-inf-20200711-224034-dlg59-00001.warc.gz 352737489 download   job
so.12371.cn-inf-20200711-224034-dlg59-00001.warc.os.cdx.gz 99420 download
so.12371.cn-inf-20200711-224034-dlg59-meta.warc.gz 3029967 download   job
so.12371.cn-inf-20200711-224034-dlg59-meta.warc.os.cdx.gz 47 download
so.12371.cn-inf-20200711-224034-dlg59.json 245 download   job
starringthecomputer.com-inf-20200712-014532-1ft2f-00000.warc.gz 1473372206 download   job
starringthecomputer.com-inf-20200712-014532-1ft2f-00000.warc.os.cdx.gz 1104481 download
starringthecomputer.com-inf-20200712-014532-1ft2f-meta.warc.gz 663394 download   job
starringthecomputer.com-inf-20200712-014532-1ft2f-meta.warc.os.cdx.gz 47 download
starringthecomputer.com-inf-20200712-014532-1ft2f.json 251 download   job
thevirustracker.com-inf-20200620-170113-b912c-00022.warc.gz 5369426723 download   job
thevirustracker.com-inf-20200620-170113-b912c-00022.warc.os.cdx.gz 5158635 download
transfer.notkiska.pw-shallow-20200712-033303-5hw0i-00000.warc.gz 4063 download   job
transfer.notkiska.pw-shallow-20200712-033303-5hw0i-00000.warc.os.cdx.gz 236 download
transfer.notkiska.pw-shallow-20200712-033303-5hw0i-meta.warc.gz 3530 download   job
transfer.notkiska.pw-shallow-20200712-033303-5hw0i-meta.warc.os.cdx.gz 47 download
transfer.notkiska.pw-shallow-20200712-033303-5hw0i.json 275 download   job
transfer.notkiska.pw-shallow-20200712-033413-6t2ew-meta.warc.gz 3529 download   job
transfer.notkiska.pw-shallow-20200712-033413-6t2ew-meta.warc.os.cdx.gz 47 download
transfer.notkiska.pw-shallow-20200712-033413-6t2ew.json 279 download   job
urls-archive.max.fan-twitter-@LKeath-filtered.txt-shallow-20200712-034232-nuq96-00000.warc.gz 67854973 download   job
urls-archive.max.fan-twitter-@LKeath-filtered.txt-shallow-20200712-034232-nuq96-00000.warc.os.cdx.gz 131549 download
urls-archive.max.fan-twitter-@LKeath-filtered.txt-shallow-20200712-034232-nuq96-meta.warc.gz 75109 download   job
urls-archive.max.fan-twitter-@LKeath-filtered.txt-shallow-20200712-034232-nuq96-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LKeath-filtered.txt-shallow-20200712-034232-nuq96.json 327 download   job
urls-archive.max.fan-twitter-@LSNJEE-filtered.txt-shallow-20200712-033942-d1ru1-00000.warc.gz 131853087 download   job
urls-archive.max.fan-twitter-@LSNJEE-filtered.txt-shallow-20200712-033942-d1ru1-00000.warc.os.cdx.gz 57322 download
urls-archive.max.fan-twitter-@LSNJEE-filtered.txt-shallow-20200712-033942-d1ru1-meta.warc.gz 35700 download   job
urls-archive.max.fan-twitter-@LSNJEE-filtered.txt-shallow-20200712-033942-d1ru1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LSNJEE-filtered.txt-shallow-20200712-033942-d1ru1.json 327 download   job
urls-archive.max.fan-twitter-@LT_MFA_Stratcom-filtered.txt-shallow-20200712-033227-8yvu0-00000.warc.gz 192469407 download   job
urls-archive.max.fan-twitter-@LT_MFA_Stratcom-filtered.txt-shallow-20200712-033227-8yvu0-00000.warc.os.cdx.gz 389006 download
urls-archive.max.fan-twitter-@LT_MFA_Stratcom-filtered.txt-shallow-20200712-033227-8yvu0-meta.warc.gz 212597 download   job
urls-archive.max.fan-twitter-@LT_MFA_Stratcom-filtered.txt-shallow-20200712-033227-8yvu0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LT_MFA_Stratcom-filtered.txt-shallow-20200712-033227-8yvu0-urls.txt 78080 download
urls-archive.max.fan-twitter-@LizTwistMP-filtered.txt-shallow-20200712-034235-15a59-00000.warc.gz 66001679 download   job
urls-archive.max.fan-twitter-@LizTwistMP-filtered.txt-shallow-20200712-034235-15a59-00000.warc.os.cdx.gz 104957 download
urls-archive.max.fan-twitter-@LizTwistMP-filtered.txt-shallow-20200712-034235-15a59-meta.warc.gz 60285 download   job
urls-archive.max.fan-twitter-@LizTwistMP-filtered.txt-shallow-20200712-034235-15a59-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LizTwistMP-filtered.txt-shallow-20200712-034235-15a59-urls.txt 23838 download
urls-archive.max.fan-twitter-@LizTwistMP-filtered.txt-shallow-20200712-034235-15a59.json 335 download   job
urls-archive.max.fan-twitter-@LtGovDavidson-filtered.txt-shallow-20200712-033915-96n9e-00000.warc.gz 10670682 download   job
urls-archive.max.fan-twitter-@LtGovDavidson-filtered.txt-shallow-20200712-033915-96n9e-00000.warc.os.cdx.gz 20772 download
urls-archive.max.fan-twitter-@LtGovDavidson-filtered.txt-shallow-20200712-033915-96n9e-meta.warc.gz 15403 download   job
urls-archive.max.fan-twitter-@LtGovDavidson-filtered.txt-shallow-20200712-033915-96n9e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LtGovDavidson-filtered.txt-shallow-20200712-033915-96n9e-urls.txt 1952 download
urls-archive.max.fan-twitter-@LtGovDavidson-filtered.txt-shallow-20200712-033915-96n9e.json 341 download   job
urls-archive.max.fan-twitter-@LuetkemeyerB-filtered.txt-shallow-20200712-033221-7jbe8-00000.warc.gz 72212117 download   job
urls-archive.max.fan-twitter-@LuetkemeyerB-filtered.txt-shallow-20200712-033221-7jbe8-00000.warc.os.cdx.gz 95947 download
urls-archive.max.fan-twitter-@LuetkemeyerB-filtered.txt-shallow-20200712-033221-7jbe8-meta.warc.gz 55821 download   job
urls-archive.max.fan-twitter-@LuetkemeyerB-filtered.txt-shallow-20200712-033221-7jbe8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LuetkemeyerB-filtered.txt-shallow-20200712-033221-7jbe8-urls.txt 42757 download
urls-archive.max.fan-twitter-@LuetkemeyerB-filtered.txt-shallow-20200712-033221-7jbe8.json 339 download   job
urls-archive.max.fan-twitter-@LuisCarrilhoPC-filtered.txt-shallow-20200712-032444-67sam-00000.warc.gz 117623638 download   job
urls-archive.max.fan-twitter-@LuisCarrilhoPC-filtered.txt-shallow-20200712-032444-67sam-00000.warc.os.cdx.gz 157159 download
urls-archive.max.fan-twitter-@LuisCarrilhoPC-filtered.txt-shallow-20200712-032444-67sam-meta.warc.gz 87044 download   job
urls-archive.max.fan-twitter-@LuisCarrilhoPC-filtered.txt-shallow-20200712-032444-67sam-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LuisCarrilhoPC-filtered.txt-shallow-20200712-032444-67sam-urls.txt 37801 download
urls-archive.max.fan-twitter-@LuisCarrilhoPC-filtered.txt-shallow-20200712-032444-67sam.json 343 download   job
urls-archive.max.fan-twitter-@LukeHall-filtered.txt-shallow-20200712-032441-e0j0l-meta.warc.gz 32900 download   job
urls-archive.max.fan-twitter-@LukeHall-filtered.txt-shallow-20200712-032441-e0j0l-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LukeHall-filtered.txt-shallow-20200712-032441-e0j0l-urls.txt 6832 download
urls-archive.max.fan-twitter-@LukeHall-filtered.txt-shallow-20200712-032441-e0j0l.json 331 download   job
urls-archive.max.fan-twitter-@LutzHen-filtered.txt-shallow-20200712-032413-aw2ri-meta.warc.gz 7017 download   job
urls-archive.max.fan-twitter-@LutzHen-filtered.txt-shallow-20200712-032413-aw2ri-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LutzHen-filtered.txt-shallow-20200712-032413-aw2ri-urls.txt 1080 download
urls-archive.max.fan-twitter-@LutzHen-filtered.txt-shallow-20200712-032413-aw2ri.json 329 download   job
urls-archive.max.fan-twitter-@LuxembourgUN-filtered.txt-shallow-20200712-032236-7w4ey-00000.warc.gz 396611509 download   job
urls-archive.max.fan-twitter-@LuxembourgUN-filtered.txt-shallow-20200712-032236-7w4ey-00000.warc.os.cdx.gz 551417 download
urls-archive.max.fan-twitter-@LuxembourgUN-filtered.txt-shallow-20200712-032236-7w4ey-meta.warc.gz 292085 download   job
urls-archive.max.fan-twitter-@LuxembourgUN-filtered.txt-shallow-20200712-032236-7w4ey-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LuxembourgUN-filtered.txt-shallow-20200712-032236-7w4ey-urls.txt 186637 download
urls-archive.max.fan-twitter-@LuxembourgUN-filtered.txt-shallow-20200712-032236-7w4ey.json 339 download   job
urls-archive.max.fan-twitter-@LynneSladky-filtered.txt-shallow-20200712-032236-7z745-00000.warc.gz 21283785 download   job
urls-archive.max.fan-twitter-@LynneSladky-filtered.txt-shallow-20200712-032236-7z745-00000.warc.os.cdx.gz 31613 download
urls-archive.max.fan-twitter-@LynneSladky-filtered.txt-shallow-20200712-032236-7z745-meta.warc.gz 21626 download   job
urls-archive.max.fan-twitter-@LynneSladky-filtered.txt-shallow-20200712-032236-7z745-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LynneSladky-filtered.txt-shallow-20200712-032236-7z745-urls.txt 16092 download
urls-archive.max.fan-twitter-@LynneSladky-filtered.txt-shallow-20200712-032236-7z745.json 337 download   job
urls-archive.max.fan-twitter-@MAECHaiti-filtered.txt-shallow-20200712-032140-8do53-meta.warc.gz 22196 download   job
urls-archive.max.fan-twitter-@MAECHaiti-filtered.txt-shallow-20200712-032140-8do53-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MAECHaiti-filtered.txt-shallow-20200712-032140-8do53-urls.txt 6832 download
urls-archive.max.fan-twitter-@MAECHaiti-filtered.txt-shallow-20200712-032140-8do53.json 333 download   job
urls-archive.max.fan-twitter-@MANovicki-filtered.txt-shallow-20200712-030111-ahch8-00000.warc.gz 42813456 download   job
urls-archive.max.fan-twitter-@MANovicki-filtered.txt-shallow-20200712-030111-ahch8-00000.warc.os.cdx.gz 49813 download
urls-archive.max.fan-twitter-@MANovicki-filtered.txt-shallow-20200712-030111-ahch8-meta.warc.gz 31362 download   job
urls-archive.max.fan-twitter-@MANovicki-filtered.txt-shallow-20200712-030111-ahch8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MANovicki-filtered.txt-shallow-20200712-030111-ahch8-urls.txt 16039 download
urls-archive.max.fan-twitter-@MANovicki-filtered.txt-shallow-20200712-030111-ahch8.json 333 download   job
urls-archive.max.fan-twitter-@MBPDSC-filtered.txt-shallow-20200712-014111-3g7t7-urls.txt 129592 download
urls-archive.max.fan-twitter-@MCheathamW-filtered.txt-shallow-20200712-013735-85pny-urls.txt 39844 download
urls-archive.max.fan-twitter-@MDPDenEspanol-filtered.txt-shallow-20200712-011743-2a3om-00000.warc.gz 535722884 download   job
urls-archive.max.fan-twitter-@MDPDenEspanol-filtered.txt-shallow-20200712-011743-2a3om-00000.warc.os.cdx.gz 355765 download
urls-archive.max.fan-twitter-@MESecOfState-filtered.txt-shallow-20200712-010910-ahm3a-00000.warc.gz 134451311 download   job
urls-archive.max.fan-twitter-@MESecOfState-filtered.txt-shallow-20200712-010910-ahm3a-00000.warc.os.cdx.gz 181953 download
urls-archive.max.fan-twitter-@MaggieAstor-filtered.txt-shallow-20200712-031455-93r9k-meta.warc.gz 219150 download   job
urls-archive.max.fan-twitter-@MaggieAstor-filtered.txt-shallow-20200712-031455-93r9k-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MaggieAstor-filtered.txt-shallow-20200712-031455-93r9k-urls.txt 93397 download
urls-archive.max.fan-twitter-@MaggieAstor-filtered.txt-shallow-20200712-031455-93r9k.json 337 download   job
urls-archive.max.fan-twitter-@MaimunahSharif-filtered.txt-shallow-20200712-031334-9t53n-00000.warc.gz 736604826 download   job
urls-archive.max.fan-twitter-@MaimunahSharif-filtered.txt-shallow-20200712-031334-9t53n-00000.warc.os.cdx.gz 819137 download
urls-archive.max.fan-twitter-@MaimunahSharif-filtered.txt-shallow-20200712-031334-9t53n-meta.warc.gz 427265 download   job
urls-archive.max.fan-twitter-@MaimunahSharif-filtered.txt-shallow-20200712-031334-9t53n-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MaimunahSharif-filtered.txt-shallow-20200712-031334-9t53n.json 343 download   job
urls-archive.max.fan-twitter-@MaithripalaS-filtered.txt-shallow-20200712-031334-1c9x4-00000.warc.gz 514637064 download   job
urls-archive.max.fan-twitter-@MaithripalaS-filtered.txt-shallow-20200712-031334-1c9x4-00000.warc.os.cdx.gz 941995 download
urls-archive.max.fan-twitter-@MaithripalaS-filtered.txt-shallow-20200712-031334-1c9x4-meta.warc.gz 493338 download   job
urls-archive.max.fan-twitter-@MaithripalaS-filtered.txt-shallow-20200712-031334-1c9x4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MaithripalaS-filtered.txt-shallow-20200712-031334-1c9x4-urls.txt 161081 download
urls-archive.max.fan-twitter-@MaithripalaS-filtered.txt-shallow-20200712-031334-1c9x4.json 339 download   job
urls-archive.max.fan-twitter-@Malala-filtered.txt-shallow-20200712-031333-czuuk-meta.warc.gz 231911 download   job
urls-archive.max.fan-twitter-@Malala-filtered.txt-shallow-20200712-031333-czuuk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Malala-filtered.txt-shallow-20200712-031333-czuuk-urls.txt 23993 download
urls-archive.max.fan-twitter-@Malala-filtered.txt-shallow-20200712-031333-czuuk.json 327 download   job
urls-archive.max.fan-twitter-@MalcolmRitter-filtered.txt-shallow-20200712-030115-7d2kj-00000.warc.gz 85012056 download   job
urls-archive.max.fan-twitter-@MalcolmRitter-filtered.txt-shallow-20200712-030115-7d2kj-00000.warc.os.cdx.gz 130666 download
urls-archive.max.fan-twitter-@MalcolmRitter-filtered.txt-shallow-20200712-030115-7d2kj-meta.warc.gz 74427 download   job
urls-archive.max.fan-twitter-@MalcolmRitter-filtered.txt-shallow-20200712-030115-7d2kj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MalcolmRitter-filtered.txt-shallow-20200712-030115-7d2kj-urls.txt 71428 download
urls-archive.max.fan-twitter-@MangalaLK-filtered.txt-shallow-20200712-030112-53duq-00000.warc.gz 348316304 download   job
urls-archive.max.fan-twitter-@MangalaLK-filtered.txt-shallow-20200712-030112-53duq-00000.warc.os.cdx.gz 620321 download
urls-archive.max.fan-twitter-@MangalaLK-filtered.txt-shallow-20200712-030112-53duq-meta.warc.gz 331130 download   job
urls-archive.max.fan-twitter-@MangalaLK-filtered.txt-shallow-20200712-030112-53duq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MangalaLK-filtered.txt-shallow-20200712-030112-53duq.json 333 download   job
urls-archive.max.fan-twitter-@MarcDelatte-filtered.txt-shallow-20200712-030106-3ohyi-00000.warc.gz 332858361 download   job
urls-archive.max.fan-twitter-@MarcDelatte-filtered.txt-shallow-20200712-030106-3ohyi-00000.warc.os.cdx.gz 233671 download
urls-archive.max.fan-twitter-@MarcDelatte-filtered.txt-shallow-20200712-030106-3ohyi-meta.warc.gz 126643 download   job
urls-archive.max.fan-twitter-@MarcDelatte-filtered.txt-shallow-20200712-030106-3ohyi-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MarcDelatte-filtered.txt-shallow-20200712-030106-3ohyi-urls.txt 131144 download
urls-archive.max.fan-twitter-@MarcDelatte-filtered.txt-shallow-20200712-030106-3ohyi.json 337 download   job
urls-archive.max.fan-twitter-@MarcSantoraNYT-filtered.txt-shallow-20200712-025402-1juzr-00000.warc.gz 57229251 download   job
urls-archive.max.fan-twitter-@MarcSantoraNYT-filtered.txt-shallow-20200712-025402-1juzr-00000.warc.os.cdx.gz 107974 download
urls-archive.max.fan-twitter-@MarcSantoraNYT-filtered.txt-shallow-20200712-025402-1juzr-meta.warc.gz 60874 download   job
urls-archive.max.fan-twitter-@MarcSantoraNYT-filtered.txt-shallow-20200712-025402-1juzr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MarcSantoraNYT-filtered.txt-shallow-20200712-025402-1juzr-urls.txt 26947 download
urls-archive.max.fan-twitter-@MarcSantoraNYT-filtered.txt-shallow-20200712-025402-1juzr.json 343 download   job
urls-archive.max.fan-twitter-@MarieLouise_MT-filtered.txt-shallow-20200712-025359-6j8gs-meta.warc.gz 175988 download   job
urls-archive.max.fan-twitter-@MarieLouise_MT-filtered.txt-shallow-20200712-025359-6j8gs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MarieLouise_MT-filtered.txt-shallow-20200712-025359-6j8gs-urls.txt 106096 download
urls-archive.max.fan-twitter-@MarieLouise_MT-filtered.txt-shallow-20200712-025359-6j8gs.json 343 download   job
urls-archive.max.fan-twitter-@MarkLeibovich-filtered.txt-shallow-20200712-025128-cf4tr-00000.warc.gz 774500138 download   job
urls-archive.max.fan-twitter-@MarkLeibovich-filtered.txt-shallow-20200712-025128-cf4tr-00000.warc.os.cdx.gz 1811042 download
urls-archive.max.fan-twitter-@MarkLeibovich-filtered.txt-shallow-20200712-025128-cf4tr-meta.warc.gz 960734 download   job
urls-archive.max.fan-twitter-@MarkLeibovich-filtered.txt-shallow-20200712-025128-cf4tr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MarkLeibovich-filtered.txt-shallow-20200712-025128-cf4tr-urls.txt 596376 download
urls-archive.max.fan-twitter-@MarkLeibovich-filtered.txt-shallow-20200712-025128-cf4tr.json 341 download   job
urls-archive.max.fan-twitter-@MartaLavandier-filtered.txt-shallow-20200712-025125-9aom3-00000.warc.gz 14575305 download   job
urls-archive.max.fan-twitter-@MartaLavandier-filtered.txt-shallow-20200712-025125-9aom3-00000.warc.os.cdx.gz 24549 download
urls-archive.max.fan-twitter-@MartaLavandier-filtered.txt-shallow-20200712-025125-9aom3-meta.warc.gz 17765 download   job
urls-archive.max.fan-twitter-@MartaLavandier-filtered.txt-shallow-20200712-025125-9aom3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MartaLavandier-filtered.txt-shallow-20200712-025125-9aom3-urls.txt 10635 download
urls-archive.max.fan-twitter-@MartaLavandier-filtered.txt-shallow-20200712-025125-9aom3.json 343 download   job
urls-archive.max.fan-twitter-@MartinVizcarraC-filtered.txt-shallow-20200712-023053-9otsm-00000.warc.gz 2226892 download   job
urls-archive.max.fan-twitter-@MartinVizcarraC-filtered.txt-shallow-20200712-023053-9otsm-00000.warc.os.cdx.gz 9398 download
urls-archive.max.fan-twitter-@MartinVizcarraC-filtered.txt-shallow-20200712-023053-9otsm-meta.warc.gz 9213 download   job
urls-archive.max.fan-twitter-@MartinVizcarraC-filtered.txt-shallow-20200712-023053-9otsm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MartinVizcarraC-filtered.txt-shallow-20200712-023053-9otsm-urls.txt 189 download
urls-archive.max.fan-twitter-@MartinVizcarraC-filtered.txt-shallow-20200712-023053-9otsm.json 345 download   job
urls-archive.max.fan-twitter-@MartineWonner-filtered.txt-shallow-20200712-025121-2p0u1-00000.warc.gz 432258300 download   job
urls-archive.max.fan-twitter-@MartineWonner-filtered.txt-shallow-20200712-025121-2p0u1-00000.warc.os.cdx.gz 487749 download
urls-archive.max.fan-twitter-@MartineWonner-filtered.txt-shallow-20200712-025121-2p0u1-meta.warc.gz 258070 download   job
urls-archive.max.fan-twitter-@MartineWonner-filtered.txt-shallow-20200712-025121-2p0u1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MartineWonner-filtered.txt-shallow-20200712-025121-2p0u1-urls.txt 89894 download
urls-archive.max.fan-twitter-@MartineWonner-filtered.txt-shallow-20200712-025121-2p0u1.json 341 download   job
urls-archive.max.fan-twitter-@MartinezSoliman-filtered.txt-shallow-20200712-023055-99afc-00000.warc.gz 307284375 download   job
urls-archive.max.fan-twitter-@MartinezSoliman-filtered.txt-shallow-20200712-023055-99afc-00000.warc.os.cdx.gz 427514 download
urls-archive.max.fan-twitter-@MartinezSoliman-filtered.txt-shallow-20200712-023055-99afc-meta.warc.gz 227899 download   job
urls-archive.max.fan-twitter-@MartinezSoliman-filtered.txt-shallow-20200712-023055-99afc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MartinezSoliman-filtered.txt-shallow-20200712-023055-99afc-urls.txt 128461 download
urls-archive.max.fan-twitter-@MartinezSoliman-filtered.txt-shallow-20200712-023055-99afc.json 345 download   job
urls-archive.max.fan-twitter-@Mary_Luisa_AG-filtered.txt-shallow-20200712-022739-1tx93-00000.warc.gz 528443006 download   job
urls-archive.max.fan-twitter-@Mary_Luisa_AG-filtered.txt-shallow-20200712-022739-1tx93-00000.warc.os.cdx.gz 789360 download
urls-archive.max.fan-twitter-@Mary_Luisa_AG-filtered.txt-shallow-20200712-022739-1tx93-meta.warc.gz 418837 download   job
urls-archive.max.fan-twitter-@Mary_Luisa_AG-filtered.txt-shallow-20200712-022739-1tx93-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Mary_Luisa_AG-filtered.txt-shallow-20200712-022739-1tx93-urls.txt 95648 download
urls-archive.max.fan-twitter-@Mary_Luisa_AG-filtered.txt-shallow-20200712-022739-1tx93.json 341 download   job
urls-archive.max.fan-twitter-@MaryamMonsef-filtered.txt-shallow-20200712-022747-6ekwt-00000.warc.gz 1882172155 download   job
urls-archive.max.fan-twitter-@MaryamMonsef-filtered.txt-shallow-20200712-022747-6ekwt-00000.warc.os.cdx.gz 2327264 download
urls-archive.max.fan-twitter-@MaryamMonsef-filtered.txt-shallow-20200712-022747-6ekwt-meta.warc.gz 1224347 download   job
urls-archive.max.fan-twitter-@MaryamMonsef-filtered.txt-shallow-20200712-022747-6ekwt-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MaryamMonsef-filtered.txt-shallow-20200712-022747-6ekwt-urls.txt 912604 download
urls-archive.max.fan-twitter-@MaryamMonsef-filtered.txt-shallow-20200712-022747-6ekwt.json 339 download   job
urls-archive.max.fan-twitter-@Maryclairedale-filtered.txt-shallow-20200712-022744-38flp-00000.warc.gz 61285985 download   job
urls-archive.max.fan-twitter-@Maryclairedale-filtered.txt-shallow-20200712-022744-38flp-00000.warc.os.cdx.gz 91601 download
urls-archive.max.fan-twitter-@Maryclairedale-filtered.txt-shallow-20200712-022744-38flp-meta.warc.gz 52852 download   job
urls-archive.max.fan-twitter-@Maryclairedale-filtered.txt-shallow-20200712-022744-38flp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Maryclairedale-filtered.txt-shallow-20200712-022744-38flp-urls.txt 49019 download
urls-archive.max.fan-twitter-@Maryclairedale-filtered.txt-shallow-20200712-022744-38flp.json 343 download   job
urls-archive.max.fan-twitter-@Masood__Khan-filtered.txt-shallow-20200712-022558-7cdgw-00000.warc.gz 2883770057 download   job
urls-archive.max.fan-twitter-@Masood__Khan-filtered.txt-shallow-20200712-022558-7cdgw-00000.warc.os.cdx.gz 2287605 download
urls-archive.max.fan-twitter-@Masood__Khan-filtered.txt-shallow-20200712-022558-7cdgw-meta.warc.gz 1160345 download   job
urls-archive.max.fan-twitter-@Masood__Khan-filtered.txt-shallow-20200712-022558-7cdgw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Masood__Khan-filtered.txt-shallow-20200712-022558-7cdgw-urls.txt 688633 download
urls-archive.max.fan-twitter-@MassLS-filtered.txt-shallow-20200712-022557-77jky-00000.warc.gz 13687334 download   job
urls-archive.max.fan-twitter-@MassLS-filtered.txt-shallow-20200712-022557-77jky-00000.warc.os.cdx.gz 20833 download
urls-archive.max.fan-twitter-@MassLS-filtered.txt-shallow-20200712-022557-77jky-meta.warc.gz 15771 download   job
urls-archive.max.fan-twitter-@MassLS-filtered.txt-shallow-20200712-022557-77jky-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MassLS-filtered.txt-shallow-20200712-022557-77jky-urls.txt 9707 download
urls-archive.max.fan-twitter-@MassLS-filtered.txt-shallow-20200712-022557-77jky.json 327 download   job
urls-archive.max.fan-twitter-@MattFutterman-filtered.txt-shallow-20200712-022221-2gzwg-00000.warc.gz 99708332 download   job
urls-archive.max.fan-twitter-@MattFutterman-filtered.txt-shallow-20200712-022221-2gzwg-00000.warc.os.cdx.gz 218131 download
urls-archive.max.fan-twitter-@MattFutterman-filtered.txt-shallow-20200712-022221-2gzwg-meta.warc.gz 119907 download   job
urls-archive.max.fan-twitter-@MattFutterman-filtered.txt-shallow-20200712-022221-2gzwg-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MattFutterman-filtered.txt-shallow-20200712-022221-2gzwg-urls.txt 76337 download
urls-archive.max.fan-twitter-@MattFutterman-filtered.txt-shallow-20200712-022221-2gzwg.json 341 download   job
urls-archive.max.fan-twitter-@MattHerdman-filtered.txt-shallow-20200712-021921-2eyvd-00000.warc.gz 1145995598 download   job
urls-archive.max.fan-twitter-@MattHerdman-filtered.txt-shallow-20200712-021921-2eyvd-00000.warc.os.cdx.gz 1200418 download
urls-archive.max.fan-twitter-@MattHerdman-filtered.txt-shallow-20200712-021921-2eyvd-meta.warc.gz 629318 download   job
urls-archive.max.fan-twitter-@MattHerdman-filtered.txt-shallow-20200712-021921-2eyvd-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MattHerdman-filtered.txt-shallow-20200712-021921-2eyvd-urls.txt 750246 download
urls-archive.max.fan-twitter-@MattHerdman-filtered.txt-shallow-20200712-021921-2eyvd.json 337 download   job
urls-archive.max.fan-twitter-@MaudPetit_AN94-filtered.txt-shallow-20200712-021822-2ftdr-00000.warc.gz 146572070 download   job
urls-archive.max.fan-twitter-@MaudPetit_AN94-filtered.txt-shallow-20200712-021822-2ftdr-00000.warc.os.cdx.gz 196685 download
urls-archive.max.fan-twitter-@MaudPetit_AN94-filtered.txt-shallow-20200712-021822-2ftdr-meta.warc.gz 107656 download   job
urls-archive.max.fan-twitter-@MaudPetit_AN94-filtered.txt-shallow-20200712-021822-2ftdr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MaudPetit_AN94-filtered.txt-shallow-20200712-021822-2ftdr-urls.txt 43318 download
urls-archive.max.fan-twitter-@MaudPetit_AN94-filtered.txt-shallow-20200712-021822-2ftdr.json 343 download   job
urls-archive.max.fan-twitter-@MayorAdler-filtered.txt-shallow-20200712-020632-9jv60-00000.warc.gz 698067071 download   job
urls-archive.max.fan-twitter-@MayorAdler-filtered.txt-shallow-20200712-020632-9jv60-00000.warc.os.cdx.gz 1428180 download
urls-archive.max.fan-twitter-@MayorAdler-filtered.txt-shallow-20200712-020632-9jv60-meta.warc.gz 749440 download   job
urls-archive.max.fan-twitter-@MayorAdler-filtered.txt-shallow-20200712-020632-9jv60-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MayorAdler-filtered.txt-shallow-20200712-020632-9jv60-urls.txt 285491 download
urls-archive.max.fan-twitter-@MayorAdler-filtered.txt-shallow-20200712-020632-9jv60.json 335 download   job
urls-archive.max.fan-twitter-@MayorBowser-filtered.txt-shallow-20200712-020631-e8g4l-00000.warc.gz 54915858 download   job
urls-archive.max.fan-twitter-@MayorBowser-filtered.txt-shallow-20200712-020631-e8g4l-00000.warc.os.cdx.gz 191262 download
urls-archive.max.fan-twitter-@MayorBowser-filtered.txt-shallow-20200712-020631-e8g4l-meta.warc.gz 103856 download   job
urls-archive.max.fan-twitter-@MayorBowser-filtered.txt-shallow-20200712-020631-e8g4l-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MayorBowser-filtered.txt-shallow-20200712-020631-e8g4l-urls.txt 13332 download
urls-archive.max.fan-twitter-@MayorBowser-filtered.txt-shallow-20200712-020631-e8g4l.json 337 download   job
urls-archive.max.fan-twitter-@MayorGallego-filtered.txt-shallow-20200712-020623-72q3a-00000.warc.gz 291839665 download   job
urls-archive.max.fan-twitter-@MayorGallego-filtered.txt-shallow-20200712-020623-72q3a-00000.warc.os.cdx.gz 472022 download
urls-archive.max.fan-twitter-@MayorGallego-filtered.txt-shallow-20200712-020623-72q3a-meta.warc.gz 251222 download   job
urls-archive.max.fan-twitter-@MayorGallego-filtered.txt-shallow-20200712-020623-72q3a-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MayorGallego-filtered.txt-shallow-20200712-020623-72q3a-urls.txt 69240 download
urls-archive.max.fan-twitter-@MayorGallego-filtered.txt-shallow-20200712-020623-72q3a.json 339 download   job
urls-archive.max.fan-twitter-@MayorGinther-filtered.txt-shallow-20200712-020621-3pvxz-00000.warc.gz 383190394 download   job
urls-archive.max.fan-twitter-@MayorGinther-filtered.txt-shallow-20200712-020621-3pvxz-00000.warc.os.cdx.gz 645167 download
urls-archive.max.fan-twitter-@MayorGinther-filtered.txt-shallow-20200712-020621-3pvxz-meta.warc.gz 344845 download   job
urls-archive.max.fan-twitter-@MayorGinther-filtered.txt-shallow-20200712-020621-3pvxz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MayorGinther-filtered.txt-shallow-20200712-020621-3pvxz-urls.txt 136102 download
urls-archive.max.fan-twitter-@MayorGinther-filtered.txt-shallow-20200712-020621-3pvxz.json 339 download   job
urls-archive.max.fan-twitter-@MayorJenny-filtered.txt-shallow-20200712-020008-27awx-00000.warc.gz 15329410 download   job
urls-archive.max.fan-twitter-@MayorJenny-filtered.txt-shallow-20200712-020008-27awx-00000.warc.os.cdx.gz 79649 download
urls-archive.max.fan-twitter-@MayorJenny-filtered.txt-shallow-20200712-020008-27awx-meta.warc.gz 46531 download   job
urls-archive.max.fan-twitter-@MayorJenny-filtered.txt-shallow-20200712-020008-27awx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MayorJenny-filtered.txt-shallow-20200712-020008-27awx-urls.txt 4524 download
urls-archive.max.fan-twitter-@MayorJenny-filtered.txt-shallow-20200712-020008-27awx.json 335 download   job
urls-archive.max.fan-twitter-@MayorMemphis-filtered.txt-shallow-20200712-020005-2nw0r-00000.warc.gz 882873694 download   job
urls-archive.max.fan-twitter-@MayorMemphis-filtered.txt-shallow-20200712-020005-2nw0r-00000.warc.os.cdx.gz 973502 download
urls-archive.max.fan-twitter-@MayorMemphis-filtered.txt-shallow-20200712-020005-2nw0r-meta.warc.gz 515835 download   job
urls-archive.max.fan-twitter-@MayorMemphis-filtered.txt-shallow-20200712-020005-2nw0r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MayorMemphis-filtered.txt-shallow-20200712-020005-2nw0r-urls.txt 232109 download
urls-archive.max.fan-twitter-@MayorMemphis-filtered.txt-shallow-20200712-020005-2nw0r.json 339 download   job
urls-archive.max.fan-twitter-@MayorMikeDuggan-filtered.txt-shallow-20200712-014113-7ik37-00000.warc.gz 910461577 download   job
urls-archive.max.fan-twitter-@MayorMikeDuggan-filtered.txt-shallow-20200712-014113-7ik37-00000.warc.os.cdx.gz 1320777 download
urls-archive.max.fan-twitter-@MayorMikeDuggan-filtered.txt-shallow-20200712-014113-7ik37-meta.warc.gz 700262 download   job
urls-archive.max.fan-twitter-@MayorMikeDuggan-filtered.txt-shallow-20200712-014113-7ik37-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MayorMikeDuggan-filtered.txt-shallow-20200712-014113-7ik37-urls.txt 610993 download
urls-archive.max.fan-twitter-@MayorMikeDuggan-filtered.txt-shallow-20200712-014113-7ik37.json 345 download   job
urls-archive.max.fan-twitter-@MetroPhotoPete-filtered.txt-shallow-20200712-010910-ac9ya-meta.warc.gz 34510 download   job
urls-archive.max.fan-twitter-@MetroPhotoPete-filtered.txt-shallow-20200712-010910-ac9ya-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MeyerFalcon-filtered.txt-shallow-20200712-010529-9ji82-00000.warc.gz 630974104 download   job
urls-archive.max.fan-twitter-@MeyerFalcon-filtered.txt-shallow-20200712-010529-9ji82-00000.warc.os.cdx.gz 1024606 download
urls-archive.max.fan-twitter-@MezardJacques-filtered.txt-shallow-20200712-010526-87cn1-meta.warc.gz 101186 download   job
urls-archive.max.fan-twitter-@MezardJacques-filtered.txt-shallow-20200712-010526-87cn1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@MfaEgypt-filtered.txt-shallow-20200712-003648-ds9o0-urls.txt 191253 download
urls-archive.max.fan-twitter-@NAACP_LDF-filtered.txt-shallow-20200711-213244-ck5cg-urls.txt 1454515 download
urls-archive.max.fan-twitter-@NWSTampaBay-filtered.txt-shallow-20200711-194621-d8zz5-00000.warc.gz 4315791611 download   job
urls-archive.max.fan-twitter-@NWSTampaBay-filtered.txt-shallow-20200711-194621-d8zz5-00000.warc.os.cdx.gz 3630966 download
urls-archive.max.fan-twitter-@NWSTampaBay-filtered.txt-shallow-20200711-194621-d8zz5-meta.warc.gz 1872659 download   job
urls-archive.max.fan-twitter-@NWSTampaBay-filtered.txt-shallow-20200711-194621-d8zz5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@NWSTampaBay-filtered.txt-shallow-20200711-194621-d8zz5.json 337 download   job
urls-archive.max.fan-twitter-@little_pengelly-filtered.txt-shallow-20200712-034419-acoqe-00000.warc.gz 88022399 download   job
urls-archive.max.fan-twitter-@little_pengelly-filtered.txt-shallow-20200712-034419-acoqe-00000.warc.os.cdx.gz 188432 download
urls-archive.max.fan-twitter-@little_pengelly-filtered.txt-shallow-20200712-034419-acoqe-meta.warc.gz 104752 download   job
urls-archive.max.fan-twitter-@little_pengelly-filtered.txt-shallow-20200712-034419-acoqe-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@little_pengelly-filtered.txt-shallow-20200712-034419-acoqe-urls.txt 36285 download
urls-archive.max.fan-twitter-@little_pengelly-filtered.txt-shallow-20200712-034419-acoqe.json 345 download   job
urls-archive.max.fan-twitter-@lopezobrador_-filtered.txt-shallow-20200712-033943-ejd7h-00000.warc.gz 31574587 download   job
urls-archive.max.fan-twitter-@lopezobrador_-filtered.txt-shallow-20200712-033943-ejd7h-00000.warc.os.cdx.gz 72362 download
urls-archive.max.fan-twitter-@lopezobrador_-filtered.txt-shallow-20200712-033943-ejd7h-meta.warc.gz 42410 download   job
urls-archive.max.fan-twitter-@lopezobrador_-filtered.txt-shallow-20200712-033943-ejd7h-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lopezobrador_-filtered.txt-shallow-20200712-033943-ejd7h-urls.txt 3843 download
urls-archive.max.fan-twitter-@loracorkelley-filtered.txt-shallow-20200712-033942-b5l64-00000.warc.gz 142513702 download   job
urls-archive.max.fan-twitter-@loracorkelley-filtered.txt-shallow-20200712-033942-b5l64-00000.warc.os.cdx.gz 174773 download
urls-archive.max.fan-twitter-@loracorkelley-filtered.txt-shallow-20200712-033942-b5l64-meta.warc.gz 97139 download   job
urls-archive.max.fan-twitter-@loracorkelley-filtered.txt-shallow-20200712-033942-b5l64-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@loracorkelley-filtered.txt-shallow-20200712-033942-b5l64-urls.txt 101590 download
urls-archive.max.fan-twitter-@ltgovmeyer-filtered.txt-shallow-20200712-033913-db6uy-00000.warc.gz 1129414 download   job
urls-archive.max.fan-twitter-@ltgovmeyer-filtered.txt-shallow-20200712-033913-db6uy-00000.warc.os.cdx.gz 4387 download
urls-archive.max.fan-twitter-@ltgovmeyer-filtered.txt-shallow-20200712-033913-db6uy-meta.warc.gz 6339 download   job
urls-archive.max.fan-twitter-@ltgovmeyer-filtered.txt-shallow-20200712-033913-db6uy-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lucyfrazermp-filtered.txt-shallow-20200712-033223-equu5-00000.warc.gz 11824026 download   job
urls-archive.max.fan-twitter-@lucyfrazermp-filtered.txt-shallow-20200712-033223-equu5-00000.warc.os.cdx.gz 45829 download
urls-archive.max.fan-twitter-@lucyfrazermp-filtered.txt-shallow-20200712-033223-equu5-meta.warc.gz 29027 download   job
urls-archive.max.fan-twitter-@lucyfrazermp-filtered.txt-shallow-20200712-033223-equu5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lucyfrazermp-filtered.txt-shallow-20200712-033223-equu5-urls.txt 3180 download
urls-archive.max.fan-twitter-@lucyfrazermp-filtered.txt-shallow-20200712-033223-equu5.json 339 download   job
urls-archive.max.fan-twitter-@lucymcbath-filtered.txt-shallow-20200712-033221-2qsfv-00000.warc.gz 8885227 download   job
urls-archive.max.fan-twitter-@lucymcbath-filtered.txt-shallow-20200712-033221-2qsfv-00000.warc.os.cdx.gz 41291 download
urls-archive.max.fan-twitter-@lucymcbath-filtered.txt-shallow-20200712-033221-2qsfv-meta.warc.gz 26241 download   job
urls-archive.max.fan-twitter-@lucymcbath-filtered.txt-shallow-20200712-033221-2qsfv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lucymcbath-filtered.txt-shallow-20200712-033221-2qsfv-urls.txt 2088 download
urls-archive.max.fan-twitter-@lucymcbath-filtered.txt-shallow-20200712-033221-2qsfv.json 335 download   job
urls-archive.max.fan-twitter-@luisalonsolugo-filtered.txt-shallow-20200712-032444-86t08-00000.warc.gz 252209712 download   job
urls-archive.max.fan-twitter-@luisalonsolugo-filtered.txt-shallow-20200712-032444-86t08-00000.warc.os.cdx.gz 349328 download
urls-archive.max.fan-twitter-@luisalonsolugo-filtered.txt-shallow-20200712-032444-86t08-meta.warc.gz 188017 download   job
urls-archive.max.fan-twitter-@luisalonsolugo-filtered.txt-shallow-20200712-032444-86t08-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@luisalonsolugo-filtered.txt-shallow-20200712-032444-86t08-urls.txt 239487 download
urls-archive.max.fan-twitter-@luisalonsolugo-filtered.txt-shallow-20200712-032444-86t08.json 343 download   job
urls-archive.max.fan-twitter-@m_ebrard-filtered.txt-shallow-20200712-011742-20tm2-meta.warc.gz 78218 download   job
urls-archive.max.fan-twitter-@m_ebrard-filtered.txt-shallow-20200712-011742-20tm2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@maddiemcgarvey-filtered.txt-shallow-20200712-032141-bxeyx-00000.warc.gz 401000151 download   job
urls-archive.max.fan-twitter-@maddiemcgarvey-filtered.txt-shallow-20200712-032141-bxeyx-00000.warc.os.cdx.gz 497389 download
urls-archive.max.fan-twitter-@maddiemcgarvey-filtered.txt-shallow-20200712-032141-bxeyx-meta.warc.gz 269438 download   job
urls-archive.max.fan-twitter-@maddiemcgarvey-filtered.txt-shallow-20200712-032141-bxeyx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@maddiemcgarvey-filtered.txt-shallow-20200712-032141-bxeyx-urls.txt 302273 download
urls-archive.max.fan-twitter-@maddiemcgarvey-filtered.txt-shallow-20200712-032141-bxeyx.json 343 download   job
urls-archive.max.fan-twitter-@magancrane-filtered.txt-shallow-20200712-031458-123ih-00000.warc.gz 241271583 download   job
urls-archive.max.fan-twitter-@magancrane-filtered.txt-shallow-20200712-031458-123ih-00000.warc.os.cdx.gz 281922 download
urls-archive.max.fan-twitter-@magancrane-filtered.txt-shallow-20200712-031458-123ih-meta.warc.gz 153208 download   job
urls-archive.max.fan-twitter-@magancrane-filtered.txt-shallow-20200712-031458-123ih-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@magancrane-filtered.txt-shallow-20200712-031458-123ih-urls.txt 159748 download
urls-archive.max.fan-twitter-@magancrane-filtered.txt-shallow-20200712-031458-123ih.json 335 download   job
urls-archive.max.fan-twitter-@mannyNYT-filtered.txt-shallow-20200712-030112-99939-00000.warc.gz 94038267 download   job
urls-archive.max.fan-twitter-@mannyNYT-filtered.txt-shallow-20200712-030112-99939-00000.warc.os.cdx.gz 303461 download
urls-archive.max.fan-twitter-@mannyNYT-filtered.txt-shallow-20200712-030112-99939-meta.warc.gz 165332 download   job
urls-archive.max.fan-twitter-@mannyNYT-filtered.txt-shallow-20200712-030112-99939-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mannyNYT-filtered.txt-shallow-20200712-030112-99939-urls.txt 48333 download
urls-archive.max.fan-twitter-@mannyNYT-filtered.txt-shallow-20200712-030112-99939.json 331 download   job
urls-archive.max.fan-twitter-@marclauritsen-filtered.txt-shallow-20200712-030103-1ur3w-meta.warc.gz 193482 download   job
urls-archive.max.fan-twitter-@marclauritsen-filtered.txt-shallow-20200712-030103-1ur3w-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@marclauritsen-filtered.txt-shallow-20200712-030103-1ur3w-urls.txt 219567 download
urls-archive.max.fan-twitter-@mariarussobooks-filtered.txt-shallow-20200712-025400-56kpz-00000.warc.gz 102721770 download   job
urls-archive.max.fan-twitter-@mariarussobooks-filtered.txt-shallow-20200712-025400-56kpz-00000.warc.os.cdx.gz 269310 download
urls-archive.max.fan-twitter-@mariarussobooks-filtered.txt-shallow-20200712-025400-56kpz-meta.warc.gz 145336 download   job
urls-archive.max.fan-twitter-@mariarussobooks-filtered.txt-shallow-20200712-025400-56kpz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mariarussobooks-filtered.txt-shallow-20200712-025400-56kpz-urls.txt 66946 download
urls-archive.max.fan-twitter-@mariarussobooks-filtered.txt-shallow-20200712-025400-56kpz.json 345 download   job
urls-archive.max.fan-twitter-@marion_lenne-filtered.txt-shallow-20200712-025359-1b2cp-00000.warc.gz 109201572 download   job
urls-archive.max.fan-twitter-@marion_lenne-filtered.txt-shallow-20200712-025359-1b2cp-00000.warc.os.cdx.gz 128616 download
urls-archive.max.fan-twitter-@marion_lenne-filtered.txt-shallow-20200712-025359-1b2cp-meta.warc.gz 72235 download   job
urls-archive.max.fan-twitter-@marion_lenne-filtered.txt-shallow-20200712-025359-1b2cp-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@marion_lenne-filtered.txt-shallow-20200712-025359-1b2cp-urls.txt 59872 download
urls-archive.max.fan-twitter-@marion_lenne-filtered.txt-shallow-20200712-025359-1b2cp.json 339 download   job
urls-archive.max.fan-twitter-@marwilliamson-filtered.txt-shallow-20200712-023051-3id5d-meta.warc.gz 1626497 download   job
urls-archive.max.fan-twitter-@marwilliamson-filtered.txt-shallow-20200712-023051-3id5d-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@marwilliamson-filtered.txt-shallow-20200712-023051-3id5d-urls.txt 440847 download
urls-archive.max.fan-twitter-@marwilliamson-filtered.txt-shallow-20200712-023051-3id5d.json 341 download   job
urls-archive.max.fan-twitter-@matcatch-filtered.txt-shallow-20200712-022225-37wtw-00000.warc.gz 379154265 download   job
urls-archive.max.fan-twitter-@matcatch-filtered.txt-shallow-20200712-022225-37wtw-00000.warc.os.cdx.gz 432731 download
urls-archive.max.fan-twitter-@matcatch-filtered.txt-shallow-20200712-022225-37wtw-meta.warc.gz 229568 download   job
urls-archive.max.fan-twitter-@matcatch-filtered.txt-shallow-20200712-022225-37wtw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@matcatch-filtered.txt-shallow-20200712-022225-37wtw-urls.txt 325010 download
urls-archive.max.fan-twitter-@matcatch-filtered.txt-shallow-20200712-022225-37wtw.json 331 download   job
urls-archive.max.fan-twitter-@mattfleg-filtered.txt-shallow-20200712-022221-680fb-00000.warc.gz 549496588 download   job
urls-archive.max.fan-twitter-@mattfleg-filtered.txt-shallow-20200712-022221-680fb-00000.warc.os.cdx.gz 1477599 download
urls-archive.max.fan-twitter-@mattfleg-filtered.txt-shallow-20200712-022221-680fb-meta.warc.gz 779460 download   job
urls-archive.max.fan-twitter-@mattfleg-filtered.txt-shallow-20200712-022221-680fb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mattfleg-filtered.txt-shallow-20200712-022221-680fb-urls.txt 389759 download
urls-archive.max.fan-twitter-@mattfleg-filtered.txt-shallow-20200712-022221-680fb.json 331 download   job
urls-archive.max.fan-twitter-@mayaalleruzzo-filtered.txt-shallow-20200712-021821-1xaja-00000.warc.gz 171824316 download   job
urls-archive.max.fan-twitter-@mayaalleruzzo-filtered.txt-shallow-20200712-021821-1xaja-00000.warc.os.cdx.gz 215536 download
urls-archive.max.fan-twitter-@mayaalleruzzo-filtered.txt-shallow-20200712-021821-1xaja-meta.warc.gz 117547 download   job
urls-archive.max.fan-twitter-@mayaalleruzzo-filtered.txt-shallow-20200712-021821-1xaja-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mayaalleruzzo-filtered.txt-shallow-20200712-021821-1xaja-urls.txt 98146 download
urls-archive.max.fan-twitter-@mayaalleruzzo-filtered.txt-shallow-20200712-021821-1xaja.json 341 download   job
urls-archive.max.fan-twitter-@mayasweedler-filtered.txt-shallow-20200712-021416-9y5o8-00000.warc.gz 67176287 download   job
urls-archive.max.fan-twitter-@mayasweedler-filtered.txt-shallow-20200712-021416-9y5o8-00000.warc.os.cdx.gz 132127 download
urls-archive.max.fan-twitter-@mayasweedler-filtered.txt-shallow-20200712-021416-9y5o8-meta.warc.gz 74709 download   job
urls-archive.max.fan-twitter-@mayasweedler-filtered.txt-shallow-20200712-021416-9y5o8-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mayasweedler-filtered.txt-shallow-20200712-021416-9y5o8-urls.txt 33697 download
urls-archive.max.fan-twitter-@mayasweedler-filtered.txt-shallow-20200712-021416-9y5o8.json 339 download   job
urls-archive.max.fan-twitter-@mayor_margo-filtered.txt-shallow-20200712-020007-ae4q1-00000.warc.gz 98267416 download   job
urls-archive.max.fan-twitter-@mayor_margo-filtered.txt-shallow-20200712-020007-ae4q1-00000.warc.os.cdx.gz 160356 download
urls-archive.max.fan-twitter-@mayor_margo-filtered.txt-shallow-20200712-020007-ae4q1-meta.warc.gz 89021 download   job
urls-archive.max.fan-twitter-@mayor_margo-filtered.txt-shallow-20200712-020007-ae4q1-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mayor_margo-filtered.txt-shallow-20200712-020007-ae4q1-urls.txt 29710 download
urls-archive.max.fan-twitter-@mayor_margo-filtered.txt-shallow-20200712-020007-ae4q1.json 337 download   job
urls-archive.max.fan-twitter-@mclaudebibeau-filtered.txt-shallow-20200712-013714-1mr0i-00000.warc.gz 886704087 download   job
urls-archive.max.fan-twitter-@mclaudebibeau-filtered.txt-shallow-20200712-013714-1mr0i-00000.warc.os.cdx.gz 1066044 download
urls-archive.max.fan-twitter-@mclaudebibeau-filtered.txt-shallow-20200712-013714-1mr0i-meta.warc.gz 572900 download   job
urls-archive.max.fan-twitter-@mclaudebibeau-filtered.txt-shallow-20200712-013714-1mr0i-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mclaudebibeau-filtered.txt-shallow-20200712-013714-1mr0i-urls.txt 216074 download
urls-archive.max.fan-twitter-@mclaudebibeau-filtered.txt-shallow-20200712-013714-1mr0i.json 341 download   job
urls-archive.max.fan-twitter-@meghanbarr-filtered.txt-shallow-20200712-011337-3pqzr-urls.txt 137497 download
urls-archive.max.fan-twitter-@meghanbarr-filtered.txt-shallow-20200712-011337-3pqzr.json 335 download   job
urls-archive.max.fan-twitter-@melbournecoal-filtered.txt-shallow-20200712-011335-8pgp2.json 341 download   job
urls-transfer.notkiska.pw-asylums.insanejournal.com-clever_girl-ctl8k-remaining-f-shallow-20200622-171611-dij0q-00002.warc.gz 5374771579 download   job
urls-transfer.notkiska.pw-asylums.insanejournal.com-clever_girl-ctl8k-remaining-f-shallow-20200622-171611-dij0q-00002.warc.os.cdx.gz 3693010 download
urls-transfer.notkiska.pw-old.reddit.com_selected_threads_Gallifrey_20200712-shallow-20200712-014630-23b1q-00000.warc.gz 2264822444 download   job
urls-transfer.notkiska.pw-old.reddit.com_selected_threads_Gallifrey_20200712-shallow-20200712-014630-23b1q-00000.warc.os.cdx.gz 1752262 download
urls-transfer.notkiska.pw-old.reddit.com_selected_threads_Gallifrey_20200712-shallow-20200712-014630-23b1q-meta.warc.gz 1016442 download   job
urls-transfer.notkiska.pw-old.reddit.com_selected_threads_Gallifrey_20200712-shallow-20200712-014630-23b1q-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-old.reddit.com_selected_threads_Gallifrey_20200712-shallow-20200712-014630-23b1q-urls.txt 576920 download
urls-transfer.notkiska.pw-old.reddit.com_selected_threads_Gallifrey_20200712-shallow-20200712-014630-23b1q.json 388 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00200.warc.gz 5369375576 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00200.warc.os.cdx.gz 597632 download
urls-transfer.notkiska.pw-twitter-%23BlackTwitter-shallow-20200710-163004-dpwry-00002.warc.gz 5368770222 download   job
urls-transfer.notkiska.pw-twitter-%23BlackTwitter-shallow-20200710-163004-dpwry-00002.warc.os.cdx.gz 7188219 download
urls-transfer.notkiska.pw-twitter-%23Srebrenitsa-shallow-20200711-202724-ccuwz-00000.warc.gz 5368820092 download   job
urls-transfer.notkiska.pw-twitter-%23Srebrenitsa-shallow-20200711-202724-ccuwz-00000.warc.os.cdx.gz 6401343 download
urls-transfer.notkiska.pw-twitter-%23qanon-shallow-20200531-053932-8yw79-00075.warc.gz 5369956760 download   job
urls-transfer.notkiska.pw-twitter-%23qanon-shallow-20200531-053932-8yw79-00075.warc.os.cdx.gz 1823658 download
urls-transfer.notkiska.pw-twitter-@Gamecheat13-shallow-20200711-220448-7phho-meta.warc.gz 1574432 download   job
urls-transfer.notkiska.pw-twitter-@Gamecheat13-shallow-20200711-220448-7phho-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@Gamecheat13-shallow-20200711-220448-7phho.json 336 download   job
urls-transfer.notkiska.pw-twitter-@LogicFairy-shallow-20200712-003933-btmr0-00000.warc.gz 1598372022 download   job
urls-transfer.notkiska.pw-twitter-@LogicFairy-shallow-20200712-003933-btmr0-00000.warc.os.cdx.gz 864750 download
urls-transfer.notkiska.pw-twitter-@LogicFairy-shallow-20200712-003933-btmr0-meta.warc.gz 490764 download   job
urls-transfer.notkiska.pw-twitter-@LogicFairy-shallow-20200712-003933-btmr0-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@LogicFairy-shallow-20200712-003933-btmr0-urls.txt 136068 download
urls-transfer.notkiska.pw-twitter-@LogicFairy-shallow-20200712-003933-btmr0.json 332 download   job
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00003.warc.gz 6661698135 download   job
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00003.warc.os.cdx.gz 2158834 download
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00004.warc.gz 6223695305 download   job
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00004.warc.os.cdx.gz 672188 download
urls-transfer.notkiska.pw-twitter-@sarahmanavis-shallow-20200712-003056-5tmlk-meta.warc.gz 2342827 download   job
urls-transfer.notkiska.pw-twitter-@sarahmanavis-shallow-20200712-003056-5tmlk-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@sarahmanavis-shallow-20200712-003056-5tmlk-urls.txt 908247 download
urls-transfer.notkiska.pw-twitter-@sarahmanavis-shallow-20200712-003056-5tmlk.json 336 download   job
www.chinadaily.com.cn-inf-20190927-102302-505np-00464.warc.gz 1073845308 download   job
www.chinadaily.com.cn-inf-20190927-102302-505np-00464.warc.os.cdx.gz 1823474 download
www.luxrenderfarm.de-shallow-20200712-022834-6ph29-00000.warc.gz 163103 download   job
www.luxrenderfarm.de-shallow-20200712-022834-6ph29-00000.warc.os.cdx.gz 1058 download
www.luxrenderfarm.de-shallow-20200712-022834-6ph29-meta.warc.gz 4085 download   job
www.luxrenderfarm.de-shallow-20200712-022834-6ph29-meta.warc.os.cdx.gz 47 download
www.luxrenderfarm.de-shallow-20200712-022834-6ph29.json 248 download   job
www.mudcrutch.com-inf-20200710-231811-ablr0-00002.warc.gz 5374209375 download   job
www.mudcrutch.com-inf-20200710-231811-ablr0-00002.warc.os.cdx.gz 4234365 download
www.refinery29.com-inf-20191002-211042-3symg-00657.warc.gz 5368836294 download   job
www.refinery29.com-inf-20191002-211042-3symg-00657.warc.os.cdx.gz 3002289 download
xenu-directory.net-inf-20200711-222522-1zlrs-aborted-00000.warc.gz 433337472 download   job
xenu-directory.net-inf-20200711-222522-1zlrs-aborted-00000.warc.os.cdx.gz 462069 download
xenu-directory.net-inf-20200711-222522-1zlrs-aborted.json 241 download   job