Item archiveteam_archivebot_go_20200712070003

View on Internet Archive

Filename Size
archiveteam_archivebot_go_20200712070003.cdx.gz 95662403 download
archiveteam_archivebot_go_20200712070003.cdx.idx 80773 download
archiveteam_archivebot_go_20200712070003_files.xml 0 download
archiveteam_archivebot_go_20200712070003_meta.sqlite 734208 download
archiveteam_archivebot_go_20200712070003_meta.xml 969 download
cdn1.ruarxive.org-inf-20200602-221412-82e21-00602.warc.gz 5890861891 download   job
cdn1.ruarxive.org-inf-20200602-221412-82e21-00602.warc.os.cdx.gz 5780 download
deltasdnd.blogspot.com-inf-20200711-221329-d4y7q-meta.warc.gz 4384328 download   job
deltasdnd.blogspot.com-inf-20200711-221329-d4y7q-meta.warc.os.cdx.gz 47 download
ektoplazm.com-inf-20200704-233408-66i1h-00026.warc.gz 5420782029 download   job
ektoplazm.com-inf-20200704-233408-66i1h-00026.warc.os.cdx.gz 3742 download
frothsofdnd.blogspot.com-inf-20200711-230513-4s710-00001.warc.gz 5462415839 download   job
frothsofdnd.blogspot.com-inf-20200711-230513-4s710-00001.warc.os.cdx.gz 1904438 download
frothsofdnd.blogspot.com-inf-20200711-230513-4s710-00002.warc.gz 5377256828 download   job
frothsofdnd.blogspot.com-inf-20200711-230513-4s710-00002.warc.os.cdx.gz 1319789 download
getsatisfaction.com-inf-20200708-234031-epnla-00015.warc.gz 5368744926 download   job
getsatisfaction.com-inf-20200708-234031-epnla-00015.warc.os.cdx.gz 6392568 download
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00002.warc.gz 5379983230 download   job
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00002.warc.os.cdx.gz 86331 download
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00003.warc.gz 5369050796 download   job
leavingscientology.wordpress.com-inf-20200712-024122-17ezt-00003.warc.os.cdx.gz 821345 download
lordofthegreendragons.blogspot.com-inf-20200711-223834-71hcd-00000.warc.gz 1813971344 download   job
lordofthegreendragons.blogspot.com-inf-20200711-223834-71hcd-00000.warc.os.cdx.gz 2329600 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00118.warc.gz 5535990487 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00118.warc.os.cdx.gz 2476597 download
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00119.warc.gz 5368943398 download   job
mediaset.sdasofia.org-inf-20200709-091713-c8wet-00119.warc.os.cdx.gz 51074 download
nyjm.albany.edu-inf-20200709-221433-qbgey-00000.warc.gz 836208538 download   job
nyjm.albany.edu-inf-20200709-221433-qbgey-00000.warc.os.cdx.gz 360992 download
nyjm.albany.edu-inf-20200709-221433-qbgey-meta.warc.gz 212488 download   job
nyjm.albany.edu-inf-20200709-221433-qbgey-meta.warc.os.cdx.gz 47 download
nyjm.albany.edu-inf-20200709-221433-qbgey.json 239 download   job
old.reddit.com-inf-20200711-222159-93eeb-00002.warc.gz 5368710693 download   job
old.reddit.com-inf-20200711-222159-93eeb-00002.warc.os.cdx.gz 2684983 download
old.reddit.com-inf-20200711-222159-93eeb-00003.warc.gz 3561936539 download   job
old.reddit.com-inf-20200711-222159-93eeb-00003.warc.os.cdx.gz 210434 download
old.reddit.com-inf-20200711-222159-93eeb.json 250 download   job
old.reddit.com-inf-20200712-011858-coxzo-00000.warc.gz 5368737268 download   job
old.reddit.com-inf-20200712-011858-coxzo-00000.warc.os.cdx.gz 2051983 download
player.fm-inf-20200501-233943-6recr-00685.warc.gz 5456732112 download   job
player.fm-inf-20200501-233943-6recr-00685.warc.os.cdx.gz 82767 download
transfer.notkiska.pw-shallow-20200712-033413-6t2ew-00000.warc.gz 58189 download   job
transfer.notkiska.pw-shallow-20200712-033413-6t2ew-00000.warc.os.cdx.gz 240 download
urls-archive.max.fan-twitter-@JRBlake-filtered.txt-shallow-20200712-061849-967m5-00000.warc.gz 359530190 download   job
urls-archive.max.fan-twitter-@JRBlake-filtered.txt-shallow-20200712-061849-967m5-00000.warc.os.cdx.gz 908571 download
urls-archive.max.fan-twitter-@JRBlake-filtered.txt-shallow-20200712-061849-967m5-meta.warc.gz 489150 download   job
urls-archive.max.fan-twitter-@JRBlake-filtered.txt-shallow-20200712-061849-967m5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@JY_LeDrian-filtered.txt-shallow-20200712-055305-dg6qj-00000.warc.gz 295813156 download   job
urls-archive.max.fan-twitter-@JY_LeDrian-filtered.txt-shallow-20200712-055305-dg6qj-00000.warc.os.cdx.gz 699152 download
urls-archive.max.fan-twitter-@JocelynNoveckAP-filtered.txt-shallow-20200712-065438-1zi4b-00000.warc.gz 17310742 download   job
urls-archive.max.fan-twitter-@JocelynNoveckAP-filtered.txt-shallow-20200712-065438-1zi4b-00000.warc.os.cdx.gz 35644 download
urls-archive.max.fan-twitter-@JocelynNoveckAP-filtered.txt-shallow-20200712-065438-1zi4b-meta.warc.gz 23775 download   job
urls-archive.max.fan-twitter-@JocelynNoveckAP-filtered.txt-shallow-20200712-065438-1zi4b-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@JocelynNoveckAP-filtered.txt-shallow-20200712-065438-1zi4b-urls.txt 10982 download
urls-archive.max.fan-twitter-@JocelynNoveckAP-filtered.txt-shallow-20200712-065438-1zi4b.json 345 download   job
urls-archive.max.fan-twitter-@JoeSestak-filtered.txt-shallow-20200712-065224-4iq3l-meta.warc.gz 7048 download   job
urls-archive.max.fan-twitter-@JoeSestak-filtered.txt-shallow-20200712-065224-4iq3l-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@JoeSestak-filtered.txt-shallow-20200712-065224-4iq3l.json 333 download   job
urls-archive.max.fan-twitter-@JohnFetterman-filtered.txt-shallow-20200712-064343-7ktbr.json 341 download   job
urls-archive.max.fan-twitter-@JohnFlesher-filtered.txt-shallow-20200712-064341-9ivvt-00000.warc.gz 37164198 download   job
urls-archive.max.fan-twitter-@JohnFlesher-filtered.txt-shallow-20200712-064341-9ivvt-00000.warc.os.cdx.gz 66409 download
urls-archive.max.fan-twitter-@JohnFlesher-filtered.txt-shallow-20200712-064341-9ivvt-meta.warc.gz 40223 download   job
urls-archive.max.fan-twitter-@JohnFlesher-filtered.txt-shallow-20200712-064341-9ivvt-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@JohnMuirNPS-filtered.txt-shallow-20200712-064341-eoa9k-meta.warc.gz 44060 download   job
urls-archive.max.fan-twitter-@JohnMuirNPS-filtered.txt-shallow-20200712-064341-eoa9k-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@JonathanWNV-filtered.txt-shallow-20200712-064225-1a9p6.json 337 download   job
urls-archive.max.fan-twitter-@JoshiSmiley-filtered.txt-shallow-20200712-062830-91za6-urls.txt 103517 download
urls-archive.max.fan-twitter-@JoshiSmiley-filtered.txt-shallow-20200712-062830-91za6.json 337 download   job
urls-archive.max.fan-twitter-@JuliaLopezMP-filtered.txt-shallow-20200712-061123-4r8ta-00000.warc.gz 158738167 download   job
urls-archive.max.fan-twitter-@JuliaLopezMP-filtered.txt-shallow-20200712-061123-4r8ta-00000.warc.os.cdx.gz 193484 download
urls-archive.max.fan-twitter-@JuliaLopezMP-filtered.txt-shallow-20200712-061123-4r8ta-urls.txt 33339 download
urls-archive.max.fan-twitter-@JuliaLopezMP-filtered.txt-shallow-20200712-061123-4r8ta.json 339 download   job
urls-archive.max.fan-twitter-@JulianCastro-filtered.txt-shallow-20200712-061123-27akd-00000.warc.gz 144890388 download   job
urls-archive.max.fan-twitter-@JulianCastro-filtered.txt-shallow-20200712-061123-27akd-00000.warc.os.cdx.gz 649801 download
urls-archive.max.fan-twitter-@JulianCastro-filtered.txt-shallow-20200712-061123-27akd-urls.txt 35519 download
urls-archive.max.fan-twitter-@JulianCastro-filtered.txt-shallow-20200712-061123-27akd.json 339 download   job
urls-archive.max.fan-twitter-@JyotiThottam-filtered.txt-shallow-20200712-055021-tps9r-00000.warc.gz 26907118 download   job
urls-archive.max.fan-twitter-@JyotiThottam-filtered.txt-shallow-20200712-055021-tps9r-00000.warc.os.cdx.gz 54928 download
urls-archive.max.fan-twitter-@JyotiThottam-filtered.txt-shallow-20200712-055021-tps9r-meta.warc.gz 34009 download   job
urls-archive.max.fan-twitter-@JyotiThottam-filtered.txt-shallow-20200712-055021-tps9r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@JyotiThottam-filtered.txt-shallow-20200712-055021-tps9r-urls.txt 20642 download
urls-archive.max.fan-twitter-@JyotiThottam-filtered.txt-shallow-20200712-055021-tps9r.json 339 download   job
urls-archive.max.fan-twitter-@KYGOP-filtered.txt-shallow-20200712-044357-7sltz-00000.warc.gz 587994490 download   job
urls-archive.max.fan-twitter-@KYGOP-filtered.txt-shallow-20200712-044357-7sltz-00000.warc.os.cdx.gz 812149 download
urls-archive.max.fan-twitter-@KYGOP-filtered.txt-shallow-20200712-044357-7sltz-meta.warc.gz 435512 download   job
urls-archive.max.fan-twitter-@KYGOP-filtered.txt-shallow-20200712-044357-7sltz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KYGOP-filtered.txt-shallow-20200712-044357-7sltz-urls.txt 243952 download
urls-archive.max.fan-twitter-@KYGOP-filtered.txt-shallow-20200712-044357-7sltz.json 325 download   job
urls-archive.max.fan-twitter-@KYSecState-filtered.txt-shallow-20200712-044352-7kwgj-00000.warc.gz 141382199 download   job
urls-archive.max.fan-twitter-@KYSecState-filtered.txt-shallow-20200712-044352-7kwgj-00000.warc.os.cdx.gz 213000 download
urls-archive.max.fan-twitter-@KYSecState-filtered.txt-shallow-20200712-044352-7kwgj-meta.warc.gz 117695 download   job
urls-archive.max.fan-twitter-@KYSecState-filtered.txt-shallow-20200712-044352-7kwgj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KYSecState-filtered.txt-shallow-20200712-044352-7kwgj-urls.txt 37874 download
urls-archive.max.fan-twitter-@KYSecState-filtered.txt-shallow-20200712-044352-7kwgj.json 335 download   job
urls-archive.max.fan-twitter-@K_G_Andersen-filtered.txt-shallow-20200712-050949-43eut-00000.warc.gz 444736614 download   job
urls-archive.max.fan-twitter-@K_G_Andersen-filtered.txt-shallow-20200712-050949-43eut-00000.warc.os.cdx.gz 662933 download
urls-archive.max.fan-twitter-@K_G_Andersen-filtered.txt-shallow-20200712-050949-43eut-meta.warc.gz 354467 download   job
urls-archive.max.fan-twitter-@K_G_Andersen-filtered.txt-shallow-20200712-050949-43eut-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@K_G_Andersen-filtered.txt-shallow-20200712-050949-43eut-urls.txt 232605 download
urls-archive.max.fan-twitter-@K_G_Andersen-filtered.txt-shallow-20200712-050949-43eut.json 339 download   job
urls-archive.max.fan-twitter-@K_Jhinaoui-filtered.txt-shallow-20200712-050108-9oshm-00000.warc.gz 37977829 download   job
urls-archive.max.fan-twitter-@K_Jhinaoui-filtered.txt-shallow-20200712-050108-9oshm-00000.warc.os.cdx.gz 49657 download
urls-archive.max.fan-twitter-@K_Jhinaoui-filtered.txt-shallow-20200712-050108-9oshm-meta.warc.gz 43825 download   job
urls-archive.max.fan-twitter-@K_Jhinaoui-filtered.txt-shallow-20200712-050108-9oshm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@K_Jhinaoui-filtered.txt-shallow-20200712-050108-9oshm-urls.txt 9657 download
urls-archive.max.fan-twitter-@K_Jhinaoui-filtered.txt-shallow-20200712-050108-9oshm.json 335 download   job
urls-archive.max.fan-twitter-@KansasGOP-filtered.txt-shallow-20200712-054044-4snh3-00000.warc.gz 411921380 download   job
urls-archive.max.fan-twitter-@KansasGOP-filtered.txt-shallow-20200712-054044-4snh3-00000.warc.os.cdx.gz 565071 download
urls-archive.max.fan-twitter-@KansasGOP-filtered.txt-shallow-20200712-054044-4snh3-meta.warc.gz 301941 download   job
urls-archive.max.fan-twitter-@KansasGOP-filtered.txt-shallow-20200712-054044-4snh3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KansasGOP-filtered.txt-shallow-20200712-054044-4snh3-urls.txt 230727 download
urls-archive.max.fan-twitter-@KansasGOP-filtered.txt-shallow-20200712-054044-4snh3.json 333 download   job
urls-archive.max.fan-twitter-@Kashmir4Youth-filtered.txt-shallow-20200712-053927-747j7-00000.warc.gz 4834228 download   job
urls-archive.max.fan-twitter-@Kashmir4Youth-filtered.txt-shallow-20200712-053927-747j7-00000.warc.os.cdx.gz 9896 download
urls-archive.max.fan-twitter-@Kashmir4Youth-filtered.txt-shallow-20200712-053927-747j7-meta.warc.gz 9410 download   job
urls-archive.max.fan-twitter-@Kashmir4Youth-filtered.txt-shallow-20200712-053927-747j7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Kashmir4Youth-filtered.txt-shallow-20200712-053927-747j7-urls.txt 1440 download
urls-archive.max.fan-twitter-@Kashmir4Youth-filtered.txt-shallow-20200712-053927-747j7.json 341 download   job
urls-archive.max.fan-twitter-@KaylaForIowa-filtered.txt-shallow-20200712-053322-831bw-00000.warc.gz 1464363 download   job
urls-archive.max.fan-twitter-@KaylaForIowa-filtered.txt-shallow-20200712-053322-831bw-00000.warc.os.cdx.gz 6478 download
urls-archive.max.fan-twitter-@KaylaForIowa-filtered.txt-shallow-20200712-053322-831bw-meta.warc.gz 7607 download   job
urls-archive.max.fan-twitter-@KaylaForIowa-filtered.txt-shallow-20200712-053322-831bw-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KaylaForIowa-filtered.txt-shallow-20200712-053322-831bw-urls.txt 240 download
urls-archive.max.fan-twitter-@KaylaForIowa-filtered.txt-shallow-20200712-053322-831bw.json 339 download   job
urls-archive.max.fan-twitter-@KayserEnneking-filtered.txt-shallow-20200712-053259-bx07y-00000.warc.gz 45696901 download   job
urls-archive.max.fan-twitter-@KayserEnneking-filtered.txt-shallow-20200712-053259-bx07y-00000.warc.os.cdx.gz 80070 download
urls-archive.max.fan-twitter-@KayserEnneking-filtered.txt-shallow-20200712-053259-bx07y-meta.warc.gz 46965 download   job
urls-archive.max.fan-twitter-@KayserEnneking-filtered.txt-shallow-20200712-053259-bx07y-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KayserEnneking-filtered.txt-shallow-20200712-053259-bx07y-urls.txt 20782 download
urls-archive.max.fan-twitter-@KayserEnneking-filtered.txt-shallow-20200712-053259-bx07y.json 343 download   job
urls-archive.max.fan-twitter-@KemiBadenoch-filtered.txt-shallow-20200712-052935-ckz28-00000.warc.gz 3134823 download   job
urls-archive.max.fan-twitter-@KemiBadenoch-filtered.txt-shallow-20200712-052935-ckz28-00000.warc.os.cdx.gz 14716 download
urls-archive.max.fan-twitter-@KemiBadenoch-filtered.txt-shallow-20200712-052935-ckz28-meta.warc.gz 12173 download   job
urls-archive.max.fan-twitter-@KemiBadenoch-filtered.txt-shallow-20200712-052935-ckz28-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KemiBadenoch-filtered.txt-shallow-20200712-052935-ckz28-urls.txt 779 download
urls-archive.max.fan-twitter-@KemiBadenoch-filtered.txt-shallow-20200712-052935-ckz28.json 339 download   job
urls-archive.max.fan-twitter-@KenyaMissionUN-filtered.txt-shallow-20200712-052933-ad44g-00000.warc.gz 183592230 download   job
urls-archive.max.fan-twitter-@KenyaMissionUN-filtered.txt-shallow-20200712-052933-ad44g-00000.warc.os.cdx.gz 197175 download
urls-archive.max.fan-twitter-@KenyaMissionUN-filtered.txt-shallow-20200712-052933-ad44g-meta.warc.gz 107196 download   job
urls-archive.max.fan-twitter-@KenyaMissionUN-filtered.txt-shallow-20200712-052933-ad44g-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KenyaMissionUN-filtered.txt-shallow-20200712-052933-ad44g-urls.txt 46580 download
urls-archive.max.fan-twitter-@KenyaMissionUN-filtered.txt-shallow-20200712-052933-ad44g.json 343 download   job
urls-archive.max.fan-twitter-@KeralaSDMA-filtered.txt-shallow-20200712-052233-71i9o-00000.warc.gz 216103950 download   job
urls-archive.max.fan-twitter-@KeralaSDMA-filtered.txt-shallow-20200712-052233-71i9o-00000.warc.os.cdx.gz 227417 download
urls-archive.max.fan-twitter-@KeralaSDMA-filtered.txt-shallow-20200712-052233-71i9o-meta.warc.gz 120618 download   job
urls-archive.max.fan-twitter-@KeralaSDMA-filtered.txt-shallow-20200712-052233-71i9o-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KeralaSDMA-filtered.txt-shallow-20200712-052233-71i9o-urls.txt 67048 download
urls-archive.max.fan-twitter-@KeralaSDMA-filtered.txt-shallow-20200712-052233-71i9o.json 335 download   job
urls-archive.max.fan-twitter-@KerstiKaljulaid-filtered.txt-shallow-20200712-052119-42e01-00000.warc.gz 11444758 download   job
urls-archive.max.fan-twitter-@KerstiKaljulaid-filtered.txt-shallow-20200712-052119-42e01-00000.warc.os.cdx.gz 30137 download
urls-archive.max.fan-twitter-@KerstiKaljulaid-filtered.txt-shallow-20200712-052119-42e01-meta.warc.gz 20301 download   job
urls-archive.max.fan-twitter-@KerstiKaljulaid-filtered.txt-shallow-20200712-052119-42e01-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KerstiKaljulaid-filtered.txt-shallow-20200712-052119-42e01-urls.txt 2016 download
urls-archive.max.fan-twitter-@KerstiKaljulaid-filtered.txt-shallow-20200712-052119-42e01.json 345 download   job
urls-archive.max.fan-twitter-@KirkSemple-filtered.txt-shallow-20200712-050924-39auk-00000.warc.gz 110190131 download   job
urls-archive.max.fan-twitter-@KirkSemple-filtered.txt-shallow-20200712-050924-39auk-00000.warc.os.cdx.gz 251016 download
urls-archive.max.fan-twitter-@KirkSemple-filtered.txt-shallow-20200712-050924-39auk-meta.warc.gz 137327 download   job
urls-archive.max.fan-twitter-@KirkSemple-filtered.txt-shallow-20200712-050924-39auk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KirkSemple-filtered.txt-shallow-20200712-050924-39auk-urls.txt 83665 download
urls-archive.max.fan-twitter-@KirkSemple-filtered.txt-shallow-20200712-050924-39auk.json 335 download   job
urls-archive.max.fan-twitter-@Kirstene4Angus-filtered.txt-shallow-20200712-050855-13qcj-00000.warc.gz 1947214 download   job
urls-archive.max.fan-twitter-@Kirstene4Angus-filtered.txt-shallow-20200712-050855-13qcj-00000.warc.os.cdx.gz 7698 download
urls-archive.max.fan-twitter-@Kirstene4Angus-filtered.txt-shallow-20200712-050855-13qcj-meta.warc.gz 8328 download   job
urls-archive.max.fan-twitter-@Kirstene4Angus-filtered.txt-shallow-20200712-050855-13qcj-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Kirstene4Angus-filtered.txt-shallow-20200712-050855-13qcj-urls.txt 184 download
urls-archive.max.fan-twitter-@Kirstene4Angus-filtered.txt-shallow-20200712-050855-13qcj.json 343 download   job
urls-archive.max.fan-twitter-@KitFrieden-filtered.txt-shallow-20200712-050112-1m8rm-00000.warc.gz 77274898 download   job
urls-archive.max.fan-twitter-@KitFrieden-filtered.txt-shallow-20200712-050112-1m8rm-00000.warc.os.cdx.gz 88309 download
urls-archive.max.fan-twitter-@KitFrieden-filtered.txt-shallow-20200712-050112-1m8rm-meta.warc.gz 51964 download   job
urls-archive.max.fan-twitter-@KitFrieden-filtered.txt-shallow-20200712-050112-1m8rm-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KitFrieden-filtered.txt-shallow-20200712-050112-1m8rm-urls.txt 57746 download
urls-archive.max.fan-twitter-@KitFrieden-filtered.txt-shallow-20200712-050112-1m8rm.json 335 download   job
urls-archive.max.fan-twitter-@KlausIohannis-filtered.txt-shallow-20200712-045616-36mao-00000.warc.gz 3303566 download   job
urls-archive.max.fan-twitter-@KlausIohannis-filtered.txt-shallow-20200712-045616-36mao-00000.warc.os.cdx.gz 10569 download
urls-archive.max.fan-twitter-@KlausIohannis-filtered.txt-shallow-20200712-045616-36mao-meta.warc.gz 9848 download   job
urls-archive.max.fan-twitter-@KlausIohannis-filtered.txt-shallow-20200712-045616-36mao-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KlausIohannis-filtered.txt-shallow-20200712-045616-36mao-urls.txt 244 download
urls-archive.max.fan-twitter-@KlausIohannis-filtered.txt-shallow-20200712-045616-36mao.json 341 download   job
urls-archive.max.fan-twitter-@KolindaGK-filtered.txt-shallow-20200712-044827-3v6un-00000.warc.gz 256310610 download   job
urls-archive.max.fan-twitter-@KolindaGK-filtered.txt-shallow-20200712-044827-3v6un-00000.warc.os.cdx.gz 460941 download
urls-archive.max.fan-twitter-@KolindaGK-filtered.txt-shallow-20200712-044827-3v6un-meta.warc.gz 247088 download   job
urls-archive.max.fan-twitter-@KolindaGK-filtered.txt-shallow-20200712-044827-3v6un-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KolindaGK-filtered.txt-shallow-20200712-044827-3v6un-urls.txt 61107 download
urls-archive.max.fan-twitter-@KolindaGK-filtered.txt-shallow-20200712-044827-3v6un.json 333 download   job
urls-archive.max.fan-twitter-@KwasiKwarteng-filtered.txt-shallow-20200712-044516-e1zq0-00000.warc.gz 8069091 download   job
urls-archive.max.fan-twitter-@KwasiKwarteng-filtered.txt-shallow-20200712-044516-e1zq0-00000.warc.os.cdx.gz 35151 download
urls-archive.max.fan-twitter-@KwasiKwarteng-filtered.txt-shallow-20200712-044516-e1zq0-meta.warc.gz 23246 download   job
urls-archive.max.fan-twitter-@KwasiKwarteng-filtered.txt-shallow-20200712-044516-e1zq0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KwasiKwarteng-filtered.txt-shallow-20200712-044516-e1zq0-urls.txt 1952 download
urls-archive.max.fan-twitter-@KwasiKwarteng-filtered.txt-shallow-20200712-044516-e1zq0.json 341 download   job
urls-archive.max.fan-twitter-@KyrgyzMissionUN-filtered.txt-shallow-20200712-044354-q5rmx-00000.warc.gz 67951046 download   job
urls-archive.max.fan-twitter-@KyrgyzMissionUN-filtered.txt-shallow-20200712-044354-q5rmx-00000.warc.os.cdx.gz 81718 download
urls-archive.max.fan-twitter-@KyrgyzMissionUN-filtered.txt-shallow-20200712-044354-q5rmx-meta.warc.gz 47491 download   job
urls-archive.max.fan-twitter-@KyrgyzMissionUN-filtered.txt-shallow-20200712-044354-q5rmx-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@KyrgyzMissionUN-filtered.txt-shallow-20200712-044354-q5rmx-urls.txt 15042 download
urls-archive.max.fan-twitter-@KyrgyzMissionUN-filtered.txt-shallow-20200712-044354-q5rmx.json 345 download   job
urls-archive.max.fan-twitter-@LACOFD-filtered.txt-shallow-20200712-042330-e8tul-00000.warc.gz 550455436 download   job
urls-archive.max.fan-twitter-@LACOFD-filtered.txt-shallow-20200712-042330-e8tul-00000.warc.os.cdx.gz 832846 download
urls-archive.max.fan-twitter-@LACOFD-filtered.txt-shallow-20200712-042330-e8tul-meta.warc.gz 448049 download   job
urls-archive.max.fan-twitter-@LACOFD-filtered.txt-shallow-20200712-042330-e8tul-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LACOFD-filtered.txt-shallow-20200712-042330-e8tul-urls.txt 151417 download
urls-archive.max.fan-twitter-@LACOFD-filtered.txt-shallow-20200712-042330-e8tul.json 327 download   job
urls-archive.max.fan-twitter-@LACity-filtered.txt-shallow-20200712-042448-1yaac-00000.warc.gz 412945941 download   job
urls-archive.max.fan-twitter-@LACity-filtered.txt-shallow-20200712-042448-1yaac-00000.warc.os.cdx.gz 665335 download
urls-archive.max.fan-twitter-@LACity-filtered.txt-shallow-20200712-042448-1yaac-meta.warc.gz 356968 download   job
urls-archive.max.fan-twitter-@LACity-filtered.txt-shallow-20200712-042448-1yaac-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LACity-filtered.txt-shallow-20200712-042448-1yaac-urls.txt 106189 download
urls-archive.max.fan-twitter-@LACity-filtered.txt-shallow-20200712-042448-1yaac.json 327 download   job
urls-archive.max.fan-twitter-@LACoFDPIO-filtered.txt-shallow-20200712-042352-60atk-00000.warc.gz 770697008 download   job
urls-archive.max.fan-twitter-@LACoFDPIO-filtered.txt-shallow-20200712-042352-60atk-00000.warc.os.cdx.gz 1273526 download
urls-archive.max.fan-twitter-@LACoFDPIO-filtered.txt-shallow-20200712-042352-60atk-meta.warc.gz 674117 download   job
urls-archive.max.fan-twitter-@LACoFDPIO-filtered.txt-shallow-20200712-042352-60atk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LACoFDPIO-filtered.txt-shallow-20200712-042352-60atk-urls.txt 356897 download
urls-archive.max.fan-twitter-@LACoFDPIO-filtered.txt-shallow-20200712-042352-60atk.json 333 download   job
urls-archive.max.fan-twitter-@LKeath-filtered.txt-shallow-20200712-034232-nuq96-urls.txt 51433 download
urls-archive.max.fan-twitter-@LSNJEE-filtered.txt-shallow-20200712-033942-d1ru1-urls.txt 32538 download
urls-archive.max.fan-twitter-@LT_MFA_Stratcom-filtered.txt-shallow-20200712-033227-8yvu0.json 345 download   job
urls-archive.max.fan-twitter-@LUISFELIPE_P-filtered.txt-shallow-20200712-032443-3mbzy-00000.warc.gz 5368735136 download   job
urls-archive.max.fan-twitter-@LUISFELIPE_P-filtered.txt-shallow-20200712-032443-3mbzy-00000.warc.os.cdx.gz 6146610 download
urls-archive.max.fan-twitter-@LUISFELIPE_P-filtered.txt-shallow-20200712-032443-3mbzy-meta.warc.gz 3327100 download   job
urls-archive.max.fan-twitter-@LUISFELIPE_P-filtered.txt-shallow-20200712-032443-3mbzy-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LUISFELIPE_P-filtered.txt-shallow-20200712-032443-3mbzy-urls.txt 1569928 download
urls-archive.max.fan-twitter-@L_MacAulay-filtered.txt-shallow-20200712-034228-2owg9-00000.warc.gz 1331224386 download   job
urls-archive.max.fan-twitter-@L_MacAulay-filtered.txt-shallow-20200712-034228-2owg9-00000.warc.os.cdx.gz 1271435 download
urls-archive.max.fan-twitter-@L_MacAulay-filtered.txt-shallow-20200712-034228-2owg9-meta.warc.gz 679569 download   job
urls-archive.max.fan-twitter-@L_MacAulay-filtered.txt-shallow-20200712-034228-2owg9-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@L_MacAulay-filtered.txt-shallow-20200712-034228-2owg9-urls.txt 365050 download
urls-archive.max.fan-twitter-@L_MacAulay-filtered.txt-shallow-20200712-034228-2owg9.json 335 download   job
urls-archive.max.fan-twitter-@Lacroix_UN-filtered.txt-shallow-20200712-042330-b6p1p-00000.warc.gz 229849913 download   job
urls-archive.max.fan-twitter-@Lacroix_UN-filtered.txt-shallow-20200712-042330-b6p1p-00000.warc.os.cdx.gz 441846 download
urls-archive.max.fan-twitter-@Lacroix_UN-filtered.txt-shallow-20200712-042330-b6p1p-meta.warc.gz 236888 download   job
urls-archive.max.fan-twitter-@Lacroix_UN-filtered.txt-shallow-20200712-042330-b6p1p-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Lacroix_UN-filtered.txt-shallow-20200712-042330-b6p1p-urls.txt 48071 download
urls-archive.max.fan-twitter-@Lacroix_UN-filtered.txt-shallow-20200712-042330-b6p1p.json 335 download   job
urls-archive.max.fan-twitter-@LamYikFei-filtered.txt-shallow-20200712-042137-abjcu-00000.warc.gz 34859424 download   job
urls-archive.max.fan-twitter-@LamYikFei-filtered.txt-shallow-20200712-042137-abjcu-00000.warc.os.cdx.gz 81941 download
urls-archive.max.fan-twitter-@LamYikFei-filtered.txt-shallow-20200712-042137-abjcu-meta.warc.gz 47981 download   job
urls-archive.max.fan-twitter-@LamYikFei-filtered.txt-shallow-20200712-042137-abjcu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LamYikFei-filtered.txt-shallow-20200712-042137-abjcu-urls.txt 7376 download
urls-archive.max.fan-twitter-@LamYikFei-filtered.txt-shallow-20200712-042137-abjcu.json 333 download   job
urls-archive.max.fan-twitter-@LankaMFA-filtered.txt-shallow-20200712-042110-75jw7-00000.warc.gz 1107371 download   job
urls-archive.max.fan-twitter-@LankaMFA-filtered.txt-shallow-20200712-042110-75jw7-00000.warc.os.cdx.gz 4412 download
urls-archive.max.fan-twitter-@LankaMFA-filtered.txt-shallow-20200712-042110-75jw7-meta.warc.gz 6321 download   job
urls-archive.max.fan-twitter-@LankaMFA-filtered.txt-shallow-20200712-042110-75jw7-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LankaMFA-filtered.txt-shallow-20200712-042110-75jw7-urls.txt 55 download
urls-archive.max.fan-twitter-@LankaMFA-filtered.txt-shallow-20200712-042110-75jw7.json 331 download   job
urls-archive.max.fan-twitter-@LarrySOdell-filtered.txt-shallow-20200712-041919-e5myk-00000.warc.gz 39625234 download   job
urls-archive.max.fan-twitter-@LarrySOdell-filtered.txt-shallow-20200712-041919-e5myk-00000.warc.os.cdx.gz 47852 download
urls-archive.max.fan-twitter-@LarrySOdell-filtered.txt-shallow-20200712-041919-e5myk-meta.warc.gz 30110 download   job
urls-archive.max.fan-twitter-@LarrySOdell-filtered.txt-shallow-20200712-041919-e5myk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LarrySOdell-filtered.txt-shallow-20200712-041919-e5myk-urls.txt 34737 download
urls-archive.max.fan-twitter-@LarrySOdell-filtered.txt-shallow-20200712-041919-e5myk.json 337 download   job
urls-archive.max.fan-twitter-@LatDesk-filtered.txt-shallow-20200712-041611-ci1d2-00000.warc.gz 328663304 download   job
urls-archive.max.fan-twitter-@LatDesk-filtered.txt-shallow-20200712-041611-ci1d2-00000.warc.os.cdx.gz 370699 download
urls-archive.max.fan-twitter-@LatDesk-filtered.txt-shallow-20200712-041611-ci1d2-meta.warc.gz 201051 download   job
urls-archive.max.fan-twitter-@LatDesk-filtered.txt-shallow-20200712-041611-ci1d2-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LatDesk-filtered.txt-shallow-20200712-041611-ci1d2-urls.txt 131160 download
urls-archive.max.fan-twitter-@LatDesk-filtered.txt-shallow-20200712-041611-ci1d2.json 329 download   job
urls-archive.max.fan-twitter-@LatviaUN_NY-filtered.txt-shallow-20200712-041238-e6jvs-00000.warc.gz 143824397 download   job
urls-archive.max.fan-twitter-@LatviaUN_NY-filtered.txt-shallow-20200712-041238-e6jvs-00000.warc.os.cdx.gz 175851 download
urls-archive.max.fan-twitter-@LatviaUN_NY-filtered.txt-shallow-20200712-041238-e6jvs-meta.warc.gz 97449 download   job
urls-archive.max.fan-twitter-@LatviaUN_NY-filtered.txt-shallow-20200712-041238-e6jvs-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LatviaUN_NY-filtered.txt-shallow-20200712-041238-e6jvs-urls.txt 44851 download
urls-archive.max.fan-twitter-@LatviaUN_NY-filtered.txt-shallow-20200712-041238-e6jvs.json 337 download   job
urls-archive.max.fan-twitter-@LawrenceGostin-filtered.txt-shallow-20200712-041148-a4jqr-00000.warc.gz 320133032 download   job
urls-archive.max.fan-twitter-@LawrenceGostin-filtered.txt-shallow-20200712-041148-a4jqr-00000.warc.os.cdx.gz 595690 download
urls-archive.max.fan-twitter-@LawrenceGostin-filtered.txt-shallow-20200712-041148-a4jqr-meta.warc.gz 321467 download   job
urls-archive.max.fan-twitter-@LawrenceGostin-filtered.txt-shallow-20200712-041148-a4jqr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LawrenceGostin-filtered.txt-shallow-20200712-041148-a4jqr-urls.txt 171924 download
urls-archive.max.fan-twitter-@LawrenceGostin-filtered.txt-shallow-20200712-041148-a4jqr.json 343 download   job
urls-archive.max.fan-twitter-@LegalAidAgency-filtered.txt-shallow-20200712-040742-7hok4-00000.warc.gz 5733660 download   job
urls-archive.max.fan-twitter-@LegalAidAgency-filtered.txt-shallow-20200712-040742-7hok4-00000.warc.os.cdx.gz 13384 download
urls-archive.max.fan-twitter-@LegalAidAgency-filtered.txt-shallow-20200712-040742-7hok4-meta.warc.gz 11622 download   job
urls-archive.max.fan-twitter-@LegalAidAgency-filtered.txt-shallow-20200712-040742-7hok4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LegalAidAgency-filtered.txt-shallow-20200712-040742-7hok4-urls.txt 2170 download
urls-archive.max.fan-twitter-@LegalAidAgency-filtered.txt-shallow-20200712-040742-7hok4.json 343 download   job
urls-archive.max.fan-twitter-@LegalAidNYC-filtered.txt-shallow-20200712-040741-6sxpo-00000.warc.gz 718921441 download   job
urls-archive.max.fan-twitter-@LegalAidNYC-filtered.txt-shallow-20200712-040741-6sxpo-00000.warc.os.cdx.gz 1220584 download
urls-archive.max.fan-twitter-@LegalAidNYC-filtered.txt-shallow-20200712-040741-6sxpo-meta.warc.gz 639008 download   job
urls-archive.max.fan-twitter-@LegalAidNYC-filtered.txt-shallow-20200712-040741-6sxpo-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LegalAidNYC-filtered.txt-shallow-20200712-040741-6sxpo-urls.txt 365419 download
urls-archive.max.fan-twitter-@LegalAidNYC-filtered.txt-shallow-20200712-040741-6sxpo.json 337 download   job
urls-archive.max.fan-twitter-@LegalAidOntario-filtered.txt-shallow-20200712-040715-8d4ym-00000.warc.gz 1284553 download   job
urls-archive.max.fan-twitter-@LegalAidOntario-filtered.txt-shallow-20200712-040715-8d4ym-00000.warc.os.cdx.gz 6357 download
urls-archive.max.fan-twitter-@LegalAidOntario-filtered.txt-shallow-20200712-040715-8d4ym-meta.warc.gz 7562 download   job
urls-archive.max.fan-twitter-@LegalAidOntario-filtered.txt-shallow-20200712-040715-8d4ym-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LegalAidOntario-filtered.txt-shallow-20200712-040715-8d4ym-urls.txt 252 download
urls-archive.max.fan-twitter-@LegalAidOntario-filtered.txt-shallow-20200712-040715-8d4ym.json 345 download   job
urls-archive.max.fan-twitter-@LegalAidQld-filtered.txt-shallow-20200712-040648-2mp8e-00000.warc.gz 2026519 download   job
urls-archive.max.fan-twitter-@LegalAidQld-filtered.txt-shallow-20200712-040648-2mp8e-00000.warc.os.cdx.gz 5145 download
urls-archive.max.fan-twitter-@LegalAidQld-filtered.txt-shallow-20200712-040648-2mp8e-meta.warc.gz 6744 download   job
urls-archive.max.fan-twitter-@LegalAidQld-filtered.txt-shallow-20200712-040648-2mp8e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LegalAidQld-filtered.txt-shallow-20200712-040648-2mp8e-urls.txt 295 download
urls-archive.max.fan-twitter-@LegalAidQld-filtered.txt-shallow-20200712-040648-2mp8e.json 337 download   job
urls-archive.max.fan-twitter-@LegalAidSoCal-filtered.txt-shallow-20200712-040621-lo9gn-00000.warc.gz 1569232 download   job
urls-archive.max.fan-twitter-@LegalAidSoCal-filtered.txt-shallow-20200712-040621-lo9gn-00000.warc.os.cdx.gz 4965 download
urls-archive.max.fan-twitter-@LegalAidSoCal-filtered.txt-shallow-20200712-040621-lo9gn-meta.warc.gz 6626 download   job
urls-archive.max.fan-twitter-@LegalAidSoCal-filtered.txt-shallow-20200712-040621-lo9gn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LegalAidSoCal-filtered.txt-shallow-20200712-040621-lo9gn-urls.txt 366 download
urls-archive.max.fan-twitter-@LegalAidSoCal-filtered.txt-shallow-20200712-040621-lo9gn.json 341 download   job
urls-archive.max.fan-twitter-@LegalAid_Rwanda-filtered.txt-shallow-20200712-040649-du7ko-00000.warc.gz 1578966 download   job
urls-archive.max.fan-twitter-@LegalAid_Rwanda-filtered.txt-shallow-20200712-040649-du7ko-00000.warc.os.cdx.gz 5660 download
urls-archive.max.fan-twitter-@LegalAid_Rwanda-filtered.txt-shallow-20200712-040649-du7ko-meta.warc.gz 7041 download   job
urls-archive.max.fan-twitter-@LegalAid_Rwanda-filtered.txt-shallow-20200712-040649-du7ko-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LegalAid_Rwanda-filtered.txt-shallow-20200712-040649-du7ko-urls.txt 189 download
urls-archive.max.fan-twitter-@LegalAid_Rwanda-filtered.txt-shallow-20200712-040649-du7ko.json 345 download   job
urls-archive.max.fan-twitter-@LenerRenauld-filtered.txt-shallow-20200712-040306-eehug-00000.warc.gz 29755559 download   job
urls-archive.max.fan-twitter-@LenerRenauld-filtered.txt-shallow-20200712-040306-eehug-00000.warc.os.cdx.gz 57651 download
urls-archive.max.fan-twitter-@LenerRenauld-filtered.txt-shallow-20200712-040306-eehug-meta.warc.gz 35378 download   job
urls-archive.max.fan-twitter-@LenerRenauld-filtered.txt-shallow-20200712-040306-eehug-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LenerRenauld-filtered.txt-shallow-20200712-040306-eehug-urls.txt 14714 download
urls-archive.max.fan-twitter-@LenerRenauld-filtered.txt-shallow-20200712-040306-eehug.json 339 download   job
urls-archive.max.fan-twitter-@LewisClarkTrail-filtered.txt-shallow-20200712-040214-4r4nn-00000.warc.gz 139094415 download   job
urls-archive.max.fan-twitter-@LewisClarkTrail-filtered.txt-shallow-20200712-040214-4r4nn-00000.warc.os.cdx.gz 160807 download
urls-archive.max.fan-twitter-@LewisClarkTrail-filtered.txt-shallow-20200712-040214-4r4nn-meta.warc.gz 90094 download   job
urls-archive.max.fan-twitter-@LewisClarkTrail-filtered.txt-shallow-20200712-040214-4r4nn-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LewisClarkTrail-filtered.txt-shallow-20200712-040214-4r4nn-urls.txt 63489 download
urls-archive.max.fan-twitter-@LewisClarkTrail-filtered.txt-shallow-20200712-040214-4r4nn.json 345 download   job
urls-archive.max.fan-twitter-@LexieGRieg-filtered.txt-shallow-20200712-040213-wry8m-00000.warc.gz 49664438 download   job
urls-archive.max.fan-twitter-@LexieGRieg-filtered.txt-shallow-20200712-040213-wry8m-00000.warc.os.cdx.gz 68057 download
urls-archive.max.fan-twitter-@LexieGRieg-filtered.txt-shallow-20200712-040213-wry8m-meta.warc.gz 40991 download   job
urls-archive.max.fan-twitter-@LexieGRieg-filtered.txt-shallow-20200712-040213-wry8m-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LexieGRieg-filtered.txt-shallow-20200712-040213-wry8m-urls.txt 40306 download
urls-archive.max.fan-twitter-@LexieGRieg-filtered.txt-shallow-20200712-040213-wry8m.json 335 download   job
urls-archive.max.fan-twitter-@LiYuan6-filtered.txt-shallow-20200712-034418-8iqvr-00000.warc.gz 292628773 download   job
urls-archive.max.fan-twitter-@LiYuan6-filtered.txt-shallow-20200712-034418-8iqvr-00000.warc.os.cdx.gz 748082 download
urls-archive.max.fan-twitter-@LiYuan6-filtered.txt-shallow-20200712-034418-8iqvr-meta.warc.gz 397934 download   job
urls-archive.max.fan-twitter-@LiYuan6-filtered.txt-shallow-20200712-034418-8iqvr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LiYuan6-filtered.txt-shallow-20200712-034418-8iqvr-urls.txt 153748 download
urls-archive.max.fan-twitter-@LiYuan6-filtered.txt-shallow-20200712-034418-8iqvr.json 329 download   job
urls-archive.max.fan-twitter-@LibDems-filtered.txt-shallow-20200712-040053-2f66p-urls.txt 1317182 download
urls-archive.max.fan-twitter-@LibDems-filtered.txt-shallow-20200712-040053-2f66p.json 329 download   job
urls-archive.max.fan-twitter-@LimaCop20-filtered.txt-shallow-20200712-035346-16r3e-00000.warc.gz 791918479 download   job
urls-archive.max.fan-twitter-@LimaCop20-filtered.txt-shallow-20200712-035346-16r3e-00000.warc.os.cdx.gz 1192079 download
urls-archive.max.fan-twitter-@LimaCop20-filtered.txt-shallow-20200712-035346-16r3e-meta.warc.gz 632596 download   job
urls-archive.max.fan-twitter-@LimaCop20-filtered.txt-shallow-20200712-035346-16r3e-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LimaCop20-filtered.txt-shallow-20200712-035346-16r3e-urls.txt 383040 download
urls-archive.max.fan-twitter-@LimaCop20-filtered.txt-shallow-20200712-035346-16r3e.json 333 download   job
urls-archive.max.fan-twitter-@LindseyGrahamSC-filtered.txt-shallow-20200712-035346-bn7fu-00000.warc.gz 181012746 download   job
urls-archive.max.fan-twitter-@LindseyGrahamSC-filtered.txt-shallow-20200712-035346-bn7fu-00000.warc.os.cdx.gz 484432 download
urls-archive.max.fan-twitter-@LindseyGrahamSC-filtered.txt-shallow-20200712-035346-bn7fu-meta.warc.gz 257332 download   job
urls-archive.max.fan-twitter-@LindseyGrahamSC-filtered.txt-shallow-20200712-035346-bn7fu-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LindseyGrahamSC-filtered.txt-shallow-20200712-035346-bn7fu-urls.txt 31248 download
urls-archive.max.fan-twitter-@LindseyGrahamSC-filtered.txt-shallow-20200712-035346-bn7fu.json 345 download   job
urls-archive.max.fan-twitter-@LindseyTanner-filtered.txt-shallow-20200712-035342-7xxkl-00000.warc.gz 277355806 download   job
urls-archive.max.fan-twitter-@LindseyTanner-filtered.txt-shallow-20200712-035342-7xxkl-00000.warc.os.cdx.gz 323679 download
urls-archive.max.fan-twitter-@LindseyTanner-filtered.txt-shallow-20200712-035342-7xxkl-meta.warc.gz 173830 download   job
urls-archive.max.fan-twitter-@LindseyTanner-filtered.txt-shallow-20200712-035342-7xxkl-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LindseyTanner-filtered.txt-shallow-20200712-035342-7xxkl-urls.txt 256477 download
urls-archive.max.fan-twitter-@LindseyTanner-filtered.txt-shallow-20200712-035342-7xxkl.json 341 download   job
urls-archive.max.fan-twitter-@Lise_Kingo-filtered.txt-shallow-20200712-035104-3mxuq-00000.warc.gz 118847922 download   job
urls-archive.max.fan-twitter-@Lise_Kingo-filtered.txt-shallow-20200712-035104-3mxuq-00000.warc.os.cdx.gz 226974 download
urls-archive.max.fan-twitter-@Lise_Kingo-filtered.txt-shallow-20200712-035104-3mxuq-meta.warc.gz 124960 download   job
urls-archive.max.fan-twitter-@Lise_Kingo-filtered.txt-shallow-20200712-035104-3mxuq-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Lise_Kingo-filtered.txt-shallow-20200712-035104-3mxuq-urls.txt 32443 download
urls-archive.max.fan-twitter-@Lise_Kingo-filtered.txt-shallow-20200712-035104-3mxuq.json 335 download   job
urls-archive.max.fan-twitter-@Lithuania-filtered.txt-shallow-20200712-034913-ajd9s-00000.warc.gz 338171852 download   job
urls-archive.max.fan-twitter-@Lithuania-filtered.txt-shallow-20200712-034913-ajd9s-00000.warc.os.cdx.gz 526646 download
urls-archive.max.fan-twitter-@Lithuania-filtered.txt-shallow-20200712-034913-ajd9s-meta.warc.gz 278070 download   job
urls-archive.max.fan-twitter-@Lithuania-filtered.txt-shallow-20200712-034913-ajd9s-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@Lithuania-filtered.txt-shallow-20200712-034913-ajd9s-urls.txt 96382 download
urls-archive.max.fan-twitter-@Lithuania-filtered.txt-shallow-20200712-034913-ajd9s.json 333 download   job
urls-archive.max.fan-twitter-@LizetteScribe-filtered.txt-shallow-20200712-034418-dxw2o-00000.warc.gz 154957118 download   job
urls-archive.max.fan-twitter-@LizetteScribe-filtered.txt-shallow-20200712-034418-dxw2o-00000.warc.os.cdx.gz 333295 download
urls-archive.max.fan-twitter-@LizetteScribe-filtered.txt-shallow-20200712-034418-dxw2o-meta.warc.gz 181779 download   job
urls-archive.max.fan-twitter-@LizetteScribe-filtered.txt-shallow-20200712-034418-dxw2o-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LizetteScribe-filtered.txt-shallow-20200712-034418-dxw2o-urls.txt 119946 download
urls-archive.max.fan-twitter-@LizetteScribe-filtered.txt-shallow-20200712-034418-dxw2o.json 341 download   job
urls-archive.max.fan-twitter-@LondonBreed-filtered.txt-shallow-20200712-033945-63ur4-00000.warc.gz 935528677 download   job
urls-archive.max.fan-twitter-@LondonBreed-filtered.txt-shallow-20200712-033945-63ur4-00000.warc.os.cdx.gz 1390566 download
urls-archive.max.fan-twitter-@LondonBreed-filtered.txt-shallow-20200712-033945-63ur4-meta.warc.gz 739469 download   job
urls-archive.max.fan-twitter-@LondonBreed-filtered.txt-shallow-20200712-033945-63ur4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LondonBreed-filtered.txt-shallow-20200712-033945-63ur4-urls.txt 220062 download
urls-archive.max.fan-twitter-@LondonBreed-filtered.txt-shallow-20200712-033945-63ur4.json 337 download   job
urls-archive.max.fan-twitter-@LuisaAlcalde-filtered.txt-shallow-20200712-033219-6vrtz-00000.warc.gz 378345809 download   job
urls-archive.max.fan-twitter-@LuisaAlcalde-filtered.txt-shallow-20200712-033219-6vrtz-00000.warc.os.cdx.gz 1189270 download
urls-archive.max.fan-twitter-@LuisaAlcalde-filtered.txt-shallow-20200712-033219-6vrtz-meta.warc.gz 635743 download   job
urls-archive.max.fan-twitter-@LuisaAlcalde-filtered.txt-shallow-20200712-033219-6vrtz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@LuisaAlcalde-filtered.txt-shallow-20200712-033219-6vrtz-urls.txt 193333 download
urls-archive.max.fan-twitter-@LuisaAlcalde-filtered.txt-shallow-20200712-033219-6vrtz.json 339 download   job
urls-archive.max.fan-twitter-@LukeHall-filtered.txt-shallow-20200712-032441-e0j0l-00000.warc.gz 19258714 download   job
urls-archive.max.fan-twitter-@LukeHall-filtered.txt-shallow-20200712-032441-e0j0l-00000.warc.os.cdx.gz 53522 download
urls-archive.max.fan-twitter-@LutzHen-filtered.txt-shallow-20200712-032413-aw2ri-00000.warc.gz 2143331 download   job
urls-archive.max.fan-twitter-@LutzHen-filtered.txt-shallow-20200712-032413-aw2ri-00000.warc.os.cdx.gz 5579 download
urls-archive.max.fan-twitter-@MAECHaiti-filtered.txt-shallow-20200712-032140-8do53-00000.warc.gz 23111651 download   job
urls-archive.max.fan-twitter-@MAECHaiti-filtered.txt-shallow-20200712-032140-8do53-00000.warc.os.cdx.gz 33584 download
urls-archive.max.fan-twitter-@MaggieAstor-filtered.txt-shallow-20200712-031455-93r9k-00000.warc.gz 168289743 download   job
urls-archive.max.fan-twitter-@MaggieAstor-filtered.txt-shallow-20200712-031455-93r9k-00000.warc.os.cdx.gz 416483 download
urls-archive.max.fan-twitter-@MaimunahSharif-filtered.txt-shallow-20200712-031334-9t53n-urls.txt 171531 download
urls-archive.max.fan-twitter-@Malala-filtered.txt-shallow-20200712-031333-czuuk-00000.warc.gz 128137603 download   job
urls-archive.max.fan-twitter-@Malala-filtered.txt-shallow-20200712-031333-czuuk-00000.warc.os.cdx.gz 441407 download
urls-archive.max.fan-twitter-@MalcolmRitter-filtered.txt-shallow-20200712-030115-7d2kj.json 341 download   job
urls-archive.max.fan-twitter-@MangalaLK-filtered.txt-shallow-20200712-030112-53duq-urls.txt 83222 download
urls-archive.max.fan-twitter-@MarieLouise_MT-filtered.txt-shallow-20200712-025359-6j8gs-00000.warc.gz 232395648 download   job
urls-archive.max.fan-twitter-@MarieLouise_MT-filtered.txt-shallow-20200712-025359-6j8gs-00000.warc.os.cdx.gz 331610 download
urls-archive.max.fan-twitter-@Masood__Khan-filtered.txt-shallow-20200712-022558-7cdgw.json 339 download   job
urls-archive.max.fan-twitter-@NWSTampaBay-filtered.txt-shallow-20200711-194621-d8zz5-urls.txt 2359344 download
urls-archive.max.fan-twitter-@jongambrellAP-filtered.txt-shallow-20200712-064223-4fjbr-00000.warc.gz 4824709 download   job
urls-archive.max.fan-twitter-@jongambrellAP-filtered.txt-shallow-20200712-064223-4fjbr-00000.warc.os.cdx.gz 18018 download
urls-archive.max.fan-twitter-@jongambrellAP-filtered.txt-shallow-20200712-064223-4fjbr-meta.warc.gz 14243 download   job
urls-archive.max.fan-twitter-@jongambrellAP-filtered.txt-shallow-20200712-064223-4fjbr-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@jongambrellAP-filtered.txt-shallow-20200712-064223-4fjbr-urls.txt 3899 download
urls-archive.max.fan-twitter-@jongambrellAP-filtered.txt-shallow-20200712-064223-4fjbr.json 341 download   job
urls-archive.max.fan-twitter-@joshuawongcf-filtered.txt-shallow-20200712-062256-7pw0i-00000.warc.gz 2816688 download   job
urls-archive.max.fan-twitter-@joshuawongcf-filtered.txt-shallow-20200712-062256-7pw0i-00000.warc.os.cdx.gz 14199 download
urls-archive.max.fan-twitter-@joshuawongcf-filtered.txt-shallow-20200712-062256-7pw0i-urls.txt 600 download
urls-archive.max.fan-twitter-@joshuawongcf-filtered.txt-shallow-20200712-062256-7pw0i.json 339 download   job
urls-archive.max.fan-twitter-@jotted-filtered.txt-shallow-20200712-062252-ad3xu-00000.warc.gz 396163755 download   job
urls-archive.max.fan-twitter-@jotted-filtered.txt-shallow-20200712-062252-ad3xu-00000.warc.os.cdx.gz 773714 download
urls-archive.max.fan-twitter-@jotted-filtered.txt-shallow-20200712-062252-ad3xu.json 327 download   job
urls-archive.max.fan-twitter-@joyindc-filtered.txt-shallow-20200712-062012-btinv-00000.warc.gz 22796329 download   job
urls-archive.max.fan-twitter-@joyindc-filtered.txt-shallow-20200712-062012-btinv-00000.warc.os.cdx.gz 52950 download
urls-archive.max.fan-twitter-@joyindc-filtered.txt-shallow-20200712-062012-btinv.json 329 download   job
urls-archive.max.fan-twitter-@juliehdavis-filtered.txt-shallow-20200712-061058-c1eoz-00000.warc.gz 666933184 download   job
urls-archive.max.fan-twitter-@juliehdavis-filtered.txt-shallow-20200712-061058-c1eoz-00000.warc.os.cdx.gz 2495116 download
urls-archive.max.fan-twitter-@juliehdavis-filtered.txt-shallow-20200712-061058-c1eoz.json 337 download   job
urls-archive.max.fan-twitter-@julieturkewitz-filtered.txt-shallow-20200712-060503-2snoq-00000.warc.gz 418442521 download   job
urls-archive.max.fan-twitter-@julieturkewitz-filtered.txt-shallow-20200712-060503-2snoq-00000.warc.os.cdx.gz 966884 download
urls-archive.max.fan-twitter-@jwf825-filtered.txt-shallow-20200712-060501-e4m7c-urls.txt 13458 download
urls-archive.max.fan-twitter-@jwnyt-filtered.txt-shallow-20200712-060433-c2ohe-meta.warc.gz 7563 download   job
urls-archive.max.fan-twitter-@jwnyt-filtered.txt-shallow-20200712-060433-c2ohe-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@jwnyt-filtered.txt-shallow-20200712-060433-c2ohe-urls.txt 884 download
urls-archive.max.fan-twitter-@jwnyt-filtered.txt-shallow-20200712-060433-c2ohe.json 325 download   job
urls-archive.max.fan-twitter-@jwpetersNYT-filtered.txt-shallow-20200712-055646-5en64-meta.warc.gz 742937 download   job
urls-archive.max.fan-twitter-@jwpetersNYT-filtered.txt-shallow-20200712-055646-5en64-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@jyduclos-filtered.txt-shallow-20200712-055306-dfnpz-00000.warc.gz 1465676868 download   job
urls-archive.max.fan-twitter-@jyduclos-filtered.txt-shallow-20200712-055306-dfnpz-00000.warc.os.cdx.gz 1349306 download
urls-archive.max.fan-twitter-@jyduclos-filtered.txt-shallow-20200712-055306-dfnpz-meta.warc.gz 721415 download   job
urls-archive.max.fan-twitter-@jyduclos-filtered.txt-shallow-20200712-055306-dfnpz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@jyduclos-filtered.txt-shallow-20200712-055306-dfnpz.json 331 download   job
urls-archive.max.fan-twitter-@kantele10-filtered.txt-shallow-20200712-054043-1uroe-00000.warc.gz 93538578 download   job
urls-archive.max.fan-twitter-@kantele10-filtered.txt-shallow-20200712-054043-1uroe-00000.warc.os.cdx.gz 115703 download
urls-archive.max.fan-twitter-@kantele10-filtered.txt-shallow-20200712-054043-1uroe-meta.warc.gz 67106 download   job
urls-archive.max.fan-twitter-@kantele10-filtered.txt-shallow-20200712-054043-1uroe-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kantele10-filtered.txt-shallow-20200712-054043-1uroe-urls.txt 74898 download
urls-archive.max.fan-twitter-@kantele10-filtered.txt-shallow-20200712-054043-1uroe.json 333 download   job
urls-archive.max.fan-twitter-@kateconger-filtered.txt-shallow-20200712-053926-6nohb-00000.warc.gz 158955415 download   job
urls-archive.max.fan-twitter-@kateconger-filtered.txt-shallow-20200712-053926-6nohb-00000.warc.os.cdx.gz 395507 download
urls-archive.max.fan-twitter-@kateconger-filtered.txt-shallow-20200712-053926-6nohb-meta.warc.gz 214498 download   job
urls-archive.max.fan-twitter-@kateconger-filtered.txt-shallow-20200712-053926-6nohb-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kateconger-filtered.txt-shallow-20200712-053926-6nohb-urls.txt 68728 download
urls-archive.max.fan-twitter-@katieglueck-filtered.txt-shallow-20200712-053526-7xotk-00000.warc.gz 1027859693 download   job
urls-archive.max.fan-twitter-@katieglueck-filtered.txt-shallow-20200712-053526-7xotk-00000.warc.os.cdx.gz 2067419 download
urls-archive.max.fan-twitter-@katieglueck-filtered.txt-shallow-20200712-053526-7xotk-meta.warc.gz 1088890 download   job
urls-archive.max.fan-twitter-@katieglueck-filtered.txt-shallow-20200712-053526-7xotk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@katieglueck-filtered.txt-shallow-20200712-053526-7xotk-urls.txt 760975 download
urls-archive.max.fan-twitter-@kayiveyforgov-filtered.txt-shallow-20200712-053414-czobk-00000.warc.gz 59288529 download   job
urls-archive.max.fan-twitter-@kayiveyforgov-filtered.txt-shallow-20200712-053414-czobk-00000.warc.os.cdx.gz 101250 download
urls-archive.max.fan-twitter-@kayiveyforgov-filtered.txt-shallow-20200712-053414-czobk-meta.warc.gz 58670 download   job
urls-archive.max.fan-twitter-@kayiveyforgov-filtered.txt-shallow-20200712-053414-czobk-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kayiveyforgov-filtered.txt-shallow-20200712-053414-czobk-urls.txt 12772 download
urls-archive.max.fan-twitter-@kayiveyforgov-filtered.txt-shallow-20200712-053414-czobk.json 341 download   job
urls-archive.max.fan-twitter-@kchangnyt-filtered.txt-shallow-20200712-053034-b545v-00000.warc.gz 271298817 download   job
urls-archive.max.fan-twitter-@kchangnyt-filtered.txt-shallow-20200712-053034-b545v-00000.warc.os.cdx.gz 598633 download
urls-archive.max.fan-twitter-@kchangnyt-filtered.txt-shallow-20200712-053034-b545v-meta.warc.gz 321135 download   job
urls-archive.max.fan-twitter-@kchangnyt-filtered.txt-shallow-20200712-053034-b545v-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kchangnyt-filtered.txt-shallow-20200712-053034-b545v-urls.txt 169234 download
urls-archive.max.fan-twitter-@kchangnyt-filtered.txt-shallow-20200712-053034-b545v.json 333 download   job
urls-archive.max.fan-twitter-@keithporcaro-filtered.txt-shallow-20200712-053033-brgdf-00000.warc.gz 22818332 download   job
urls-archive.max.fan-twitter-@keithporcaro-filtered.txt-shallow-20200712-053033-brgdf-00000.warc.os.cdx.gz 38245 download
urls-archive.max.fan-twitter-@keithporcaro-filtered.txt-shallow-20200712-053033-brgdf-meta.warc.gz 25591 download   job
urls-archive.max.fan-twitter-@keithporcaro-filtered.txt-shallow-20200712-053033-brgdf-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@keithporcaro-filtered.txt-shallow-20200712-053033-brgdf-urls.txt 17909 download
urls-archive.max.fan-twitter-@keithporcaro-filtered.txt-shallow-20200712-053033-brgdf.json 339 download   job
urls-archive.max.fan-twitter-@kellydaschle-filtered.txt-shallow-20200712-053030-cbmwv-00000.warc.gz 28121380 download   job
urls-archive.max.fan-twitter-@kellydaschle-filtered.txt-shallow-20200712-053030-cbmwv-00000.warc.os.cdx.gz 46081 download
urls-archive.max.fan-twitter-@kellydaschle-filtered.txt-shallow-20200712-053030-cbmwv-meta.warc.gz 29433 download   job
urls-archive.max.fan-twitter-@kellydaschle-filtered.txt-shallow-20200712-053030-cbmwv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kellydaschle-filtered.txt-shallow-20200712-053030-cbmwv-urls.txt 19850 download
urls-archive.max.fan-twitter-@kellydaschle-filtered.txt-shallow-20200712-053030-cbmwv.json 339 download   job
urls-archive.max.fan-twitter-@ketto-filtered.txt-shallow-20200712-052052-bnl2s-meta.warc.gz 1411330 download   job
urls-archive.max.fan-twitter-@ketto-filtered.txt-shallow-20200712-052052-bnl2s-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ketto-filtered.txt-shallow-20200712-052052-bnl2s-urls.txt 987612 download
urls-archive.max.fan-twitter-@ketto-filtered.txt-shallow-20200712-052052-bnl2s.json 325 download   job
urls-archive.max.fan-twitter-@kguggenheim-filtered.txt-shallow-20200712-050933-f1tk4-00000.warc.gz 101079426 download   job
urls-archive.max.fan-twitter-@kguggenheim-filtered.txt-shallow-20200712-050933-f1tk4-00000.warc.os.cdx.gz 178120 download
urls-archive.max.fan-twitter-@kguggenheim-filtered.txt-shallow-20200712-050933-f1tk4-meta.warc.gz 99019 download   job
urls-archive.max.fan-twitter-@kguggenheim-filtered.txt-shallow-20200712-050933-f1tk4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kguggenheim-filtered.txt-shallow-20200712-050933-f1tk4-urls.txt 90868 download
urls-archive.max.fan-twitter-@kguggenheim-filtered.txt-shallow-20200712-050933-f1tk4.json 337 download   job
urls-archive.max.fan-twitter-@khaledhosseini-filtered.txt-shallow-20200712-050932-5xay3-00000.warc.gz 78877877 download   job
urls-archive.max.fan-twitter-@khaledhosseini-filtered.txt-shallow-20200712-050932-5xay3-00000.warc.os.cdx.gz 277568 download
urls-archive.max.fan-twitter-@khaledhosseini-filtered.txt-shallow-20200712-050932-5xay3-meta.warc.gz 148635 download   job
urls-archive.max.fan-twitter-@khaledhosseini-filtered.txt-shallow-20200712-050932-5xay3-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@khaledhosseini-filtered.txt-shallow-20200712-050932-5xay3-urls.txt 30030 download
urls-archive.max.fan-twitter-@khaledhosseini-filtered.txt-shallow-20200712-050932-5xay3.json 343 download   job
urls-archive.max.fan-twitter-@khalil_hamra-filtered.txt-shallow-20200712-050929-2630r-00000.warc.gz 31307913 download   job
urls-archive.max.fan-twitter-@khalil_hamra-filtered.txt-shallow-20200712-050929-2630r-00000.warc.os.cdx.gz 44862 download
urls-archive.max.fan-twitter-@khalil_hamra-filtered.txt-shallow-20200712-050929-2630r-meta.warc.gz 28732 download   job
urls-archive.max.fan-twitter-@khalil_hamra-filtered.txt-shallow-20200712-050929-2630r-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@khalil_hamra-filtered.txt-shallow-20200712-050929-2630r-urls.txt 22210 download
urls-archive.max.fan-twitter-@khalil_hamra-filtered.txt-shallow-20200712-050929-2630r.json 339 download   job
urls-archive.max.fan-twitter-@kimgamel-filtered.txt-shallow-20200712-050926-eoab4-00000.warc.gz 197832698 download   job
urls-archive.max.fan-twitter-@kimgamel-filtered.txt-shallow-20200712-050926-eoab4-00000.warc.os.cdx.gz 387155 download
urls-archive.max.fan-twitter-@kimgamel-filtered.txt-shallow-20200712-050926-eoab4-meta.warc.gz 206976 download   job
urls-archive.max.fan-twitter-@kimgamel-filtered.txt-shallow-20200712-050926-eoab4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kimgamel-filtered.txt-shallow-20200712-050926-eoab4-urls.txt 123058 download
urls-archive.max.fan-twitter-@kimgamel-filtered.txt-shallow-20200712-050926-eoab4.json 331 download   job
urls-archive.max.fan-twitter-@kkusmer-filtered.txt-shallow-20200712-045715-93su0-00000.warc.gz 30812585 download   job
urls-archive.max.fan-twitter-@kkusmer-filtered.txt-shallow-20200712-045715-93su0-00000.warc.os.cdx.gz 33178 download
urls-archive.max.fan-twitter-@kkusmer-filtered.txt-shallow-20200712-045715-93su0-meta.warc.gz 22086 download   job
urls-archive.max.fan-twitter-@kkusmer-filtered.txt-shallow-20200712-045715-93su0-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kkusmer-filtered.txt-shallow-20200712-045715-93su0-urls.txt 25475 download
urls-archive.max.fan-twitter-@kkusmer-filtered.txt-shallow-20200712-045715-93su0.json 329 download   job
urls-archive.max.fan-twitter-@kmatheson-filtered.txt-shallow-20200712-044925-a3a2c-00000.warc.gz 459283518 download   job
urls-archive.max.fan-twitter-@kmatheson-filtered.txt-shallow-20200712-044925-a3a2c-00000.warc.os.cdx.gz 499783 download
urls-archive.max.fan-twitter-@kmatheson-filtered.txt-shallow-20200712-044925-a3a2c-meta.warc.gz 265482 download   job
urls-archive.max.fan-twitter-@kmatheson-filtered.txt-shallow-20200712-044925-a3a2c-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kmatheson-filtered.txt-shallow-20200712-044925-a3a2c-urls.txt 298785 download
urls-archive.max.fan-twitter-@kmatheson-filtered.txt-shallow-20200712-044925-a3a2c.json 333 download   job
urls-archive.max.fan-twitter-@kronayne-filtered.txt-shallow-20200712-044707-8hr83-00000.warc.gz 1011661984 download   job
urls-archive.max.fan-twitter-@kronayne-filtered.txt-shallow-20200712-044707-8hr83-00000.warc.os.cdx.gz 1457789 download
urls-archive.max.fan-twitter-@kronayne-filtered.txt-shallow-20200712-044707-8hr83-meta.warc.gz 774312 download   job
urls-archive.max.fan-twitter-@kronayne-filtered.txt-shallow-20200712-044707-8hr83-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kronayne-filtered.txt-shallow-20200712-044707-8hr83-urls.txt 804058 download
urls-archive.max.fan-twitter-@kronayne-filtered.txt-shallow-20200712-044707-8hr83.json 331 download   job
urls-archive.max.fan-twitter-@ksamissionun-filtered.txt-shallow-20200712-044517-2hjpv-00000.warc.gz 620805227 download   job
urls-archive.max.fan-twitter-@ksamissionun-filtered.txt-shallow-20200712-044517-2hjpv-00000.warc.os.cdx.gz 894682 download
urls-archive.max.fan-twitter-@ksamissionun-filtered.txt-shallow-20200712-044517-2hjpv-meta.warc.gz 467002 download   job
urls-archive.max.fan-twitter-@ksamissionun-filtered.txt-shallow-20200712-044517-2hjpv-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ksamissionun-filtered.txt-shallow-20200712-044517-2hjpv-urls.txt 179603 download
urls-archive.max.fan-twitter-@ksamissionun-filtered.txt-shallow-20200712-044517-2hjpv.json 339 download   job
urls-archive.max.fan-twitter-@kzernike-filtered.txt-shallow-20200712-042652-6q7ks-00000.warc.gz 167160504 download   job
urls-archive.max.fan-twitter-@kzernike-filtered.txt-shallow-20200712-042652-6q7ks-00000.warc.os.cdx.gz 361629 download
urls-archive.max.fan-twitter-@kzernike-filtered.txt-shallow-20200712-042652-6q7ks-meta.warc.gz 197658 download   job
urls-archive.max.fan-twitter-@kzernike-filtered.txt-shallow-20200712-042652-6q7ks-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@kzernike-filtered.txt-shallow-20200712-042652-6q7ks-urls.txt 125735 download
urls-archive.max.fan-twitter-@kzernike-filtered.txt-shallow-20200712-042652-6q7ks.json 331 download   job
urls-archive.max.fan-twitter-@laurabronner-filtered.txt-shallow-20200712-041236-ec70n-00000.warc.gz 87788957 download   job
urls-archive.max.fan-twitter-@laurabronner-filtered.txt-shallow-20200712-041236-ec70n-00000.warc.os.cdx.gz 214949 download
urls-archive.max.fan-twitter-@laurabronner-filtered.txt-shallow-20200712-041236-ec70n-meta.warc.gz 118624 download   job
urls-archive.max.fan-twitter-@laurabronner-filtered.txt-shallow-20200712-041236-ec70n-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@laurabronner-filtered.txt-shallow-20200712-041236-ec70n-urls.txt 38640 download
urls-archive.max.fan-twitter-@laurabronner-filtered.txt-shallow-20200712-041236-ec70n.json 339 download   job
urls-archive.max.fan-twitter-@lbanypd-filtered.txt-shallow-20200712-041146-u6sll-00000.warc.gz 2335962 download   job
urls-archive.max.fan-twitter-@lbanypd-filtered.txt-shallow-20200712-041146-u6sll-00000.warc.os.cdx.gz 7061 download
urls-archive.max.fan-twitter-@lbanypd-filtered.txt-shallow-20200712-041146-u6sll-meta.warc.gz 7811 download   job
urls-archive.max.fan-twitter-@lbanypd-filtered.txt-shallow-20200712-041146-u6sll-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lbanypd-filtered.txt-shallow-20200712-041146-u6sll-urls.txt 270 download
urls-archive.max.fan-twitter-@lbanypd-filtered.txt-shallow-20200712-041146-u6sll.json 329 download   job
urls-archive.max.fan-twitter-@lc_francesca-filtered.txt-shallow-20200712-041121-dzb1j-00000.warc.gz 17912239 download   job
urls-archive.max.fan-twitter-@lc_francesca-filtered.txt-shallow-20200712-041121-dzb1j-00000.warc.os.cdx.gz 21160 download
urls-archive.max.fan-twitter-@lc_francesca-filtered.txt-shallow-20200712-041121-dzb1j-meta.warc.gz 15637 download   job
urls-archive.max.fan-twitter-@lc_francesca-filtered.txt-shallow-20200712-041121-dzb1j-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lc_francesca-filtered.txt-shallow-20200712-041121-dzb1j-urls.txt 7605 download
urls-archive.max.fan-twitter-@lc_francesca-filtered.txt-shallow-20200712-041121-dzb1j.json 339 download   job
urls-archive.max.fan-twitter-@ldkeith-filtered.txt-shallow-20200712-040839-7it2y-00000.warc.gz 59513383 download   job
urls-archive.max.fan-twitter-@ldkeith-filtered.txt-shallow-20200712-040839-7it2y-00000.warc.os.cdx.gz 68315 download
urls-archive.max.fan-twitter-@ldkeith-filtered.txt-shallow-20200712-040839-7it2y-meta.warc.gz 40890 download   job
urls-archive.max.fan-twitter-@ldkeith-filtered.txt-shallow-20200712-040839-7it2y-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@ldkeith-filtered.txt-shallow-20200712-040839-7it2y-urls.txt 42396 download
urls-archive.max.fan-twitter-@ldkeith-filtered.txt-shallow-20200712-040839-7it2y.json 329 download   job
urls-archive.max.fan-twitter-@lecocqcharlotte-filtered.txt-shallow-20200712-040839-9ocn5-00000.warc.gz 160379514 download   job
urls-archive.max.fan-twitter-@lecocqcharlotte-filtered.txt-shallow-20200712-040839-9ocn5-00000.warc.os.cdx.gz 216233 download
urls-archive.max.fan-twitter-@lecocqcharlotte-filtered.txt-shallow-20200712-040839-9ocn5-meta.warc.gz 117984 download   job
urls-archive.max.fan-twitter-@lecocqcharlotte-filtered.txt-shallow-20200712-040839-9ocn5-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@lecocqcharlotte-filtered.txt-shallow-20200712-040839-9ocn5-urls.txt 53812 download
urls-archive.max.fan-twitter-@lecocqcharlotte-filtered.txt-shallow-20200712-040839-9ocn5.json 345 download   job
urls-archive.max.fan-twitter-@legalhackNYC-filtered.txt-shallow-20200712-040619-3wmva-00000.warc.gz 55378644 download   job
urls-archive.max.fan-twitter-@legalhackNYC-filtered.txt-shallow-20200712-040619-3wmva-00000.warc.os.cdx.gz 98906 download
urls-archive.max.fan-twitter-@legalhackNYC-filtered.txt-shallow-20200712-040619-3wmva-meta.warc.gz 56986 download   job
urls-archive.max.fan-twitter-@legalhackNYC-filtered.txt-shallow-20200712-040619-3wmva-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@legalhackNYC-filtered.txt-shallow-20200712-040619-3wmva-urls.txt 43546 download
urls-archive.max.fan-twitter-@legalhackNYC-filtered.txt-shallow-20200712-040619-3wmva.json 339 download   job
urls-archive.max.fan-twitter-@leshnerd-filtered.txt-shallow-20200712-040306-4w0bc-00000.warc.gz 22202096 download   job
urls-archive.max.fan-twitter-@leshnerd-filtered.txt-shallow-20200712-040306-4w0bc-00000.warc.os.cdx.gz 29927 download
urls-archive.max.fan-twitter-@leshnerd-filtered.txt-shallow-20200712-040306-4w0bc-meta.warc.gz 20619 download   job
urls-archive.max.fan-twitter-@leshnerd-filtered.txt-shallow-20200712-040306-4w0bc-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@leshnerd-filtered.txt-shallow-20200712-040306-4w0bc-urls.txt 11688 download
urls-archive.max.fan-twitter-@leshnerd-filtered.txt-shallow-20200712-040306-4w0bc.json 331 download   job
urls-archive.max.fan-twitter-@litalie-filtered.txt-shallow-20200712-035103-6uljz-00000.warc.gz 234865760 download   job
urls-archive.max.fan-twitter-@litalie-filtered.txt-shallow-20200712-035103-6uljz-00000.warc.os.cdx.gz 313201 download
urls-archive.max.fan-twitter-@litalie-filtered.txt-shallow-20200712-035103-6uljz-meta.warc.gz 170101 download   job
urls-archive.max.fan-twitter-@litalie-filtered.txt-shallow-20200712-035103-6uljz-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@litalie-filtered.txt-shallow-20200712-035103-6uljz-urls.txt 159109 download
urls-archive.max.fan-twitter-@litalie-filtered.txt-shallow-20200712-035103-6uljz.json 329 download   job
urls-archive.max.fan-twitter-@lopezobrador_-filtered.txt-shallow-20200712-033943-ejd7h.json 341 download   job
urls-archive.max.fan-twitter-@loracorkelley-filtered.txt-shallow-20200712-033942-b5l64.json 341 download   job
urls-archive.max.fan-twitter-@ltgovmeyer-filtered.txt-shallow-20200712-033913-db6uy-urls.txt 58 download
urls-archive.max.fan-twitter-@ltgovmeyer-filtered.txt-shallow-20200712-033913-db6uy.json 335 download   job
urls-archive.max.fan-twitter-@marclauritsen-filtered.txt-shallow-20200712-030103-1ur3w-00000.warc.gz 275940949 download   job
urls-archive.max.fan-twitter-@marclauritsen-filtered.txt-shallow-20200712-030103-1ur3w-00000.warc.os.cdx.gz 350433 download
urls-archive.max.fan-twitter-@marclauritsen-filtered.txt-shallow-20200712-030103-1ur3w.json 341 download   job
urls-archive.max.fan-twitter-@marwilliamson-filtered.txt-shallow-20200712-023051-3id5d-00000.warc.gz 1031024973 download   job
urls-archive.max.fan-twitter-@marwilliamson-filtered.txt-shallow-20200712-023051-3id5d-00000.warc.os.cdx.gz 3111241 download
urls-archive.max.fan-twitter-@mccanner-filtered.txt-shallow-20200712-014048-961x4-00000.warc.gz 2602940591 download   job
urls-archive.max.fan-twitter-@mccanner-filtered.txt-shallow-20200712-014048-961x4-00000.warc.os.cdx.gz 3972926 download
urls-archive.max.fan-twitter-@mccanner-filtered.txt-shallow-20200712-014048-961x4-meta.warc.gz 2086629 download   job
urls-archive.max.fan-twitter-@mccanner-filtered.txt-shallow-20200712-014048-961x4-meta.warc.os.cdx.gz 47 download
urls-archive.max.fan-twitter-@mccanner-filtered.txt-shallow-20200712-014048-961x4-urls.txt 1646184 download
urls-archive.max.fan-twitter-@mccanner-filtered.txt-shallow-20200712-014048-961x4.json 331 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00201.warc.gz 5368756631 download   job
urls-transfer.notkiska.pw-twitter-%23BlackHistoryMonth-shallow-20200610-132545-46qdq-00201.warc.os.cdx.gz 1679881 download
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00123.warc.gz 5371101711 download   job
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00123.warc.os.cdx.gz 2382333 download
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00124.warc.gz 5427424355 download   job
urls-transfer.notkiska.pw-twitter-%23notmypresident-shallow-20200530-220957-2c0z0-00124.warc.os.cdx.gz 1818183 download
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00005.warc.gz 5559742368 download   job
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00005.warc.os.cdx.gz 660 download
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00006.warc.gz 5576001551 download   job
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00006.warc.os.cdx.gz 8559 download
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00007.warc.gz 3277497651 download   job
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-00007.warc.os.cdx.gz 390 download
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-meta.warc.gz 2791832 download   job
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-meta.warc.os.cdx.gz 47 download
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb-urls.txt 672721 download
urls-transfer.notkiska.pw-twitter-@NYCCouncil-shallow-20200711-202213-4ibxb.json 332 download   job
urls-transfer.notkiska.pw-twitter-@sarahmanavis-shallow-20200712-003056-5tmlk-00000.warc.gz 5102900658 download   job
urls-transfer.notkiska.pw-twitter-@sarahmanavis-shallow-20200712-003056-5tmlk-00000.warc.os.cdx.gz 4124561 download
whc.unesco.org-inf-20200622-104903-7ibzx-00073.warc.gz 5368719128 download   job
whc.unesco.org-inf-20200622-104903-7ibzx-00073.warc.os.cdx.gz 5815949 download
www.12377.cn-inf-20200711-122213-b397n-00003.warc.gz 3631544179 download   job
www.12377.cn-inf-20200711-122213-b397n-00003.warc.os.cdx.gz 1077024 download
www.12377.cn-inf-20200711-122213-b397n-meta.warc.gz 5493638 download   job
www.12377.cn-inf-20200711-122213-b397n-meta.warc.os.cdx.gz 47 download
www.12377.cn-inf-20200711-122213-b397n.json 242 download   job
www.seniorsnews.com.au-inf-20200528-062104-cuuvc-00111.warc.gz 5370002811 download   job
www.seniorsnews.com.au-inf-20200528-062104-cuuvc-00111.warc.os.cdx.gz 6630613 download
yepan.tistory.com-inf-20200711-025221-cq5rp-00002.warc.gz 5369008850 download   job
yepan.tistory.com-inf-20200711-025221-cq5rp-00002.warc.os.cdx.gz 3763069 download
zxfb.12371.cn-inf-20200712-045952-1yoik-00000.warc.gz 667087452 download   job
zxfb.12371.cn-inf-20200712-045952-1yoik-00000.warc.os.cdx.gz 207638 download
zxfb.12371.cn-inf-20200712-045952-1yoik-meta.warc.gz 115578 download   job
zxfb.12371.cn-inf-20200712-045952-1yoik-meta.warc.os.cdx.gz 47 download
zxfb.12371.cn-inf-20200712-045952-1yoik.json 242 download   job