Item archiveteam_archivebot_go_20171029150001

View on Internet Archive

Filename Size
addons.mozilla.org-inf-20170829-025732-4aa66-00208.warc.gz 5376385321 download   job
addons.mozilla.org-inf-20170829-025732-4aa66-00208.warc.os.cdx.gz 3967730 download
archiveteam_archivebot_go_20171029150001.cdx.gz 38398137 download
archiveteam_archivebot_go_20171029150001.cdx.idx 37730 download
archiveteam_archivebot_go_20171029150001_archive.torrent 840324 download
archiveteam_archivebot_go_20171029150001_files.xml 0 download
archiveteam_archivebot_go_20171029150001_meta.sqlite 227328 download
archiveteam_archivebot_go_20171029150001_meta.xml 1007 download
blogs.harvard.edu-inf-20171024-201411-8w024-00023.warc.gz 5370756042 download   job
blogs.harvard.edu-inf-20171024-201411-8w024-00023.warc.os.cdx.gz 789818 download
blogs.harvard.edu-inf-20171024-201411-8w024-00024.warc.gz 5386165479 download   job
blogs.harvard.edu-inf-20171024-201411-8w024-00024.warc.os.cdx.gz 2551729 download
blogs.mentor.com-shallow-20171029-065205-dar8u-meta.warc.gz 8123 download   job
blogs.mentor.com-shallow-20171029-065205-dar8u-meta.warc.os.cdx.gz 47 download
decidimhojunts.pirata.cat-shallow-20171029-074042-8y8it.json 272 download   job
elmon.cat-shallow-20171029-065819-3mr8d-00000.warc.gz 9135301 download   job
elmon.cat-shallow-20171029-065819-3mr8d-00000.warc.os.cdx.gz 36126 download
forums.meez.com-inf-20171025-220402-tsuml-00004.warc.gz 5368853505 download   job
forums.meez.com-inf-20171025-220402-tsuml-00004.warc.os.cdx.gz 8526484 download
github.com-shallow-20171029-065444-a9kn4-00000.warc.gz 2395725 download   job
github.com-shallow-20171029-065444-a9kn4-00000.warc.os.cdx.gz 3969 download
ib3tv.com-inf-20171029-013303-9iuv5-00003.warc.gz 5578377186 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00003.warc.os.cdx.gz 172264 download
ib3tv.com-inf-20171029-013303-9iuv5-00004.warc.gz 5528494530 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00004.warc.os.cdx.gz 54483 download
ib3tv.com-inf-20171029-013303-9iuv5-00005.warc.gz 5719191957 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00005.warc.os.cdx.gz 97553 download
ib3tv.com-inf-20171029-013303-9iuv5-00006.warc.gz 5512737629 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00006.warc.os.cdx.gz 35661 download
ib3tv.com-inf-20171029-013303-9iuv5-00007.warc.gz 5568267868 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00007.warc.os.cdx.gz 6443 download
ib3tv.com-inf-20171029-013303-9iuv5-00008.warc.gz 5374340057 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00008.warc.os.cdx.gz 176192 download
ib3tv.com-inf-20171029-013303-9iuv5-00009.warc.gz 5370934054 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00009.warc.os.cdx.gz 249093 download
ib3tv.com-inf-20171029-013303-9iuv5-00010.warc.gz 5553257208 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00010.warc.os.cdx.gz 216051 download
ib3tv.com-inf-20171029-013303-9iuv5-00011.warc.gz 5370595110 download   job
ib3tv.com-inf-20171029-013303-9iuv5-00011.warc.os.cdx.gz 145221 download
libraries.ucsd.edu-inf-20171026-221214-76cvo-00064.warc.gz 1703749865 download   job
libraries.ucsd.edu-inf-20171026-221214-76cvo-00064.warc.os.cdx.gz 95891 download
libraries.ucsd.edu-inf-20171026-221214-76cvo-meta.warc.gz 93002 download   job
libraries.ucsd.edu-inf-20171026-221214-76cvo-meta.warc.os.cdx.gz 47 download
libraries.ucsd.edu-inf-20171026-221214-76cvo.json 262 download   job
mee6.xyz-shallow-20171029-065009-6xliy-00000.warc.gz 1758211 download   job
mee6.xyz-shallow-20171029-065009-6xliy-00000.warc.os.cdx.gz 13002 download
mike.tig.as-shallow-20171029-074133-8a2ez-00000.warc.gz 373905 download   job
mike.tig.as-shallow-20171029-074133-8a2ez-00000.warc.os.cdx.gz 2390 download
mike.tig.as-shallow-20171029-074133-8a2ez-meta.warc.gz 4896 download   job
mike.tig.as-shallow-20171029-074133-8a2ez-meta.warc.os.cdx.gz 47 download
mike.tig.as-shallow-20171029-074133-8a2ez.json 259 download   job
mindprod.com-inf-20171029-110958-a7sqj-00000.warc.gz 116996481 download   job
mindprod.com-inf-20171029-110958-a7sqj-00000.warc.os.cdx.gz 283543 download
mindprod.com-inf-20171029-110958-a7sqj-meta.warc.gz 182745 download   job
mindprod.com-inf-20171029-110958-a7sqj-meta.warc.os.cdx.gz 47 download
mindprod.com-inf-20171029-110958-a7sqj.json 252 download   job
mossos.gencat.cat-inf-20171029-114430-d9ai3-00000.warc.gz 1945591846 download   job
mossos.gencat.cat-inf-20171029-114430-d9ai3-00000.warc.os.cdx.gz 1318766 download
mossos.gencat.cat-inf-20171029-114430-d9ai3-meta.warc.gz 792354 download   job
mossos.gencat.cat-inf-20171029-114430-d9ai3-meta.warc.os.cdx.gz 47 download
mossos.gencat.cat-inf-20171029-114430-d9ai3.json 247 download   job
news.sky.com-shallow-20171029-100530-jjulo-00000.warc.gz 1366615 download   job
news.sky.com-shallow-20171029-100530-jjulo-00000.warc.os.cdx.gz 5289 download
news.sky.com-shallow-20171029-100530-jjulo-meta.warc.gz 7009 download   job
news.sky.com-shallow-20171029-100530-jjulo-meta.warc.os.cdx.gz 47 download
news.sky.com-shallow-20171029-100530-jjulo.json 336 download   job
noticias.juridicas.com-shallow-20171029-064805-t880g-00000.warc.gz 887573 download   job
noticias.juridicas.com-shallow-20171029-064805-t880g-00000.warc.os.cdx.gz 4965 download
noticias.juridicas.com-shallow-20171029-064805-t880g-meta.warc.gz 6413 download   job
noticias.juridicas.com-shallow-20171029-064805-t880g-meta.warc.os.cdx.gz 47 download
pastebin.com-shallow-20171029-070123-6oms5-00000.warc.gz 365992 download   job
pastebin.com-shallow-20171029-070123-6oms5-00000.warc.os.cdx.gz 4542 download
pastebin.com-shallow-20171029-070123-6oms5.json 255 download   job
twitter.com-shallow-20171029-124626-e1lwi-00000.warc.gz 1260001 download   job
twitter.com-shallow-20171029-124626-e1lwi-00000.warc.os.cdx.gz 4303 download
twitter.com-shallow-20171029-124626-e1lwi-meta.warc.gz 6364 download   job
twitter.com-shallow-20171029-124626-e1lwi-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20171029-124626-e1lwi.json 275 download   job
twitter.com-shallow-20171029-124650-3rboc-00000.warc.gz 1298005 download   job
twitter.com-shallow-20171029-124650-3rboc-00000.warc.os.cdx.gz 4324 download
twitter.com-shallow-20171029-124650-3rboc-meta.warc.gz 6348 download   job
twitter.com-shallow-20171029-124650-3rboc-meta.warc.os.cdx.gz 47 download
twitter.com-shallow-20171029-124650-3rboc.json 275 download   job
urls-a.uguu.se-Ot2nl8FP4AJ8_nn.txt-shallow-20171029-140405-d6ll3-urls.txt 165000 download
urls-a.uguu.se-Ot2nl8FP4AJ8_nn.txt-shallow-20171029-140405-d6ll3.json 294 download   job
urls-a.uguu.se-l2GRYDQy5xmW_nn.txt-shallow-20171029-124714-1863c-00000.warc.gz 1232659748 download   job
urls-a.uguu.se-l2GRYDQy5xmW_nn.txt-shallow-20171029-124714-1863c-00000.warc.os.cdx.gz 1458200 download
urls-a.uguu.se-l2GRYDQy5xmW_nn.txt-shallow-20171029-124714-1863c-meta.warc.gz 879009 download   job
urls-a.uguu.se-l2GRYDQy5xmW_nn.txt-shallow-20171029-124714-1863c-meta.warc.os.cdx.gz 47 download
urls-a.uguu.se-l2GRYDQy5xmW_nn.txt-shallow-20171029-124714-1863c-urls.txt 165000 download
urls-a.uguu.se-l2GRYDQy5xmW_nn.txt-shallow-20171029-124714-1863c.json 294 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-093721-c3p7b-00000.warc.gz 904649 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-093721-c3p7b-00000.warc.os.cdx.gz 6782 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-093721-c3p7b-meta.warc.gz 7098 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-093721-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-093721-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-093721-c3p7b.json 496 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-094310-c3p7b-00000.warc.gz 906628 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-094310-c3p7b-00000.warc.os.cdx.gz 6778 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-094310-c3p7b-meta.warc.gz 7100 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-094310-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-094310-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-094310-c3p7b.json 496 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-095410-c3p7b-00000.warc.gz 907738 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-095410-c3p7b-00000.warc.os.cdx.gz 6783 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-095410-c3p7b-meta.warc.gz 7091 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-095410-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-095410-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-095410-c3p7b.json 496 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100221-c3p7b-00000.warc.gz 907985 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100221-c3p7b-00000.warc.os.cdx.gz 6796 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100221-c3p7b-meta.warc.gz 7106 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100221-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100221-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100221-c3p7b.json 496 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100706-c3p7b-00000.warc.gz 907696 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100706-c3p7b-00000.warc.os.cdx.gz 6771 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100706-c3p7b-meta.warc.gz 7099 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100706-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100706-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-100706-c3p7b.json 496 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-101457-c3p7b-00000.warc.gz 909226 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-101457-c3p7b-00000.warc.os.cdx.gz 6765 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-101457-c3p7b-meta.warc.gz 7096 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-101457-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-101457-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-101457-c3p7b.json 496 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-104216-c3p7b-00000.warc.gz 912101 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-104216-c3p7b-00000.warc.os.cdx.gz 6753 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-104216-c3p7b-meta.warc.gz 7088 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-104216-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-104216-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-104216-c3p7b.json 496 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-105405-c3p7b-00000.warc.gz 914220 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-105405-c3p7b-00000.warc.os.cdx.gz 6788 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-105405-c3p7b-meta.warc.gz 7095 download   job
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-105405-c3p7b-meta.warc.os.cdx.gz 47 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-105405-c3p7b-urls.txt 9214 download
urls-gist.githubusercontent.com-gistfile1.txt-shallow-20171029-105405-c3p7b.json 496 download   job
urls-pastebin.com-7B8haU8k-shallow-20171029-103512-6gjrx-00000.warc.gz 1061906470 download   job
urls-pastebin.com-7B8haU8k-shallow-20171029-103512-6gjrx-00000.warc.os.cdx.gz 1335504 download
urls-pastebin.com-7B8haU8k-shallow-20171029-103512-6gjrx-meta.warc.gz 799141 download   job
urls-pastebin.com-7B8haU8k-shallow-20171029-103512-6gjrx-meta.warc.os.cdx.gz 47 download
urls-pastebin.com-7B8haU8k-shallow-20171029-103512-6gjrx-urls.txt 169998 download
urls-pastebin.com-7B8haU8k-shallow-20171029-103512-6gjrx.json 286 download   job
urls-pastebin.com-MGTeKC0n-shallow-20171029-113713-3vme5-00000.warc.gz 1253440209 download   job
urls-pastebin.com-MGTeKC0n-shallow-20171029-113713-3vme5-00000.warc.os.cdx.gz 1378769 download
urls-pastebin.com-MGTeKC0n-shallow-20171029-113713-3vme5-meta.warc.gz 826301 download   job
urls-pastebin.com-MGTeKC0n-shallow-20171029-113713-3vme5-meta.warc.os.cdx.gz 47 download
urls-pastebin.com-MGTeKC0n-shallow-20171029-113713-3vme5-urls.txt 169998 download
urls-pastebin.com-MGTeKC0n-shallow-20171029-113713-3vme5.json 286 download   job
webcache.googleusercontent.com-shallow-20171029-070219-bbxxu-00000.warc.gz 1949052 download   job
webcache.googleusercontent.com-shallow-20171029-070219-bbxxu-00000.warc.os.cdx.gz 6147 download
webcache.googleusercontent.com-shallow-20171029-070219-bbxxu.json 321 download   job
whedonesque.com-inf-20171026-082121-5tq6y-00012.warc.gz 5369002515 download   job
whedonesque.com-inf-20171026-082121-5tq6y-00012.warc.os.cdx.gz 4369806 download
www.8tv.cat-inf-20171029-074928-bx792-00000.warc.gz 291898189 download   job
www.8tv.cat-inf-20171029-074928-bx792-00000.warc.os.cdx.gz 537505 download
www.8tv.cat-inf-20171029-074928-bx792-meta.warc.gz 331879 download   job
www.8tv.cat-inf-20171029-074928-bx792-meta.warc.os.cdx.gz 47 download
www.8tv.cat-inf-20171029-074928-bx792.json 241 download   job
www.ara.cat-shallow-20171029-064944-5lig9-00000.warc.gz 11175619 download   job
www.ara.cat-shallow-20171029-064944-5lig9-00000.warc.os.cdx.gz 37492 download
www.ara.cat-shallow-20171029-064944-5lig9-meta.warc.gz 28036 download   job
www.ara.cat-shallow-20171029-064944-5lig9-meta.warc.os.cdx.gz 47 download
www.bcnisnotcat.es-shallow-20171029-065232-58kyr-00000.warc.gz 10815787 download   job
www.bcnisnotcat.es-shallow-20171029-065232-58kyr-00000.warc.os.cdx.gz 41772 download
www.bonjourkarl.com-shallow-20171029-111516-9vp8l-00000.warc.gz 5287618 download   job
www.bonjourkarl.com-shallow-20171029-111516-9vp8l-00000.warc.os.cdx.gz 4749 download
www.bonjourkarl.com-shallow-20171029-111516-9vp8l-meta.warc.gz 6335 download   job
www.bonjourkarl.com-shallow-20171029-111516-9vp8l-meta.warc.os.cdx.gz 47 download
www.bonjourkarl.com-shallow-20171029-111516-9vp8l.json 387 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00002.warc.gz 5395452199 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00002.warc.os.cdx.gz 7599 download
www.ccma.cat-inf-20171029-011426-1p44j-00003.warc.gz 5371101967 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00003.warc.os.cdx.gz 124174 download
www.ccma.cat-inf-20171029-011426-1p44j-00004.warc.gz 5978316137 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00004.warc.os.cdx.gz 54870 download
www.ccma.cat-inf-20171029-011426-1p44j-00005.warc.gz 5951158469 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00005.warc.os.cdx.gz 2843 download
www.ccma.cat-inf-20171029-011426-1p44j-00006.warc.gz 5479242082 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00006.warc.os.cdx.gz 2713 download
www.ccma.cat-inf-20171029-011426-1p44j-00007.warc.gz 6186987432 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00007.warc.os.cdx.gz 2332 download
www.ccma.cat-inf-20171029-011426-1p44j-00008.warc.gz 5973068972 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00008.warc.os.cdx.gz 1425 download
www.ccma.cat-inf-20171029-011426-1p44j-00009.warc.gz 6056857424 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00009.warc.os.cdx.gz 2028 download
www.ccma.cat-inf-20171029-011426-1p44j-00010.warc.gz 5474042208 download   job
www.ccma.cat-inf-20171029-011426-1p44j-00010.warc.os.cdx.gz 2263 download
www.ccma.cat-shallow-20171029-070027-c52eq-meta.warc.gz 14990 download   job
www.ccma.cat-shallow-20171029-070027-c52eq-meta.warc.os.cdx.gz 47 download
www.diarijornada.coop-shallow-20171029-074250-duggo-00000.warc.gz 10525998 download   job
www.diarijornada.coop-shallow-20171029-074250-duggo-00000.warc.os.cdx.gz 28197 download
www.diarijornada.coop-shallow-20171029-074250-duggo-meta.warc.gz 20185 download   job
www.diarijornada.coop-shallow-20171029-074250-duggo-meta.warc.os.cdx.gz 47 download
www.diarijornada.coop-shallow-20171029-074250-duggo.json 256 download   job
www.disneyabcpress.com-shallow-20171029-070310-5nchq-00000.warc.gz 1110805 download   job
www.disneyabcpress.com-shallow-20171029-070310-5nchq-00000.warc.os.cdx.gz 7648 download
www.dropbox.com-shallow-20171029-064243-buj1n-00000.warc.gz 129772290 download   job
www.dropbox.com-shallow-20171029-064243-buj1n-00000.warc.os.cdx.gz 466 download
www.dropbox.com-shallow-20171029-064243-buj1n-meta.warc.gz 3638 download   job
www.dropbox.com-shallow-20171029-064243-buj1n-meta.warc.os.cdx.gz 47 download
www.elpuntavui.tv-inf-20171029-102336-8cp11.json 247 download   job
www.jackheartserin.com-inf-20171029-114311-d3mkn.json 251 download   job
www.lifehack.org-inf-20171019-094354-4yr1a-00011.warc.gz 5376460867 download   job
www.lifehack.org-inf-20171019-094354-4yr1a-00011.warc.os.cdx.gz 4086716 download
www.lne.es-shallow-20171029-065037-9a85v-00000.warc.gz 1226364 download   job
www.lne.es-shallow-20171029-065037-9a85v-00000.warc.os.cdx.gz 5582 download
www.lne.es-shallow-20171029-065037-9a85v-meta.warc.gz 7411 download   job
www.lne.es-shallow-20171029-065037-9a85v-meta.warc.os.cdx.gz 47 download
www.lne.es-shallow-20171029-065108-5rdjc-00000.warc.gz 2693044 download   job
www.lne.es-shallow-20171029-065108-5rdjc-00000.warc.os.cdx.gz 13971 download
www.lne.es-shallow-20171029-065108-5rdjc-meta.warc.gz 11760 download   job
www.lne.es-shallow-20171029-065108-5rdjc-meta.warc.os.cdx.gz 47 download
www.moddb.com-shallow-20171029-065510-5q9p6-meta.warc.gz 6882 download   job
www.moddb.com-shallow-20171029-065510-5q9p6-meta.warc.os.cdx.gz 47 download
www.moddb.com-shallow-20171029-065541-7hjpv-00000.warc.gz 707576 download   job
www.moddb.com-shallow-20171029-065541-7hjpv-00000.warc.os.cdx.gz 2585 download
www.moddb.com-shallow-20171029-065541-7hjpv-meta.warc.gz 5026 download   job
www.moddb.com-shallow-20171029-065541-7hjpv-meta.warc.os.cdx.gz 47 download
www.moddb.com-shallow-20171029-065606-2scrr-00000.warc.gz 54075 download   job
www.moddb.com-shallow-20171029-065606-2scrr-00000.warc.os.cdx.gz 409 download
www.naciodigital.cat-inf-20170919-214300-247yw-00068.warc.gz 5368728006 download   job
www.naciodigital.cat-inf-20170919-214300-247yw-00068.warc.os.cdx.gz 6936053 download
www.network54.com-inf-20171029-104719-f2o98.json 253 download   job
www.rac105.cat-inf-20171029-101842-9e0bi.json 244 download   job
www.resetera.com-inf-20171027-095822-dpp92-00009.warc.gz 5371021499 download   job
www.resetera.com-inf-20171027-095822-dpp92-00009.warc.os.cdx.gz 196729 download
www.resetera.com-inf-20171027-095822-dpp92-00010.warc.gz 5372098049 download   job
www.resetera.com-inf-20171027-095822-dpp92-00010.warc.os.cdx.gz 337262 download
www.resetera.com-inf-20171027-095822-dpp92-00011.warc.gz 3363548831 download   job
www.resetera.com-inf-20171027-095822-dpp92-00011.warc.os.cdx.gz 160406 download
www.resetera.com-inf-20171027-095822-dpp92.json 247 download   job
www.vilaweb.cat-shallow-20171029-065140-2d77d-00000.warc.gz 6488177 download   job
www.vilaweb.cat-shallow-20171029-065140-2d77d-00000.warc.os.cdx.gz 17283 download
www.vilaweb.cat-shallow-20171029-070001-dbhkc-00000.warc.gz 6688572 download   job
www.vilaweb.cat-shallow-20171029-070001-dbhkc-00000.warc.os.cdx.gz 18843 download
www.vilaweb.cat-shallow-20171029-070001-dbhkc-meta.warc.gz 14560 download   job
www.vilaweb.cat-shallow-20171029-070001-dbhkc-meta.warc.os.cdx.gz 47 download
www.vilaweb.cat-shallow-20171029-070001-dbhkc.json 328 download   job