#archiveteam-bs 2017-06-12,Mon

(Showing only lines that contain URLs)

Who | What | When
arkiver | https://tracker.archiveteam.org/imzy/ | [01:14]
jrwr | http://internetarchive.readthedocs.io/en/latest/cli.html | [01:44]
 | https://github.com/internetarchive/warcprox | [01:48]
MrRadar | arkiver: I'm getting an infinite sequence of HTTP 206 responses for a URL for Imzy: 87=206 https://www.imzy.com/api/accounts/profiles/mcnulty?check=true | [03:06]
Lord_Nigh | see https://twitter.com/TheMogMiner/status/873950228994502658 | [05:35]
 | the "directions on how to automatically exclude your site" was a link to https://web.archive.org/web/20130606003203/http://archive.org/about/exclude.php |
 | faq from 6/2013 is https://web.archive.org/web/20130606003203/http://archive.org/about/faqs.php | [05:48]
 | so this is why i am majorly concerned, especially since the https://twitter.com/TheMogMiner/status/873950228994502658 text implies there may be a legal/court order preventing IA staff from talking about it, let alone fixing it | [05:54]
*** | dcmorton has quit IRC (Quit: ZNC - http://znc.in) | [12:51]
godane | https://archive.org/details/godaneinbox?sort=-publicdate&&and[]=subject%3A%22Sports%20Illustrated%22 | [17:14]
JAA | bsmith093: Although you're right in that several smaller files are easier to handle, you can browse the contents of tar and zip archives on the IA and only download individual files from it. For example, https://archive.org/download/fanfictiondotnet_repack/Fanfiction_Q.zip/ (Don't attempt to do this with the larger zips though; my browser was not amused.) | [17:25]
tsp_ | PurpleSym: I'm there (http://archive.org/history/misc_web_rips). Doesn't seem like there are any tasks, though I can't tell if "server readonly -- tasks waiting for harddrive fix" is a task or not. | [18:29]
PurpleSym | You probably meant misc_website_rips: https://catalogd.archive.org/log/682094076 | [18:32]
bsmith093 | tapedrive: i also just uploaded this one recently https://archive.org/details/Fanfictiondotnet1011dump | [19:30]
tsp_ | How should I turn a URL into an identifier? I can tell which page I started from, for example: http://home.att.net/~polliwog-press/pollistoryindex.htm | [20:39]
Lord_Nigh | https://archive.org/about/faqs.php <- some of the content removal requests in the forum at the bottom of that page are weird... one page was about what looks like a foreign-language 'carfacts' equivalent site for a specific vehicle, which was not actually taken down... | [21:07]
 | huh, this might be related to the robots.txt thing: https://archive.org/post/1074464/robotstxt-processing-failure although that might just be 'collateral damage' from the recent change and not a bug? | [21:09]
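
(Editor's sketch for tsp_'s question at [20:39] about turning a URL into an identifier. The log does not record an answer, so this is only one possible approach and not an Archive Team convention. The function name `url_to_identifier`, the `prefix` parameter, and the 100-character cap are assumptions made for illustration; it also assumes identifiers should stay within ASCII letters, digits, `.`, `_` and `-`.)

```python
import re
from urllib.parse import urlsplit


def url_to_identifier(url: str, prefix: str = "", max_len: int = 100) -> str:
    """Build a filesystem- and URL-safe slug from the host and path of a URL.

    Hypothetical helper: the character set and length cap are assumptions,
    not rules taken from the log.
    """
    parts = urlsplit(url)
    raw = f"{parts.netloc}{parts.path}"
    # Collapse every run of disallowed characters (e.g. "/~") into one dash.
    slug = re.sub(r"[^A-Za-z0-9._-]+", "-", raw).strip("-._")
    if prefix:
        slug = f"{prefix}-{slug}"
    # Truncate, then avoid ending on a separator character.
    return slug[:max_len].rstrip("-._")


if __name__ == "__main__":
    print(url_to_identifier(
        "http://home.att.net/~polliwog-press/pollistoryindex.htm"
    ))
    # prints: home.att.net-polliwog-press-pollistoryindex.htm
```

The query string and scheme are dropped on purpose so that http and https variants of the same starting page map to the same slug; a collision check against existing items would still be needed before uploading.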
