#archiveteam 2013-12-15,Sun

Time	Nickname	Message
04:36 ^🔗	chfoo	has yahoo blogs and wretch been saved yet?
08:07 ^🔗	arkiver	I have no idea if those have been saved already
08:20 ^🔗	chfoo	hmm, the yahoo blog and wretched channels are op-less. everyone join #shipwretched! please
08:25 ^🔗	chfoo	---> #shipwretched! <--- for yahoo blogs and wretch
08:32 ^🔗	arkiver	I'm there
08:32 ^🔗	arkiver	I'll take a look at those websites now
08:41 ^🔗	arkiver	is someone who archived Fileplanet available here right now?
12:39 ^🔗	chfoo	oh yeah, it's #shipwretched (no exclamation mark)
14:21 ^🔗	arkiver	linea is going away on december 15th
14:21 ^🔗	arkiver	so I'm downloading these:
14:21 ^🔗	arkiver	http://blog.getlinea.com/
14:21 ^🔗	arkiver	http://info.getlinea.com/
14:21 ^🔗	arkiver	https://www.getlinea.com/
14:24 ^🔗	arkiver	also
14:25 ^🔗	arkiver	I'm doing a full pastebin grab
14:25 ^🔗	arkiver	not just of the urls with the codes
14:25 ^🔗	arkiver	but the full site
14:25 ^🔗	arkiver	crawl
15:10 ^🔗	godane	arkiver: grab this too: http://www.youtube.com/user/GetLinea
15:44 ^🔗	godane	i need some help with this: http://computerpoweruser.com/articles/archive/G0803/36g03/36g03.asp?guid=
15:45 ^🔗	godane	this error is in it:
15:45 ^🔗	godane	The include file '/includes/security.inc' was not found.
15:45 ^🔗	godane	/articles/archive/G0803/36g03/36g03.asp, line 3
15:45 ^🔗	godane	that tells me the file still exists
16:11 ^🔗	antomatic	Thinking....
16:11 ^🔗	antomatic	Why don't we archive EBAY?
16:11 ^🔗	antomatic	As a rolling, ongoing project?
16:11 ^🔗	antomatic	grabbing new item description pages, etc.
16:11 ^🔗	antomatic	archiving them for eternity instead of ebay's usual 90 days (or so)
16:12 ^🔗	Smiley	could be a nice rolling project.
16:12 ^🔗	Smiley	if we can get it set up with minimal administration then it'd be great for idle warriors.
17:08 ^🔗	godane	i added cookies to my steam app page dump
17:09 ^🔗	godane	turns out it's needed for games that have an M rating
17:09 ^🔗	godane	otherwise i don't get the page
17:20 ^🔗	Nova_	WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
17:21 ^🔗	Nova_	anyone?
17:21 ^🔗	Nova_	lol
17:22 ^🔗	Nova_	no one? :(
17:22 ^🔗	Smiley	Nova_: yahoosucks
17:22 ^🔗	Nova_	thanks.
17:22 ^🔗	Smiley	no worries
17:24 ^🔗	Nova_	a site I want to go on is archived, is there a way I can get on it so that I can look through old photos of me and my friends and stuff?
17:25 ^🔗	Smiley	Nova_: which site?
17:25 ^🔗	Nova_	hyves.nl
17:25 ^🔗	Smiley	not sure if it's gone into wayback yet..... hmmm
17:25 ^🔗	Smiley	we normally get some kind of browsing page setup... however i don't know if it's been done yet
17:25 ^🔗	Smiley	the guy who was running the project isn't here atm
17:26 ^🔗	Nova_	ah okay I understand, but someday it will go online? I am not in a hurry so yeah.
17:26 ^🔗	Smiley	yup
17:26 ^🔗	Smiley	might already be listed on archive.org somewhere
17:27 ^🔗	Smiley	https://archive.org/details/hyves << it'll be in there somewhere.
17:49 ^🔗	arkiver	antomatic, that's a good idea!
17:49 ^🔗	arkiver	I would like to start downloading ebay
17:49 ^🔗	arkiver	but do you think 10 GB of memory is enough for an ebay download?
17:50 ^🔗	arkiver	Nova_ What was the exact name of your hyves url?
18:01 ^🔗	arkiver	If you are archiving a big website or know a website which is going to die, please add it here to the list: http://archiveteam.org/index.php?title=Projects
18:20 ^🔗	arkiver	IMPORTANT QUESTION: is winamp already fully downloaded???
18:26 ^🔗	yipdw	well, www.winamp.com is
18:26 ^🔗	yipdw	http://archivebot.at.ninjawedding.org:4567/#/histories/http://www.winamp.com/
18:26 ^🔗	yipdw	forums.winamp.com, not sure
18:26 ^🔗	arkiver	well
18:26 ^🔗	arkiver	I got the following domains listed:
18:27 ^🔗	arkiver	http://blog.winamp.com/
18:27 ^🔗	arkiver	http://dev.winamp.com/
18:27 ^🔗	arkiver	http://forums.winamp.com/
18:27 ^🔗	arkiver	http://www.winamp.com/
18:27 ^🔗	yipdw	plug them into IA and see what their snapshots are
18:27 ^🔗	arkiver	I'm downloading them all again just to be 100% sure they are really downloaded
18:27 ^🔗	yipdw	or plug them into that histories URL
18:29 ^🔗	arkiver	http://dev.winamp.com/ and http://blog.winamp.com/ are downloaded by the archivebot, but they are very small???
18:29 ^🔗	arkiver	not sure if they are downloaded 100%...
18:29 ^🔗	yipdw	if it doesn't say "aborted", it's done
18:30 ^🔗	yipdw	with the exception of links not discoverable by wget
18:30 ^🔗	yipdw	also, check IA
18:30 ^🔗	yipdw	there are snapshots for all sites dating to December 12
18:30 ^🔗	yipdw	which is pretty recent
18:30 ^🔗	arkiver	yes I saw it
18:30 ^🔗	arkiver	it's probably ok then
18:30 ^🔗	arkiver	:)
18:30 ^🔗	arkiver	pfieuw
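
The "plug them into IA and see what their snapshots are" step above can also be scripted. The following is only a rough sketch, assuming the Internet Archive's public Wayback availability endpoint and its usual JSON shape (`archived_snapshots` / `closest`); the helper names `availability_url`, `closest_snapshot` and `check` are made up for illustration and are not something anyone in the channel used.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Wayback Machine availability endpoint (assumed here, not named in the log).
API = "https://archive.org/wayback/available"


def availability_url(site, timestamp=None):
    """Build the query URL for one site, optionally near a YYYYMMDD date."""
    params = {"url": site}
    if timestamp:
        params["timestamp"] = timestamp
    return API + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return (timestamp, snapshot_url) from a JSON response, or None."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if not closest or not closest.get("available"):
        return None
    return closest["timestamp"], closest["url"]


def check(site, timestamp=None):
    """Ask the endpoint over the network and parse the answer."""
    with urlopen(availability_url(site, timestamp), timeout=30) as resp:
        return closest_snapshot(resp.read().decode("utf-8"))
```

Calling `check("forums.winamp.com", "20131215")` for each of the four winamp hosts would show the snapshot closest to that date, if one exists. A snapshot of the front page says nothing about how much of the site behind it was captured.
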
18:30 ^🔗	yipdw	if you want to do another one, I suggest forums.winamp.com
18:30 ^🔗	yipdw	be aware that that requires a lot of space
18:31 ^🔗	arkiver	yes I'm doing all 4 again
18:31 ^🔗	arkiver	http://archiveteam.org/index.php?title=Projects
18:31 ^🔗	arkiver	see the first line here from the table:
18:31 ^🔗	arkiver	but I need to go now
18:31 ^🔗	arkiver	will let you know how my download goes
18:31 ^🔗	arkiver	and I hope I will be finished by the 20th of december
18:31 ^🔗	arkiver	(which I doubt...)
18:32 ^🔗	arkiver	(since it's the whole forum...)
18:32 ^🔗	arkiver	brb
19:39 ^🔗	dashcloud	interesting article and project: http://arstechnica.com/information-technology/2013/12/british-library-sticks-1-million-pics-on-flickr-asks-for-help-making-them-useful/ photos here: http://www.flickr.com/photos/britishlibrary
19:45 ^🔗	BiggieJ	ohhhh archivebot . . . .
19:45 ^🔗	Smiley	i'm not sure he'll grab flickr
19:45 ^🔗	Smiley	jdownloader will
19:48 ^🔗	BiggieJ	youtube-dl says it handles flickr too
19:48 ^🔗	yipdw	the problem with flickr, as with many other sites these days, is that retrieving URLs from the web interface requires an event loop that executes page stuff
19:48 ^🔗	yipdw	if someone has a good way to do this in a stable manner, a pull request would be good
19:48 ^🔗	balrog	does google have some way of doing this for their cache?
19:49 ^🔗	Smiley	][#'aszxxxxxhnjbgflk ;http://www.newstatesman.com/sci-tech/2013/12/trawling-dark-web
19:49 ^🔗	Smiley	wow my cat is good at typing
19:56 ^🔗	arkiver	I'll take a look to see if I can download that flickr account...
20:01 ^🔗	arkiver	will leave it running for some time
20:01 ^🔗	arkiver	and then I'll test a warc.gz file to see if it is going ok
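
One way to test a WARC .gz file partway through a grab is to decompress it end to end and count the record headers. This is a minimal sketch using only Python's standard library, not the check arkiver actually ran; `count_warc_records` and `check_warc` are hypothetical names, and the count is approximate because a payload line that happens to read `WARC/1.0` would also be counted.

```python
import gzip
import zlib


def count_warc_records(path):
    """Decompress the whole file and count lines that look like WARC headers.

    gzip.open reads multi-member gzip files, which is how per-record
    compressed WARCs are normally written.
    """
    records = 0
    with gzip.open(path, "rb") as fh:
        for line in fh:
            if line.rstrip(b"\r\n") in (b"WARC/1.0", b"WARC/1.1"):
                records += 1
    return records


def check_warc(path):
    """Return (ok, detail): the record count, or the error message."""
    try:
        return True, count_warc_records(path)
    except (OSError, EOFError, zlib.error) as exc:
        # Truncated or corrupt gzip data ends up here.
        return False, str(exc)
```

A file that is still being written will usually report a truncation error for its last record, so a failure on an in-progress file is not necessarily a sign of a bad grab.
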