Time |
Nickname |
Message |
04:36
🔗
|
chfoo |
has yahoo blogs and wretch been saved yet? |
08:07
🔗
|
arkiver |
I have no idea if those have been saved already |
08:20
🔗
|
chfoo |
hmm, the yahoo blog and wretched channels are op-less. everyone join #shipwretched! please |
08:25
🔗
|
chfoo |
---> #shipwretched! <--- for yahoo blogs and wretch |
08:32
🔗
|
arkiver |
I'm there |
08:32
🔗
|
arkiver |
I'll take a look at those webistes now |
08:41
🔗
|
arkiver |
is someone who archived Fileplanet availabel here right now? |
12:39
🔗
|
chfoo |
oh yeah, it's #shipwretched (no exclamation mark) |
14:21
🔗
|
arkiver |
linea is going away on the december 15th |
14:21
🔗
|
arkiver |
so I'm downloading these: |
14:21
🔗
|
arkiver |
http://blog.getlinea.com/ |
14:21
🔗
|
arkiver |
http://info.getlinea.com/ |
14:21
🔗
|
arkiver |
https://www.getlinea.com/ |
14:24
🔗
|
arkiver |
also |
14:25
🔗
|
arkiver |
I'm doing a full pastebin grab |
14:25
🔗
|
arkiver |
not just of the urls with the codes |
14:25
🔗
|
arkiver |
but the full site |
14:25
🔗
|
arkiver |
crawl |
15:10
🔗
|
godane |
arkiver: grab this too: http://www.youtube.com/user/GetLinea |
15:44
🔗
|
godane |
i need some help with this: http://computerpoweruser.com/articles/archive/G0803/36g03/36g03.asp?guid= |
15:45
🔗
|
godane |
this error is in it: |
15:45
🔗
|
godane |
The include file '/includes/security.inc' was not found. |
15:45
🔗
|
godane |
/articles/archive/G0803/36g03/36g03.asp, line 3 |
15:45
🔗
|
godane |
that tells me the file still exist |
16:11
🔗
|
antomatic |
Thinking.... |
16:11
🔗
|
antomatic |
Why don't we archive EBAY? |
16:11
🔗
|
antomatic |
As a rolling, ongoing project? |
16:11
🔗
|
antomatic |
grabbing new item description pages, etc. |
16:11
🔗
|
antomatic |
archiving them for eternity instead of ebay's usual 90 days (or so) |
16:12
🔗
|
Smiley |
could be a nice rolling project. |
16:12
🔗
|
Smiley |
if we can get it setup with minimal admistration then it'd be great for idle warriors. |
17:08
🔗
|
godane |
i added cookies to my steam app page dump |
17:09
🔗
|
godane |
turns out its needed for games that a M rating |
17:09
🔗
|
godane |
other wise i don't get the page |
17:20
🔗
|
Nova_ |
WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD |
17:21
🔗
|
Nova_ |
anyone? |
17:21
🔗
|
Nova_ |
lol |
17:22
🔗
|
Nova_ |
no one? :( |
17:22
🔗
|
Smiley |
Nova_: yahoosucks |
17:22
🔗
|
Nova_ |
thanks. |
17:22
🔗
|
Smiley |
no worries |
17:24
🔗
|
Nova_ |
a site I want to go on is archived, is there a way I can get on it so that I can look trough old photos of me and my friends and stuff? |
17:25
🔗
|
Smiley |
Nova_: which site? |
17:25
🔗
|
Nova_ |
hyves.nl |
17:25
🔗
|
Smiley |
not sure if it's gone into wayback yet..... hmmm |
17:25
🔗
|
Smiley |
we normally get some kind of browsing page setup... however i don't know if it's been done yet |
17:25
🔗
|
Smiley |
the guy who was running the project isn't here atm |
17:26
🔗
|
Nova_ |
ah okay I understand, but someday it will go online? I am not in a hurry so yeah. |
17:26
🔗
|
Smiley |
yup |
17:26
🔗
|
Smiley |
might already be listed on archive.org somewhere |
17:27
🔗
|
Smiley |
https://archive.org/details/hyves << it'll be in there somewhere. |
17:49
🔗
|
arkiver |
antomic, that's a good idea! |
17:49
🔗
|
arkiver |
I would like to start download ebay |
17:49
🔗
|
arkiver |
but do you think 10 GB memory is enough for ebay download? |
17:50
🔗
|
arkiver |
Nova_ What was the exact name of your hyves url? |
18:01
🔗
|
arkiver |
If you are archiving a big website or know a website which is going to die, please add it here to the list: http://archiveteam.org/index.php?title=Projects |
18:20
🔗
|
arkiver |
IMPORTANT QUESTION: is winamp already fully download??? |
18:26
🔗
|
yipdw |
well, www.winamp.com is |
18:26
🔗
|
yipdw |
http://archivebot.at.ninjawedding.org:4567/#/histories/http://www.winamp.com/ |
18:26
🔗
|
yipdw |
forums.winamp.com, not sure |
18:26
🔗
|
arkiver |
well |
18:26
🔗
|
arkiver |
I got the following domains listed: |
18:27
🔗
|
arkiver |
http://blog.winamp.com/ |
18:27
🔗
|
arkiver |
http://dev.winamp.com/ |
18:27
🔗
|
arkiver |
http://forums.winamp.com/ |
18:27
🔗
|
arkiver |
http://www.winamp.com/ |
18:27
🔗
|
yipdw |
plug them into IA and see what their snapshots are |
18:27
🔗
|
arkiver |
I'm downloading them all again just to be 100% sure they are really downloaded |
18:27
🔗
|
yipdw |
or plug them into that histories URL |
18:29
🔗
|
arkiver |
http://dev.winamp.com/ and http://blog.winamp.com/ are downloaded by the archivebot, but they are very small??? |
18:29
🔗
|
arkiver |
not sure if they are downloaded 100%... |
18:29
🔗
|
yipdw |
if it doesn't say "aborted", it's done |
18:30
🔗
|
yipdw |
with the exception of links not discoverable by wget |
18:30
🔗
|
yipdw |
also, check IA |
18:30
🔗
|
yipdw |
there are snapshots for all sites dating to December 12 |
18:30
🔗
|
yipdw |
which is pretty recent |
18:30
🔗
|
arkiver |
yes I saw it |
18:30
🔗
|
arkiver |
it's probably ok then |
18:30
🔗
|
arkiver |
:) |
18:30
🔗
|
arkiver |
pfieuw |
18:30
🔗
|
yipdw |
if you want to do another one, I suggest forums.winamp.com |
18:30
🔗
|
yipdw |
be aware that that requires a lot of space |
18:31
🔗
|
arkiver |
yes I'm doing all 4 again |
18:31
🔗
|
arkiver |
http://archiveteam.org/index.php?title=Projects |
18:31
🔗
|
arkiver |
see the first line here from the table: |
18:31
🔗
|
arkiver |
but I need to go now |
18:31
🔗
|
arkiver |
will let you know how my download goes |
18:31
🔗
|
arkiver |
and I hope I will be finished by the 20th of december |
18:31
🔗
|
arkiver |
(which I doubt...) |
18:32
🔗
|
arkiver |
(since it's whole full forum...) |
18:32
🔗
|
arkiver |
brb |
19:39
🔗
|
dashcloud |
interesting article and project: http://arstechnica.com/information-technology/2013/12/british-library-sticks-1-million-pics-on-flickr-asks-for-help-making-them-useful/ photos here: http://www.flickr.com/photos/britishlibrary |
19:45
🔗
|
BiggieJ |
ohhhh archivebot . . . . |
19:45
🔗
|
Smiley |
im not sure he'll grqb flickr |
19:45
🔗
|
Smiley |
jdownloader will |
19:48
🔗
|
BiggieJ |
youtube-dl says it handles flickr too |
19:48
🔗
|
yipdw |
the problem with flickr, as with many other sites these days, is that retrieving URLs from the web interface requires an event loop that executes page stuff |
19:48
🔗
|
yipdw |
if someone has a good way to do this in a stable manner, a pull request would be good |
19:48
🔗
|
balrog |
does google have some way of doing this for their cache? |
19:49
🔗
|
Smiley |
][#'aszxxxxxhnjbgflk ;http://www.newstatesman.com/sci-tech/2013/12/trawling-dark-web |
19:49
🔗
|
Smiley |
wow my cat is good at typing |
19:56
🔗
|
arkiver |
I'll take a look at it if I can download that flickr account... |
20:01
🔗
|
arkiver |
will leave it runnig for some time |
20:01
🔗
|
arkiver |
and then I'll test a wrc.gz file if it is going ok |