Time |
Nickname |
Message |
00:04
🔗
|
SketchCow |
Kazaa supported Gnutella. |
00:30
🔗
|
dashcloud |
not sure how many people know about https://guerrillamail.com , but it's been great when I've needed to get email accounts real quick for testing- it auto-creates a box, and any email only lasts 60 minutes |
00:35
🔗
|
BlueMax |
hahahaha |
00:35
🔗
|
BlueMax |
"blablabla@sharklasers.com" |
00:36
🔗
|
BlueMax |
I love it |
00:47
🔗
|
Coderjoe |
dashcloud: sounds like mailinator.com in a way |
00:48
🔗
|
xmc |
or tenminutemail |
01:20
🔗
|
SketchCow |
Helvetica in the streets but a Wingdings in the sheets. |
01:24
🔗
|
BlueMax |
A crazy mess of characters? :P |
02:01
🔗
|
BlueMax |
oh wow didn't know DownThemAll could export lists of links |
02:04
🔗
|
BlueMax |
anyone willing to take these lists of links and download/upload them to somewhere else? would do it myself if it wouldn't take two weeks to upload |
02:14
🔗
|
BlueMax |
They're servers hosting DOOM singleplayer and multiplayer WADs. |
02:34
🔗
|
BlueMax |
http://paste.archivingyoursh.it/heferiroha.avrasm | http://paste.archivingyoursh.it/yurerewogu.avrasm | http://paste.archivingyoursh.it/kuxacifaxe.avrasm forgot that GLaDOS had a pastebin. |
02:35
🔗
|
DFJustin |
those could just be crawled with archivebot |
02:35
🔗
|
DFJustin |
point it at the folder |
02:35
🔗
|
DFJustin |
granted it's tied up with winamp jobs for a while |
02:36
🔗
|
BlueMax |
didn't know that's what archivebot was for. |
02:36
🔗
|
BlueMax |
wait. |
02:36
🔗
|
BlueMax |
I'm dumb aren't I |
02:38
🔗
|
SketchCow |
Yeah, that's pretty non-observant |
02:38
🔗
|
DFJustin |
it'll make a warc but you can just warctozip it later for hosting more directly |
02:38
🔗
|
BlueMax |
Fully deserving of the bollocking I'd usually get for that one |
02:39
🔗
|
BlueMax |
DFJustin, what do you mean? Where does it host the warc it makes? |
02:39
🔗
|
yipdw |
fos |
02:39
🔗
|
yipdw |
then eventually to IA |
02:40
🔗
|
yipdw |
assuming it doesn't crash from overload |
02:40
🔗
|
yipdw |
(it's happened a couple of times) |
02:40
🔗
|
DFJustin |
they eventually get dumped in items like this https://archive.org/details/archiveteam_archivebot_go_003 |
02:40
🔗
|
DFJustin |
which the wayback machine can then pull from |
02:40
🔗
|
BlueMax |
ah OK |
02:40
🔗
|
SketchCow |
Archivebot was the result of xmc and I brainstorming where archive team could use some automation. We decided that it was one-off, smaller (sub-gigabyte) websites thast people would mention and then we had whoever was sitting around do however they thought WARCing was done. |
02:41
🔗
|
BlueMax |
These servers aren't sub-gigabyte |
02:41
🔗
|
SketchCow |
And then yipdw really made it his own, and the bot does the best practices, and then gives it to IA to add into the wayback. |
02:41
🔗
|
DFJustin |
BlueMax: neither is most of the stuff we've been cramming down archivebot's maw |
02:41
🔗
|
SketchCow |
The bot has limits. Larger things should be done elsewhere, but people use it that way anyway, because easy. |
02:42
🔗
|
SketchCow |
I'm just saying what it was designed for. |
02:42
🔗
|
SketchCow |
This pair of scissors is designed to cut paper, but I'm going to stab you with them anyway |
02:42
🔗
|
BlueMax |
Fair enough, I just don't want to overload the bot if it's doing anything important like WinAMP |
02:42
🔗
|
yipdw |
<GeneKrantz> I don't care what it was designed to do, I care about what it can do |
02:42
🔗
|
* |
BlueMax hides |
02:42
🔗
|
SketchCow |
It's not how much you want to eat, it's how much you CAN eat |
02:43
🔗
|
yipdw |
anyway, we're doing okay on archivebot so far |
02:43
🔗
|
yipdw |
I can turn on another swap file |
02:43
🔗
|
yipdw |
heh |
02:43
🔗
|
BlueMax |
alright, well, if you're fine with it, how do I load the URLs into the archivebot |
02:43
🔗
|
yipdw |
oh, uh |
02:43
🔗
|
yipdw |
currently there is no mass load thing |
02:43
🔗
|
yipdw |
I can do that for now |
02:44
🔗
|
BlueMax |
does it work if I link a single page like http://static.best-ever.org/wads/ to the bot |
02:44
🔗
|
yipdw |
yes |
02:44
🔗
|
yipdw |
actually, that's the recommended usage |
02:44
🔗
|
BlueMax |
that's how I got the text lists I posted above |
02:45
🔗
|
yipdw |
https://github.com/ArchiveTeam/ArchiveBot/issues/14 is a mass-loader but I haven't really gotten around to it |
02:45
🔗
|
BlueMax |
fair enough |
02:46
🔗
|
yipdw |
oh hey |
02:46
🔗
|
yipdw |
it finished winamp |
02:46
🔗
|
yipdw |
neat |
02:46
🔗
|
BlueMax |
cool, can I jump in then? :P |
02:49
🔗
|
yipdw |
yeah |
04:04
🔗
|
BlueMax |
I talked to one of the WAD sources I wanted to back up, but he seemed unwilling to let me attempt to make a backup of his files. http://paste.archivingyoursh.it/kanowicejo.xml only reason I asked was because there's no public link list for his server, it's pure cluster-bomb guesswork to know what files he does have on there |
04:07
🔗
|
BlueMax |
should I try talking to him again later on or leave it |
04:41
🔗
|
SketchCow |
I'm all up for more BBS material. |
04:41
🔗
|
arkhive |
k |
04:41
🔗
|
arkhive |
:) |
04:41
🔗
|
arkhive |
Yeah. I was really excited to come home and tell you(might be weird lol) |
04:42
🔗
|
arkhive |
and he has about 150 more 3.25" floppy disks |
04:42
🔗
|
arkhive |
but he said look through them and let him know. :) |
04:43
🔗
|
arkhive |
he had a lot of old manuals from 80's too |
04:43
🔗
|
BlueMax |
Sniffin' for treasure me hearty. |
04:44
🔗
|
arkhive |
To all AT: I strongly recommend posting an ad on Craigslist in the Computers by owner and the Wanted section looking for FREE floppies or other stuff.. People have a ton of stuff that they'd otherwise throw out. but can be rescued |
04:45
🔗
|
BlueMax |
should add that to the -bs topic. |
04:45
🔗
|
arkhive |
sometimes your ad will get flagged by some people and removed. but just repost :) Also, I was sad to find out my Dad recycled a shitload of 5.25" floppy games I played with my sister when we were little. heh like spellbound and midnight rescue by the learning company |
04:47
🔗
|
arkhive |
he got rid of probably 30.. and a few years ago(like 5?) I recycled a shitload of stuff(my sis and i computer when i was 7, old computers with a turbo button haha, floppies) before I started getting into this stuff. |
04:48
🔗
|
arkhive |
But, SketchCow, can you dump/digitize them? I can mail them this week if you'd like. :) |
04:49
🔗
|
SketchCow |
I can |
04:51
🔗
|
arkhive |
cool. Can I also send about 500 more(commodore 64, apple 5.25" disks, and such. ) Or do you recommend me sending it to Cowering/some guy named Al at the Silicon valley computer museum, still? |
04:51
🔗
|
SketchCow |
Or me. |
04:51
🔗
|
SketchCow |
I have a hell of a backlog but I will work through said backlog |
04:51
🔗
|
SketchCow |
https://www.youtube.com/watch?v=E9XQ2MdNgKY |
05:31
🔗
|
Coderjoe |
did the winamp grab include the program, or just plugins and skins and the like? |
05:32
🔗
|
DFJustin |
it's getting the program too |
05:33
🔗
|
DFJustin |
watch download.nullsoft.com at http://archivebot.at.ninjawedding.org:4567/ |
05:47
🔗
|
SketchCow |
Another group wrote me, with, essentially "So, we'd love to have a chat about DOWNLOADING FACEBOOK AND TWITTER" |
05:47
🔗
|
SketchCow |
I sent them to #archiveteam, we'll see if they show |
06:25
🔗
|
ersi |
Hoho, that'll be interesting |
07:11
🔗
|
Coderjoe |
grr |
07:12
🔗
|
Coderjoe |
i've been using noscript, and have hit a couple of sites the display absolutely nothing without javascript. there have been others that display nearly nothing but a message to turn JS on. |
07:13
🔗
|
Coderjoe |
and i'm not talking about things like the leaderboard or warrior dashboard |
07:26
🔗
|
Lord_Nigh |
i know. its annoying as hell |
07:26
🔗
|
Lord_Nigh |
noscript itself has built in workarounds for some sites |
07:26
🔗
|
Lord_Nigh |
but it doesn't cover everything |
12:36
🔗
|
dashcloud |
BlueMax: uploading the ftp.fu-berlin.de idgames grab now (will be a little while at 32 GB) |
12:42
🔗
|
BlueMax |
jeez, that idgames folder takes up 2/3rds of the FTP |
12:49
🔗
|
BlueMax |
dashcloud, what's your opinion on this: I talked to one of the WAD sources I wanted to back up, but he seemed unwilling to let me attempt to make a backup of his files. http://paste.archivingyoursh.it/kanowicejo.xml only reason I asked was because there's no public link list for his server, it's pure cluster-bomb guesswork to know what files he does have on there |
12:55
🔗
|
dashcloud |
don't know |
20:15
🔗
|
w0rp |
I was trying to take a copy of 240GB of raw photos at work today, and I was handed this hard disk I just could not get to work with anything I tried. I tried two Linux machines and a Mac desktop. It apparely works fine on the guy's Mac laptop. |
20:16
🔗
|
w0rp |
It was also somehow a NAS, and it was from some company I've never heard of before. |
21:06
🔗
|
ivan` |
Coderjoe: for blogspot dynamic view sites, you can give google cache the URL and it will respond with HTML |
21:06
🔗
|
ivan` |
Coderjoe: I've been thinking about making some sort of HTTP proxy that uses a headless webkit to render and sends the resulting DOM to Firefox |
21:23
🔗
|
godane |
uploaded: https://archive.org/details/cdrom-linuxformatmagazine-175 |
21:36
🔗
|
nico_32 |
godane: can you help me ? |
21:36
🔗
|
nico_32 |
i am trying to upload an item to archive.org |
21:36
🔗
|
nico_32 |
with the old ftp interface |
21:36
🔗
|
nico_32 |
i went to the https://archive.org/checkin/ url |
21:36
🔗
|
nico_32 |
first time i got a empty page |
21:36
🔗
|
nico_32 |
now i got The identifier chosen is already taken. You will need to try an alternate identifier |
21:37
🔗
|
nico_32 |
the unit name is CedricBlancherTribute |
21:49
🔗
|
midas1 |
http://techcrunch.com/2013/11/21/source-microsoft-in-talks-to-buy-shoutcast-and-winamp-from-aol/ |
21:50
🔗
|
midas1 |
this is the important part: We have also learned that AOL has been planning to announce the closure of Shoutcast next week |
21:50
🔗
|
Coderjoe |
not terribly surprised |
21:51
🔗
|
midas1 |
nope, was to be expected |
22:03
🔗
|
SketchCow |
Oh, cool. |
22:03
🔗
|
SketchCow |
That thing I linked in #archiveteam an hour ago |
22:04
🔗
|
nico_32 |
anyone can help me with my ia issue ? |
22:06
🔗
|
SketchCow |
What is your ia issue. |
22:08
🔗
|
nico_32 |
i uploading a warc+cdc with the ftp interface |
22:08
🔗
|
nico_32 |
and i got a empty page when i tried to checkin it |
22:09
🔗
|
SketchCow |
it means it's taking a little time. |
22:09
🔗
|
nico_32 |
going to https://archive.org/details/CedricBlancherTribute |
22:09
🔗
|
nico_32 |
tell me to pick a collection |
22:09
🔗
|
SketchCow |
Pick any one. |
22:15
🔗
|
nico_32 |
CHANGING sid.cdx source="" to source="original" |
22:15
🔗
|
nico_32 |
ASSIGNING "sid.cdx" to format "Unknown" |
22:15
🔗
|
nico_32 |
normal ? |
22:22
🔗
|
nico_32 |
hu |
22:22
🔗
|
nico_32 |
i uploaded a the generated cdx file |
22:22
🔗
|
nico_32 |
and ia is regenerating a cdx file |
22:23
🔗
|
DFJustin |
it always does, it's actually kinda pointless to upload a cdx |
22:24
🔗
|
nico_32 |
it will take some time :( |
22:24
🔗
|
nico_32 |
wget generated a 51mb cdx file |
22:32
🔗
|
nico_32 |
okay |
22:32
🔗
|
nico_32 |
task complete |
22:33
🔗
|
nico_32 |
should i delete the cdx i uploaded ? |