#archiveteam 2014-06-18,Wed

↑back Search

Time Nickname Message
01:10 🔗 SketchCow How big is rawporter?
01:10 🔗 garyrh SketchCow, looks like it'll be about 12GB
01:10 🔗 SketchCow Really?
01:11 🔗 * SketchCow looks around
01:11 🔗 * SketchCow gets a box of macaroni and cheese
01:11 🔗 * SketchCow dumps mac and cheese packet on floor
01:11 🔗 SketchCow Here, store it in this
01:11 🔗 garyrh lol
01:16 🔗 dashcloud SketchCow: sure you know this, but somehow Angelfire and Tripod are both still around and maybe even thriving- yet Geocities wasn't able to make it when it was the biggest of the three of them
01:20 🔗 vantec Kinda like this SketchCow? https://imgur.com/StKMhqD
01:21 🔗 garyrh 12 Giga-Bites
01:22 🔗 db48x heh
01:23 🔗 SketchCow vantec: Exactly that
01:27 🔗 dashcloud am I the only person who just found out about UAS (USB-attached SCSI)? http://hansdegoede.livejournal.com/14660.html You can finally get the full performance of a disk/SSD when hooked up over USB3 (provided you have an enclosure that supports UAS)
01:28 🔗 SN4T14 dashcloud, only really useful for SSDs, you're never going to have USB3 bottlenecking a hard drive. ;)
02:47 🔗 SketchCow http://discimage.tumblr.com/
02:47 🔗 SketchCow Curated disc images
02:54 🔗 db48x shiny
02:54 🔗 SN4T14 Literal. :p
02:54 🔗 db48x also, slightly dizzying
03:00 🔗 joepie91_ ooo, Cultures!
05:46 🔗 SketchCow https://www.youtube.com/watch?v=sKIOqJns5N8
05:46 🔗 SketchCow Someone youtube dl that before it dies
05:48 🔗 DFJustin got it
05:50 🔗 SketchCow Thank youuu
06:08 🔗 midas boo they closed the S3 service it seems
06:09 🔗 midas i have 23874 of 39685 of the items
06:09 🔗 garyrh what?!
06:10 🔗 garyrh i still see it up
06:10 🔗 garyrh e.g. http://rawporter.s3.amazonaws.com/uploads/it86ue4m83dphc.flv
06:10 🔗 midas yeah, i got a forbidden
06:10 🔗 midas lemme check why
06:11 🔗 garyrh i'm at 34604/39685
06:15 🔗 db48x did you pull in random order?
06:15 🔗 midas it crashes in the AWOL folder
06:16 🔗 midas might just skip that one
06:16 🔗 db48x or, since there are two of you, did one of you reverse your traversal?
06:17 🔗 midas mine has some hate now, error 418, 416
06:17 🔗 midas first have to drive to work again
06:46 🔗 SketchCow If an archive team member in England wants to go to this, I can help pump up your proposal. http://failureinthearchives.wordpress.com/
07:45 🔗 schbirid someone please mirror the torrents from http://chriswhong.com/open-data/foil_nyc_taxi/ to archive.org. highlight me _after_ you did. thanks!
07:46 🔗 schbirid i mean to contents of the torrent, not the .torrent files of course ;D
07:48 🔗 Nemo_bis schbirid: what's the difference? archive.org downloads the torrent content if you upload the torrent, is that not good enough?
07:48 🔗 schbirid Nemo_bis: i had no idea, that's crazy
07:48 🔗 * Nemo_bis now wonders if the highlight request was respected
07:49 🔗 schbirid heh
07:49 🔗 schbirid let me try that
07:49 🔗 Nemo_bis ok
07:54 🔗 schbirid let's see what happens https://archive.org/details/nycTaxiTripData2013
07:58 🔗 deathy mm... that looks interesting
08:10 🔗 deathy I'm uploading to IA torrent client :D btw schbirid did you add both of the torrent files?
08:25 🔗 db48x someone who isn't going to sleep could grab a copy of http://delimiter.com.au/2014/06/18/delimiter-coming-natural-end/
08:29 🔗 garyrh what, just natural? not an organic, free range, non-gmo ending?!
08:33 🔗 db48x apparently
08:34 🔗 garyrh Cameron_D just put delimiter into archivebot
08:35 🔗 db48x good
08:36 🔗 Cameron_D Yeah, looks like the site will stick around but won't be updated, but still worth grabbing
09:02 🔗 schbirid deathy: yeah, both in one to see what happens
09:11 🔗 Nemo_bis deathy: I'm not sure two torrents work, IIRC it was necessary to give the torrent the same name as the item
09:12 🔗 Nemo_bis ah no, it seems it's done with the first and 20 % with the second :) https://catalogd.archive.org/log/316848601
09:12 🔗 schbirid sweet :))
09:13 🔗 Nemo_bis what a leecher! 55m18s | .. Percent Done: 93.3% Peers: ^ 1.37 MB/s to 6, v 4.08 MB/s from 13, of 14 (Ratio: 0.34)
12:43 🔗 schbirid Nemo_bis: the files were downloaded but they are not listed https://archive.org/details/nycTaxiTripData2013 :\
12:46 🔗 Nemo_bis schbirid: that's normal because you chose mediatype text, they're in https://ia802501.us.archive.org/1/items/nycTaxiTripData2013/
13:12 🔗 schbirid Nemo_bis: it did that all by itself. i used the browser uploader and even let the collection at "media" by default
13:56 🔗 godane so i maybe able to get video from here: http://www.click2houston.com/sitemap/video-20110701.xml
13:56 🔗 godane i couldn't use youtube-dl
13:57 🔗 godane but i grab the video link thru httpfox and here is the link to the first video: http://ib141804.ib-prod.com/p/557781/sp/55778100/serveFlavor/entryId/0_8u87aii9/v/1/flavorId/0_gspvcjay/name/a.flv
13:59 🔗 godane based on want i can tell http://ib141804.ib-prod.com/p/557781/sp/55778100/serveFlavor/entryId/ maybe in every url
14:00 🔗 godane you then take the part at the end of the video:player_loc url: 0_8u87aii9
14:01 🔗 godane *video : player_loc
14:04 🔗 godane looks like the stuff between flavorid and /name/ is not in the xml
14:28 🔗 midas http://www.marketwired.com/press-release/blippar-acquires-layar-creating-worlds-largest-ar-userbase-1921802.htm
14:29 🔗 midas Blippar buys Layar
16:48 🔗 joepie91_ http://www.securitycurrent.com/en/writers/richard-stiennon/cloudflare-acquires-cryptoseal
16:50 🔗 midas DDoS ALL THE VPNS!
16:52 🔗 exmic woop woop woop off-topic siren
16:56 🔗 joepie91_ exmic: your siren is sensitive today :P
17:11 🔗 SketchCow It's true, though
17:35 🔗 db48x mmm, delicious roast beef on sourdough
17:38 🔗 godane SketchCow: i'm starting to upload Bobby Blackwolf Show: https://archive.org/search.php?query=creator%3A%22Bobby%20Blackwolf%20Show%22&sort=-publicdate
17:38 🔗 godane i need to use dos2unix just to get the xml data to upload
19:58 🔗 garyrh rawporter is shaping up to be 30GB+
21:14 🔗 garyrh i'm gonna have to stop my rawporter grab, my estimate is that it's going to be >50GB, which i can't do right now
21:15 🔗 garyrh so the ones i haven;t grabbed are tail -n+35900 urlList.txt
21:15 🔗 garyrh *haven't
21:31 🔗 midas mine is still running, have some 600GB free on that box
21:31 🔗 garyrh great!
22:02 🔗 joepie91_ okay
22:02 🔗 joepie91_ panic
22:02 🔗 joepie91_ http://freecode.com/about
22:02 🔗 SN4T14 freecode.com?
22:02 🔗 joepie91_ looks like it's going to require urgent saving
22:02 🔗 joepie91_ this is pretty much a notice of death
22:02 🔗 joepie91_ "we put the site on static mode"
22:02 🔗 joepie91_ "because not much happening"
22:03 🔗 joepie91_ "The site contents have been retained in this static state as a continued path to access the linked software, much of which is on self-hosted servers and would be difficult to find otherwise."
22:03 🔗 joepie91_ cc SketchCow yipdw exmic
22:04 🔗 exmic hmm
22:16 🔗 yipdw joepie91_: oh yeah
22:16 🔗 yipdw I wonder if we can just archivebot it
22:16 🔗 yipdw well, probably not
22:16 🔗 yipdw luckily it has a URL structure that isn't horyshitinsane
22:17 🔗 SN4T14 Just recurive wget it. :D
22:17 🔗 yipdw probably just split it up by project
22:19 🔗 yipdw actually
22:19 🔗 yipdw http://web.archive.org/web/*/http://freecode.com
22:19 🔗 yipdw maybe no action required
22:19 🔗 yipdw yeah, unless someone can show a deficiency in the Wayback grabs, I say let it be
22:20 🔗 yipdw clicking around, this seems pretty complete
22:20 🔗 yipdw oh, some of the download URLs have bad robots.txt rules
22:20 🔗 yipdw ok
22:20 🔗 yipdw so maybe just grab all the download links for starters
22:21 🔗 SN4T14 yipdw, wouldn't those be 90% of the total size anyway?
22:21 🔗 yipdw I don't know, I didn't run a size check
22:21 🔗 joepie91_ there should be a full run anyway
22:21 🔗 joepie91_ for the stuff that is missed but unnoticed
22:21 🔗 joepie91_ (and hey, it's static anyway, heh)
22:22 🔗 yipdw SN4T14: that said, freecode didn't appear to host the downloadable archives, just the project metadata
22:23 🔗 SN4T14 yipdw, then someone here will probably just get a complete archive of it, text and metadata isn't that big. :p
22:24 🔗 yipdw sure, that's fine
22:24 🔗 yipdw I'm just not panicking over it, since the Wayback grabs of it are pretty extensive already
23:52 🔗 db48x you guys should read Constellation Games, if you haven't already

irclogger-viewer