#archiveteam 2014-05-10,Sat


Time Nickname Message
02:23 🔗 Stilett0 well, it's a good first step at reconstruction at least: http://www.vogons.org/viewtopic.php?p=347343#p347343
02:24 🔗 Stilett0 but it seems there's some other issues: http://www.vogons.org/viewtopic.php?p=357590#p357590
04:35 🔗 garyrh https://news.ycombinator.com/item?id=7724444
04:44 🔗 DFJustin agree 100% on this https://news.ycombinator.com/item?id=7724469
04:45 🔗 DFJustin I want a desktop BookReader :(
05:59 🔗 yipdw garyrh: we're ArchiveBotting Quora
05:59 🔗 yipdw I'm not sure if it's getting anything but the "you must log in" page
05:59 🔗 yipdw but it does seem to be getting something
06:00 🔗 yipdw and in the case that it hasn't, we've wasted 75 GB of their bandwidth, which I guess is pretty cool
06:41 🔗 Spring Hey, does anyone know what the most recent backup of userscripts.org is?
06:41 🔗 Spring It's been down for the past 5 days
06:44 🔗 DFJustin archivebot did a grab in november but it didn't finish completely https://archive.org/download/archiveteam_archivebot_go_004/userscripts.org-inf-20131103-142850-aborted.warc.gz
06:46 🔗 Spring There have been no official news about the downtime, and I'm concerned about the site as it's the main source of userscripts online.
07:47 🔗 exmic yipdw: I wonder if quora would notice an archivebot-equivalent scraper running while logged in
07:59 🔗 Nemo_bis did someone archive the sugarcrm forums?
15:08 🔗 stefanct hi. i've been told i could get a script to upload a bunch of files to archive.org... i have a collection of flash chip datasheets obtained from various sources
15:08 🔗 stefanct some might have cover pages appended or prepended
18:58 🔗 krisu Userscripts.org has some problems: http://www.ghacks.net/2014/05/09/userscripts-org-good-alternatives/
19:00 🔗 krisu there's an alternative address on port 8080, but the original address hasn't worked for 4 days.
19:02 🔗 krisu And it was down for a few days a few weeks ago, admins are really inactive these days.
19:05 🔗 balrog ugh.......
19:07 🔗 krisu ?
19:46 🔗 DFJustin stefanct: https://github.com/kngenie/ias3upload
19:46 🔗 DFJustin or you can roll your own with https://pypi.python.org/pypi/internetarchive
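A minimal sketch of the "roll your own" route with the `internetarchive` package, assuming one item per PDF and credentials already set up with `ia configure`. The helper names (`make_identifier`, `build_metadata`, `upload_datasheets`), the `datasheet-` identifier prefix, and the metadata fields chosen are illustrative assumptions, not anything specified in the channel. The function defaults to a dry run so the plan can be checked before anything is sent.

```python
import re
from pathlib import Path


def make_identifier(filename, prefix="datasheet"):
    """Derive an item identifier from a file name (hypothetical naming scheme)."""
    stem = Path(filename).stem.lower()
    # Collapse every run of non-alphanumeric characters into a single hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", stem).strip("-")
    return f"{prefix}-{slug}"


def build_metadata(filename, collection=None):
    """Build a small metadata dict; the field choice here is an assumption."""
    meta = {"title": Path(filename).stem, "mediatype": "texts"}
    if collection:
        # Uploading into a specific collection may need extra privileges.
        meta["collection"] = collection
    return meta


def upload_datasheets(directory, collection=None, dry_run=True):
    """Plan (and optionally perform) one upload per PDF in `directory`."""
    plan = []
    for pdf in sorted(Path(directory).glob("*.pdf")):
        identifier = make_identifier(pdf.name)
        metadata = build_metadata(pdf.name, collection)
        plan.append((identifier, str(pdf), metadata))
        if not dry_run:
            # Third-party package, imported lazily so a dry run works without it.
            from internetarchive import upload
            upload(identifier, files=[str(pdf)], metadata=metadata)
    return plan
```

Calling `upload_datasheets("datasheets/")` only returns the list of planned `(identifier, path, metadata)` tuples; passing `dry_run=False` performs the uploads.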
20:08 🔗 Stiletto I want to know where datasheets should be stashed within the archive :)
20:09 🔗 Stiletto it would also be cool if there were some pdf util script to de-spam datasheets taken from sites like datasheetarchive
20:10 🔗 Stiletto which spam up the pdf metadata, insert pages spamming their site, etc :)
20:10 🔗 Stiletto pdfclean? :D
20:13 🔗 DFJustin https://archive.org/details/ic_datasheets or under https://archive.org/details/manuals somewhere
20:14 🔗 dashcloud Stiletto: I have a bunch of datasheets from datasheetarchive, and probably the best idea is to strip all the metadata from the PDF
20:15 🔗 Stiletto I have a bunch from datasheetarchive too ;)
20:15 🔗 Stiletto DFJustin knows :D
20:16 🔗 dashcloud it's a shame, because otherwise they are a great site to get datasheets from
20:18 🔗 dashcloud Stiletto: although if you or someone else has a way to grab everything from there, metadata could always be added later
20:18 🔗 Stiletto dashcloud: I know, I talk to their admin occasionally through email
20:19 🔗 Stiletto dashcloud: i am working on inserting myself into their good graces...
20:19 🔗 Stiletto dashcloud: ???
20:19 🔗 Stiletto dashcloud: PROFIT!
20:20 🔗 Stiletto :D
20:22 🔗 dashcloud you're doing great work!
20:22 🔗 Stiletto we'll see :)
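The "pdfclean" idea floated above could be sketched roughly as follows, assuming the third-party `pypdf` package and assuming the inserted pages sit at the very start or end of the file. The function names and the `drop_first` / `drop_last` parameters are hypothetical; how many pages to drop has to be decided per source. Copying pages into a fresh writer means the original document info dictionary is not carried over, though pypdf writes its own producer entry.

```python
def pages_to_keep(page_count, drop_first=0, drop_last=0):
    """Return the zero-based page indices that survive after trimming both ends."""
    end = max(drop_first, page_count - drop_last)
    return list(range(drop_first, end))


def clean_pdf(src, dst, drop_first=0, drop_last=0):
    """Write a copy of `src` without the trimmed pages or the original metadata."""
    # Third-party package, imported lazily so the helper above has no dependencies.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(src)
    writer = PdfWriter()
    for index in pages_to_keep(len(reader.pages), drop_first, drop_last):
        writer.add_page(reader.pages[index])
    # The source's metadata is never copied into the new writer.
    with open(dst, "wb") as handle:
        writer.write(handle)
```

For example, `clean_pdf("in.pdf", "out.pdf", drop_first=1)` would drop a single prepended cover page. Spam pages inserted in the middle of a document would need a different detection step that this sketch does not attempt.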
21:57 🔗 Spring Thanks so much for starting the archivebot on userscripts.org, someone found an address that worked after all :)
