#archiveteam 2016-02-28,Sun

↑back Search

Time Nickname Message
00:10 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
00:11 πŸ”— MMovie has joined #archiveteam
00:12 πŸ”— JesseW johtso: ping ivan` about archiving youtube
00:14 πŸ”— johtso If it involves any kind of manual involvement it's probably not practical, it's a 24 hour live stream :)
00:14 πŸ”— johtso I'm sure they'll archive it..
00:15 πŸ”— HCross johtso, look at using livestreamer and vlc media plater
00:15 πŸ”— HCross player
00:34 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
00:35 πŸ”— MMovie has joined #archiveteam
00:37 πŸ”— hictooth I was wondering what the status of archiving fanfiction.net is? According to http://www.archiveteam.org/index.php?title=FanFiction.Net it's being saved, but I can't find out by who or where to.
00:38 πŸ”— MrRadar Last September someone scraped every story from it and put it up as a torrent.
00:39 πŸ”— MrRadar (Though they just saved each one as plain text, so maybe not the best job)
00:39 πŸ”— MrRadar You can find the magnet link here: https://www.reddit.com/r/DataHoarder/comments/3jl3qm/nearly_complete_archive_of_fanfictionnet/
00:39 πŸ”— hictooth So it's not being actively archived now?
00:40 πŸ”— MrRadar Not by us, as far as I know
00:43 πŸ”— SimpBrain got tagged as saved
00:44 πŸ”— JesseW Hm, probably should be changed to {{partiallysaved}} in that case
00:44 πŸ”— JesseW Do you know if anyone tossed the torrent onto IA?
00:44 πŸ”— MrRadar I did
00:45 πŸ”— MrRadar https://archive.org/details/fanfiction.net_2015_09
00:46 πŸ”— JesseW ah, cool -- please do add that link to the wiki page
00:48 πŸ”— JesseW also, if/when you get a chance, you could turn the link from the item description into a clickable link.
00:49 πŸ”— MrRadar How do you do that?
00:49 πŸ”— ndiddy has quit IRC (Read error: Operation timed out)
00:50 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
00:51 πŸ”— MMovie has joined #archiveteam
00:52 πŸ”— JesseW MrRadar: the description can use HTML
00:52 πŸ”— MrRadar OK, I didn't know that
00:52 πŸ”— JesseW i.e. <a href="http://blah.com">http://blah.com</a>
00:52 πŸ”— JesseW yeah, one of many IA hidden features. :-)
00:52 πŸ”— JesseW IDK what sanatizing they do (not much, I'd guess)
00:54 πŸ”— JesseW Example (just tested): https://archive.org/details/fav-jesse_w
00:55 πŸ”— MrRadar It looks like we did a scrape of the site back in 2014. Where did that data end up?
00:55 πŸ”— MrRadar I'd like to add a link to it too
00:55 πŸ”— robink has quit IRC (Ping timeout: 190 seconds)
00:56 πŸ”— JesseW Hm, this is ... mildly alarming: https://catalogd.archive.org/log/466976952
00:56 πŸ”— JesseW Apparently the lack of sanatizing they do extends to these pages. :-)
00:56 πŸ”— MrRadar Oh, yeah, that's bad
00:57 πŸ”— MrRadar I'd hit info@
00:57 πŸ”— robink has joined #archiveteam
00:57 πŸ”— MrRadar They probably just need to put an htmlspecialchars() call around the output
01:00 πŸ”— MrRadar Hmm. It looks like that Fanfiction.net scrape was also uploaded by its original creator.
01:00 πŸ”— robink has quit IRC (Ping timeout: 190 seconds)
01:00 πŸ”— MrRadar Now that I do a search for it
01:01 πŸ”— MrRadar The only difference is that mine has the original torrent file
01:01 πŸ”— MrRadar Does the IA dedup by hash? I'd hate to have them storing this data 4 times
01:01 πŸ”— robink has joined #archiveteam
01:02 πŸ”— bsmith093 MrRadar: that was me, i think, and it now has an inventory file
01:02 πŸ”— yipdw they're not storing it 4 times
01:02 πŸ”— yipdw they're storing it 8 times
01:02 πŸ”— yipdw at least
01:02 πŸ”— robink has quit IRC (Remote host closed the connection)
01:03 πŸ”— yipdw it would be interesting to see if any of those copies have the original version of Fifty Shades of Grey
01:03 πŸ”— yipdw if none of them do that is a serious mark against all ofu s
01:03 πŸ”— MrRadar bsmith093, this one: https://archive.org/details/FanfictionNearlyCompleteArchive ?
01:03 πŸ”— bsmith093 MrRadar: yes, thst
01:03 πŸ”— bsmith093 that
01:04 πŸ”— MrRadar Haha, I should have searched first before uploading it
01:04 πŸ”— MrRadar Did you create that scrape originally?
01:04 πŸ”— bsmith093 MrRadar: which one did you do
01:04 πŸ”— MrRadar I uploaded it to https://archive.org/details/fanfiction.net_2015_09
01:04 πŸ”— bsmith093 MrRadar: using fanficfare running through a list of all id numbers
01:05 πŸ”— MrRadar OK
01:05 πŸ”— bsmith093 you many want the inventory file to be able to search that
01:05 πŸ”— MrRadar How should I credit you in my copy of the upload?
01:06 πŸ”— bsmith093 list the other link. it's fine i just used another project somebody else built to scrape all of it
01:07 πŸ”— bsmith093 MrRadar: while we're on the subject, how the hell do i extract one file from this archive. should it be taking forever?
01:08 πŸ”— MrRadar Unlink zip and 7z files, TAR files don't have a catalog so if you want to extract a file it has to scan through the whole thing
01:08 πŸ”— MrRadar To find it
01:08 πŸ”— bsmith093 ugh
01:10 πŸ”— bsmith093 MrRadar: if you want to rebuild it into a 7z file, you can, i would really like something i can extract a given file from in less than an hour
01:15 πŸ”— snape yipdw, the original fifty shades was gone from ff by 2010. Wayback machine might have a copy, but it's robots.txt-excluded. There are still copies of it floating around the web tho, if you know the original title.
01:15 πŸ”— yipdw so much for our efforts
01:16 πŸ”— bsmith093 yipdw: if they did'nt throttle so hard, we could have gotten all of it in like a week
01:20 πŸ”— yipdw I kid
01:20 πŸ”— yipdw it's just that I've encountered situations where it's like "oh I wonder if we got that" and yet in the terabytes we drag in daily
01:20 πŸ”— yipdw nope
01:21 πŸ”— yipdw this happens sometimes in archivebot crawls
01:21 πŸ”— yipdw maybe that's just what happens when you deal with something as ineffably huge as the web
01:21 πŸ”— MrRadar Updated the Fanfiction.net page with references to the AT's 2012 scrape and bsmith093's 2015 one
01:21 πŸ”— dserodio has quit IRC (Read error: Operation timed out)
01:21 πŸ”— snape If it's any consolation, we likely have good representative samples of the early age of dinosaur erotica, and... whatever the next terrible trend will be.
01:22 πŸ”— yipdw ARCHIVE TEAM TRENDSETTIN' 2016 \m/
01:23 πŸ”— dxrt simply amazing
01:23 πŸ”— bsmith093 snape:well, thank FSM we have that! ;)
01:24 πŸ”— dserodio has joined #archiveteam
01:27 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
01:27 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
01:28 πŸ”— MMovie has joined #archiveteam
01:28 πŸ”— JesseW OK, reported the lack of escaping to info@
01:30 πŸ”— dashcloud has joined #archiveteam
01:31 πŸ”— bsmith093 JesseW: I checked that log, what was wrong?
01:32 πŸ”— JesseW bsmith093: note the 2nd time the description was shown, after [description] =>
01:32 πŸ”— JesseW It uses the actual HTML, not escaped (as it is above)
01:32 πŸ”— JesseW (and again below, by "with value:"
01:33 πŸ”— bsmith093 oh, i see it now
01:33 πŸ”— bsmith093 Also, is there an archive format that stores an index, because apparently tar doesn't
01:35 πŸ”— HCross2 cdx
01:35 πŸ”— MrRadar Well, that's for WARCs
01:35 πŸ”— MrRadar For general files .zip or .7z are the go-to
01:36 πŸ”— bsmith093 is there a thing i can use without having to un- and re-compress 300GB of files?
01:37 πŸ”— MrRadar Not really. Part of it is also that if your .tar is also gzipped you would need to decompress the entire gzip stream up to each file
01:37 πŸ”— MrRadar Even if you had an index
01:37 πŸ”— MrRadar .tar.gz is not designed for random access
01:38 πŸ”— bsmith093 anyone feel like being awesome? when i created that file, i thought it would actually be searchable easily.
01:39 πŸ”— snape On a related note, I wonder if there's a list somewhere of the oldest continuously-active porn sites. The oldest one I could remember, from 2000, seems to have disappeared sometime last year. :/
01:41 πŸ”— JesseW snape: there may have been such a list on Wikipedia -- although it quite likely has been deleted by now; but if you look through old revisions, you may be able to find it.
01:42 πŸ”— JesseW bsmith093: once I get done with the IA census (which should be pretty soon -- mostly just waiting on jjake uploading the results) I'm glad to recompress the fanfiction tarball as a zip.
01:42 πŸ”— JesseW I have a good pipe, and enough free space.
01:47 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
01:47 πŸ”— bsmith093 JesseW: thanks so much! BTW when I made the file, every single gui tool was choking on a folder that big, and i didn't actually have the space to sstore the final tar, so i compressed on the fly to fos. when i created the file, apparently i pushed in the whole path of the folder, so when you rebuild it, could you start with the Fanfiction folder, b
01:47 πŸ”— bsmith093 uried in home/Desktop etc. that was my bad.
01:49 πŸ”— JesseW yeah
01:49 πŸ”— bsmith093 any way i have plenty of space now, mostly because i finally dumped the uncompressed files, and thats how i got started looking for omething to search a tar file
01:49 πŸ”— JesseW I think my debian box should be OK handling it.
01:49 πŸ”— JesseW Thank you for babysitting the script to make it!
01:50 πŸ”— dashcloud has joined #archiveteam
01:51 πŸ”— JesseW MrRadar: I improved the link to the 2012 scrape.
01:51 πŸ”— bsmith093 np, i wasn't doing much anyway! also the inventory file is here, and you'll see the problem immediately https://archive.org/download/FanfictionNearlyCompleteArchive/inventory.txt
01:51 πŸ”— MrRadar Thanks, JesseW
01:52 πŸ”— JesseW argh, the *inventory* is nearly 800MB!
01:53 πŸ”— JesseW The pipe I'm on right /now/ isn't so good -- I'll download that later. :-)
01:55 πŸ”— bsmith093 JesseW:hey i had to leave that uncompressed, the whole point is so google can find it.
02:10 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
02:12 πŸ”— MMovie has joined #archiveteam
02:26 πŸ”— bsmith093 has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client)
02:35 πŸ”— JesseW has quit IRC (Quit: Leaving.)
02:37 πŸ”— mafrasi2_ has quit IRC (Read error: Connection reset by peer)
02:38 πŸ”— JesseW has joined #archiveteam
02:38 πŸ”— mafrasi2 has joined #archiveteam
02:38 πŸ”— yipdw_ has joined #archiveteam
02:41 πŸ”— yipdw has quit IRC (Ping timeout: 506 seconds)
02:54 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
02:55 πŸ”— MMovie has joined #archiveteam
03:00 πŸ”— JesseW has quit IRC (Quit: Leaving.)
03:05 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
03:06 πŸ”— MMovie has joined #archiveteam
03:11 πŸ”— tomwsmf-a has quit IRC (Read error: Operation timed out)
03:24 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
03:25 πŸ”— MMovie has joined #archiveteam
03:42 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
03:43 πŸ”— Boppen has joined #archiveteam
03:43 πŸ”— MMovie has joined #archiveteam
03:48 πŸ”— bwn has quit IRC (Ping timeout: 492 seconds)
03:58 πŸ”— ndiddy has joined #archiveteam
04:03 πŸ”— xXx_ndidd has joined #archiveteam
04:12 πŸ”— xXx_ndidd has quit IRC (Read error: Connection reset by peer)
04:13 πŸ”— xXx_ndidd has joined #archiveteam
04:16 πŸ”— Boppen has quit IRC (Ping timeout: 200 seconds)
04:16 πŸ”— ndiddy has quit IRC (Read error: Operation timed out)
04:21 πŸ”— JesseW has joined #archiveteam
04:26 πŸ”— JesseW A games database that is looking for a home -- http://forum.kodi.tv/showthread.php?tid=261575 someone should suggest archive.org for them.
04:34 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
04:35 πŸ”— JesseW Nemo_bis: it looks like the last wikiteam dump of the archiveteam wiki was in october 2015 -- could you make another one?
04:36 πŸ”— JesseW (I'm asking you because you are listed as the uploader for https://archive.org/details/wiki-archiveteamorg )
04:36 πŸ”— MMovie has joined #archiveteam
04:38 πŸ”— bsmith093 has joined #archiveteam
04:49 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
04:50 πŸ”— MMovie has joined #archiveteam
05:07 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
05:08 πŸ”— MMovie has joined #archiveteam
05:10 πŸ”— xXx_ndidd has quit IRC (Read error: Connection reset by peer)
05:21 πŸ”— Sk1d has quit IRC (Ping timeout: 250 seconds)
05:24 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
05:25 πŸ”— MMovie has joined #archiveteam
05:30 πŸ”— Sk1d has joined #archiveteam
05:41 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
05:43 πŸ”— MMovie has joined #archiveteam
05:45 πŸ”— myself has joined #archiveteam
05:46 πŸ”— myself yo bitches
05:46 πŸ”— myself http://www.ridethemindway.com/phones/
05:47 πŸ”— myself no idea where that came from or how long it'll be up, but something tells me, not forever
05:47 πŸ”— myself has quit IRC (Client Quit)
05:50 πŸ”— aksel has joined #archiveteam
05:50 πŸ”— aksel Why did stypi shut down
05:50 πŸ”— aksel ?
05:50 πŸ”— aksel Hello?
05:50 πŸ”— aksel Is anyone here?
05:50 πŸ”— aksel Why did Code.Stypi Shutdown
05:51 πŸ”— aksel .
05:51 πŸ”— aksel has quit IRC (Client Quit)
06:01 πŸ”— atank1 has joined #archiveteam
06:01 πŸ”— atank1 hello
06:03 πŸ”— JesseW I'd never heard of Code.Stypi before.
06:03 πŸ”— atank1 Code.stypi.com was a online source project that allowed programming languages on docs that you could edit with friends
06:04 πŸ”— JesseW apparently I forgot it, as I created a wiki page for it back in Aug 2015: http://archiveteam.org/index.php?title=Stypi&action=history
06:05 πŸ”— atank1 Wait you work with the the team?\
06:05 πŸ”— JesseW It doesn't look like we made any specific effort to save it.
06:05 πŸ”— atank1 What happend to it?
06:06 πŸ”— atank1 Why did they shut it down?
06:06 πŸ”— atank1 Do you know why they shut Code.Stypi.com Down?
06:06 πŸ”— JesseW Apparently whoever was running it decided to stop. I don't see any farewell notice, although apparently there was something saying it was going to die on Sept 3, 2015.
06:07 πŸ”— atank1 oh
06:07 πŸ”— atank1 is there an Archive with the websites source? like the code i would love to bring it back as my students are quite depressed.
06:08 πŸ”— VADemon has quit IRC (Quit: left4dead)
06:08 πŸ”— JesseW I don't think so. It doesn't look like the source for it was available.
06:09 πŸ”— atank1 I decided to try and talk to somoene, I know it went down a while ago but. i want to try and get it back.
06:09 πŸ”— atank1 Shit.
06:09 πŸ”— atank1 Well uhh
06:09 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
06:09 πŸ”— atank1 do you know who created it?
06:09 πŸ”— JesseW I think there are alternatives, though.
06:09 πŸ”— atank1 Such as?
06:09 πŸ”— JesseW Collaborative text editing, certainly -- and I think collaborative programming too. Not sure of names offhand, though.
06:10 πŸ”— MMovie has joined #archiveteam
06:10 πŸ”— JesseW I'm digging around in the Wayback Machine copy, to see if I can dig up any relevant contact info.
06:11 πŸ”— JesseW You can do the same.
06:11 πŸ”— atank1 ?
06:11 πŸ”— JesseW e.g. https://web.archive.org/web/20130514113228/https://www.stypi.com/press
06:11 πŸ”— atank1 ._.
06:12 πŸ”— atank1 I was asking around and someone said they would sell the source code for a couple thousand
06:12 πŸ”— JesseW It looks like they were owned by Salesforce. So that'd be who you should contact.
06:12 πŸ”— atank1 Bullshit lol
06:12 πŸ”— JesseW Please let us know if you have any luck.
06:12 πŸ”— atank1 Ok
06:12 πŸ”— JesseW but they were aquired back in 2012
06:12 πŸ”— JesseW so it wasn't the aquasition that killed them
06:13 πŸ”— JesseW and this gives their address (back in 2012) https://web.archive.org/web/20150320123654/https://code.stypi.com/privacy
06:14 πŸ”— atank1 apparently someone i know attally knows where a source can be located
06:14 πŸ”— JesseW neat!
06:14 πŸ”— JesseW if you get a hold of it, please upload a copy to the Internet Archive
06:15 πŸ”— JesseW Their tweets (all 42 of them) are hidden: https://twitter.com/stypi
06:19 πŸ”— JesseW according to https://www.technologyreview.com/s/425690/google-wave-reincarnated/ the founders names were: Byron Milligan and Jason Chen.
06:20 πŸ”— atank1 ?
06:20 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
06:20 πŸ”— JesseW You could try emailing/tweeting at them.
06:22 πŸ”— Stiletto has joined #archiveteam
06:22 πŸ”— MMovie has joined #archiveteam
06:26 πŸ”— atank1 ?
06:26 πŸ”— atank1 grr
06:28 πŸ”— JesseW ?
06:28 πŸ”— atank1 He lied to me
06:29 πŸ”— JesseW your contact who said they had a copy of the source code? damm, that sucks
06:29 πŸ”— atank1 Yep
06:30 πŸ”— JesseW I can't say I'm surprised, but I'm sorry to hear it.
06:34 πŸ”— atank1 Attually
06:34 πŸ”— atank1 Since all my students robotic programming is gone...
06:34 πŸ”— atank1 well
06:35 πŸ”— atank1 i guess i just have to break the news
06:35 πŸ”— atank1 they wanted me to grab their code for them
06:36 πŸ”— atank1 oh dear
06:36 πŸ”— atank1 i hope this does not get me fired
06:37 πŸ”— JesseW I hope not. That's an awful place to get stuck in.
06:37 πŸ”— JesseW Beware The Cloud.
06:37 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
06:37 πŸ”— JesseW Always Have Multiple Local Backups
06:38 πŸ”— dashcloud has joined #archiveteam
06:39 πŸ”— atank1 We did
06:40 πŸ”— atank1 but last week the Servers got attacked by Crpytowall
06:40 πŸ”— JesseW OH FUCK. That REALLY sucks.
06:40 πŸ”— atank1 ik
06:41 πŸ”— JesseW BTW, thank you *VERY MUCH* for telling your story here. It's shit like this that demonstrates why what we do (or failed to do in this case) is important.
06:42 πŸ”— atank1 anyways imma go
06:42 πŸ”— bsmith093 JesseW: not to distract from the suck of cryptowall, but did you ever get that tar --> zip thing going?
06:43 πŸ”— atank1 fuck im lagging my dads internet
06:43 πŸ”— atank1 i should get off
06:43 πŸ”— bsmith093 atank1: hope it goes well
06:43 πŸ”— atank1 My dad aint happy
06:43 πŸ”— JesseW It'll be a few days -- I want to keep my new IA census workstuff around until jjake gets the census stuff uploaded.
06:44 πŸ”— atank1 I borrow the internet from my dad next door lol
06:44 πŸ”— bsmith093 JesseW: k then, thanks
06:44 πŸ”— atank1 cya
06:44 πŸ”— JesseW bsmith093 but I should be able to start downloading the file at least
06:44 πŸ”— atank1 has quit IRC ()
06:44 πŸ”— JesseW Well, that was a dammed sob story. :-(
06:44 πŸ”— bsmith093 yeah that sucks, who *lies* about having source code?
06:51 πŸ”— yipdw_ "Most importantly, Stypi will continue to be the Stypi you know. Our users will continue to have access to this great service, community, and innovation."
06:51 πŸ”— yipdw_ nice
06:51 πŸ”— JesseW Where is that from?
06:51 πŸ”— yipdw_ https://web.archive.org/web/20130514120823/http://blog.stypi.com/
06:52 πŸ”— yipdw_ more specifically https://web.archive.org/web/20130325111746/http://blog.stypi.com/2012/05/stypi-joins-salesforce-com/
06:52 πŸ”— JesseW well, it wasn't being bought that killed them -- they didn't die till 3 years later.
06:53 πŸ”— JesseW bsmith093: fanfiction download started.
06:53 πŸ”— yipdw_ true but Bram Mooleenar switching jobs to Google didn't kill vim 3 years later
06:54 πŸ”— bsmith093 JesseW: thanks. do, Stypi didn't actually say they were dumping anything. they usually make a point of that.
06:54 πŸ”— bsmith093 startups in general, i mean
06:54 πŸ”— JesseW and that illustrates the difference between a piece of FOSS standalone software and a proprietary service
06:55 πŸ”— yipdw_ I mean it's nothing that nobody in here doesn't know
06:55 πŸ”— JesseW bsmith093: they did say they were deleting it: "All documents that have not been downloaded to an archive by that time will be deleted. "
06:55 πŸ”— yipdw_ it really sucks for atank1 and unless someone here happens to have an archive they may just be out of luck
06:56 πŸ”— JesseW bsmith093: ETA on the fanfict download is 2days, 5 hours. :-)
06:56 πŸ”— bsmith093 JesseW: just curious, what isp do you have?
06:57 πŸ”— bsmith093 theres also a torrent file
06:58 πŸ”— JesseW bsmith093: Wave G in Seattle
06:59 πŸ”— JesseW Hm, I suppose I'll switch over to the torrent.
06:59 πŸ”— bsmith093 JesseW: never heard of them, any good?
06:59 πŸ”— JesseW They used to be called CondoInternet
07:00 πŸ”— JesseW I've been very happy with them. Only complaint is that they keep sending me junk mail offering me a discount to sign up -- after I've already signed up. :-)
07:00 πŸ”— bsmith093 how fast
07:01 πŸ”— bsmith093 also will someone with ops please add http://www.ridethemindway.com/phones/ to archivebot yipdw_ SketchCow ersi
07:01 πŸ”— JesseW bsmith093 already in there
07:02 πŸ”— JesseW check the dashboard
07:02 πŸ”— JesseW bsmith093: I pay for 100 Mbps
07:02 πŸ”— bsmith093 great, it looks like 80s era phone docs
07:02 πŸ”— bsmith093 those are rare
07:03 πŸ”— bsmith093 so do i, new isp, faster than twc, for much cheaper
07:03 πŸ”— JesseW It's already grabbed 5 GB, with about 1000 files to go
07:03 πŸ”— JesseW I'm just delighted to not have to deal with the big ISPs. Those folks are simply unpleasant to deal with.
07:05 πŸ”— bsmith093 mine's been down once since i got them, like ~2 years ago, for maybe a half hour, when i called them, they ACTUALLY TOLD ME WHY!!
07:05 πŸ”— JesseW that is excellent, yeah
07:07 πŸ”— bsmith093 JesseW: also grab the inventory file, that might not be int the torrent yet, i don't know how fast that updates
07:09 πŸ”— JesseW It's not in the torrent I'm using (with hash d934709d1c7f1bf26d826718804de5f7a53757dc)
07:10 πŸ”— bsmith093 it's on the page though.
07:10 πŸ”— JesseW I know. I'll grab it afterward -- I mean, I can regenerate it myself once I have the data. :-)
07:13 πŸ”— JesseW The torrent ETA is 1 day, 19 hours right now.
07:13 πŸ”— JesseW or 1 day, 14 hours.
07:27 πŸ”— vitzli has joined #archiveteam
07:32 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
07:34 πŸ”— MMovie has joined #archiveteam
07:37 πŸ”— JesseW has quit IRC (Quit: Leaving.)
07:43 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
07:45 πŸ”— DFJustin has quit IRC (Remote host closed the connection)
07:46 πŸ”— dashcloud has joined #archiveteam
07:50 πŸ”— metalcamp has joined #archiveteam
08:00 πŸ”— DFJustin has joined #archiveteam
08:00 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
08:01 πŸ”— MMovie has joined #archiveteam
08:07 πŸ”— brayden has quit IRC (Read error: Connection reset by peer)
08:07 πŸ”— brayden has joined #archiveteam
08:17 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
08:18 πŸ”— MMovie has joined #archiveteam
08:36 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
08:38 πŸ”— MMovie has joined #archiveteam
08:54 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
08:56 πŸ”— MMovie has joined #archiveteam
09:07 πŸ”— Tomcat_ has joined #archiveteam
09:19 πŸ”— Burak How can I add into wget -H -D domains, that looks like this - imagesX.fotosik.pl, where X is number from 1 to 99?
09:25 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
09:27 πŸ”— MMovie has joined #archiveteam
09:29 πŸ”— Zei-Pii has joined #archiveteam
09:38 πŸ”— vitzli has quit IRC (Leaving)
09:41 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
09:42 πŸ”— MMovie has joined #archiveteam
09:48 πŸ”— bwn has joined #archiveteam
09:58 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
09:58 πŸ”— MMovie has joined #archiveteam
10:04 πŸ”— hictooth has quit IRC (Ping timeout: 255 seconds)
10:13 πŸ”— joepie91 https://twitter.com/trulloapp/status/702225155464900608
10:13 πŸ”— joepie91 "Thank you all for your support & app love. Sadly, we're shutting down Trullo. We are grateful for your contributions and we'll miss you. :("
10:13 πŸ”— joepie91 well
10:13 πŸ”— joepie91 looks like they didn't waste any time
10:13 πŸ”— joepie91 DNS doesn't resolve anymore
10:14 πŸ”— zhongfu wow
10:14 πŸ”— joepie91 https://www.producthunt.com/tech/trullo
10:16 πŸ”— joepie91 well
10:16 πŸ”— joepie91 this has got to be one of the most dickish shutdowns
10:16 πŸ”— joepie91 I think they just beat Yahoo
10:16 πŸ”— joepie91 shutdown announcement with no notice
10:16 πŸ”— joepie91 DNS gone 5 days later
10:22 πŸ”— hictooth has joined #archiveteam
10:23 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
10:23 πŸ”— PurpleSym Is there some kind of β€œDNS archive” you could go back to, fetch the IP address and see if the server still works?
10:23 πŸ”— hictooth has quit IRC (Client Quit)
10:25 πŸ”— MMovie has joined #archiveteam
10:25 πŸ”— joepie91 PurpleSym: potentially, sec
10:26 πŸ”— joepie91 PurpleSym: https://www.robtex.com/?dns=trullo.com
10:26 πŸ”— joepie91 so yeah
10:26 πŸ”— joepie91 52.24.188.223 and 52.24.194.8
10:27 πŸ”— PurpleSym Also: https://dnshistory.org/dns-records/trullo.com
10:27 πŸ”— joepie91 AWS
10:27 πŸ”— joepie91 IPs non-responsive
10:27 πŸ”— joepie91 PurpleSym: robtex is more complet e:P
10:28 πŸ”— PurpleSym Indeed.
10:28 πŸ”— joepie91 I <3 robtex
10:28 πŸ”— joepie91 the NSA does too, for obvious reasons
10:29 πŸ”— PurpleSym We should get a copy.
10:29 πŸ”— PurpleSym Anyway, was worth a shot…
10:30 πŸ”— joepie91 PurpleSym: copy of?
10:32 πŸ”— PurpleSym Robtex.
10:32 πŸ”— joepie91 heh.
10:32 πŸ”— joepie91 robtex is big
10:32 πŸ”— joepie91 I still need to eventually talk to the guy and see if some kind of feed can be negotiated
10:32 πŸ”— joepie91 he's supposedly working on an API for 'qualified organizations' but it's not entirely clear to me what that would mean
10:33 πŸ”— joepie91 but
10:33 πŸ”— joepie91 -bs
10:45 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
10:46 πŸ”— MMovie has joined #archiveteam
11:03 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
11:04 πŸ”— MMovie has joined #archiveteam
11:16 πŸ”— winterfox has quit IRC (Remote host closed the connection)
11:21 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
11:23 πŸ”— MMovie has joined #archiveteam
11:41 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
11:43 πŸ”— MMovie has joined #archiveteam
11:57 πŸ”— schbirid has joined #archiveteam
11:59 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
12:00 πŸ”— MMovie has joined #archiveteam
12:13 πŸ”— philpem has joined #archiveteam
12:16 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
12:17 πŸ”— MMovie has joined #archiveteam
12:34 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
12:35 πŸ”— MMovie has joined #archiveteam
12:53 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
12:54 πŸ”— arkiver Is someone interested in scannig FTPs for the FTP project?
12:54 πŸ”— arkiver Instructions and FTPs are here http://archiveteam.org/index.php?title=FTP/List
12:54 πŸ”— MMovie has joined #archiveteam
12:54 πŸ”— arkiver This is only scanning the FTP and creating a list of items for the grab. This won't take a lot of diskspace.
12:56 πŸ”— arkiver It might take a lot of time, depending on the number of files and the speed of the FTP
13:03 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
13:05 πŸ”— MMovie has joined #archiveteam
13:10 πŸ”— snape has quit IRC (Hey! Where'd my controlling terminal go?)
13:18 πŸ”— HCross Shall I kick a scan off on ftp://ftp.cup.cam.ac.uk - seems to have a lot of stuff on books published by Cambridge University Press
13:20 πŸ”— HCross nvm, its done
13:21 πŸ”— HCross or it isnt
13:36 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
13:37 πŸ”— philpem has quit IRC (Ping timeout: 260 seconds)
13:37 πŸ”— MMovie has joined #archiveteam
13:55 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
13:56 πŸ”— MMovie has joined #archiveteam
14:13 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
14:14 πŸ”— PurpleSym arkiver: Would you accept the output of `ncftpls -R`?
14:15 πŸ”— arkiver I'm not sure what kind of output that gives
14:15 πŸ”— MMovie has joined #archiveteam
14:15 πŸ”— arkiver URL and filesize?
14:15 πŸ”— arkiver if yes, then I can convert it, if no, then no
14:16 πŸ”— arkiver Though using the ftp-queue scripts would be best for this (scripts will be more optimized in the future)
14:16 πŸ”— PurpleSym Output looks like this: http://pastebin.com/t6vcPHbD
14:17 πŸ”— arkiver Looks like URL can be generated and size is also there
14:17 πŸ”— arkiver so I should be able to convert it
14:17 πŸ”— arkiver Why would you use that command rather then the ftp-queue script?
14:18 πŸ”— PurpleSym I don’t see why a script needed to be written for that in the first place :)
14:19 πŸ”— arkiver The script wil make sure only new files are size changed files are added to the itemlists
14:20 πŸ”— arkiver Previously scanned FTPs are also in /archive/, so they can be used for that if they are scanned again
14:20 πŸ”— arkiver It creates smaller lists of 200 MB of FTP files
14:20 πŸ”— PurpleSym `cat old new | sort | uniq`
14:21 πŸ”— arkiver Also tests for the server response if a file or folder does not exist
14:30 πŸ”— megaminxw has quit IRC (Quit: Leaving.)
14:30 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
14:31 πŸ”— MMovie has joined #archiveteam
14:35 πŸ”— ohhdemgir has quit IRC (Read error: Operation timed out)
14:47 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
14:48 πŸ”— MMovie has joined #archiveteam
14:48 πŸ”— zhongfu has quit IRC (Remote host closed the connection)
14:59 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
15:00 πŸ”— test_ has joined #archiveteam
15:00 πŸ”— test_ uhh hello?
15:00 πŸ”— arkiver hi
15:01 πŸ”— test_ can i request something for deletion
15:01 πŸ”— test_ personal info
15:01 πŸ”— arkiver Requests for deletion of something should be sent to info@archive.org
15:01 πŸ”— test_ ok thanks
15:01 πŸ”— MMovie has joined #archiveteam
15:01 πŸ”— test_ will do, bye
15:01 πŸ”— test_ has quit IRC (Client Quit)
15:11 πŸ”— scyther has joined #archiveteam
15:31 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
15:33 πŸ”— MMovie has joined #archiveteam
15:50 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
15:52 πŸ”— MMovie has joined #archiveteam
16:00 πŸ”— snape has joined #archiveteam
16:06 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
16:07 πŸ”— MMovie has joined #archiveteam
16:25 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
16:26 πŸ”— MMovie has joined #archiveteam
16:30 πŸ”— ats has quit IRC (Quit: Let's see if Linux 4.4.3 has working NFS again...)
16:36 πŸ”— ats has joined #archiveteam
16:40 πŸ”— Zei-Pii has quit IRC (Read error: Connection reset by peer)
16:41 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
16:42 πŸ”— MMovie has joined #archiveteam
16:52 πŸ”— joepie91 arkiver: whatever happened to the dump of open FTPs that I had a while ago?
16:52 πŸ”— joepie91 :p
16:54 πŸ”— philpem has joined #archiveteam
17:10 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
17:12 πŸ”— MMovie has joined #archiveteam
17:30 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
17:30 πŸ”— scyther_ has joined #archiveteam
17:31 πŸ”— MMovie has joined #archiveteam
17:32 πŸ”— scyther has quit IRC (Ping timeout: 250 seconds)
17:45 πŸ”— arkiver joepie91: I'll look into that!
17:47 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
17:49 πŸ”— arkiver joepie91: do you still have that dump?
17:50 πŸ”— MMovie has joined #archiveteam
17:52 πŸ”— joepie91 arkiver: eh, might have, but my files are a mess atm
17:59 πŸ”— zhongfu has joined #archiveteam
18:00 πŸ”— joepie91 arkiver: remind me of the filename?
18:06 πŸ”— zhongfu has quit IRC (Remote host closed the connection)
18:11 πŸ”— JesseW has joined #archiveteam
18:15 πŸ”— metalcamp has quit IRC (Ping timeout: 252 seconds)
18:16 πŸ”— zhongfu has joined #archiveteam
18:17 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
18:18 πŸ”— MMovie has joined #archiveteam
18:36 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
18:37 πŸ”— MMovie has joined #archiveteam
18:41 πŸ”— scyther_ has quit IRC (Read error: Connection reset by peer)
18:54 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
18:56 πŸ”— MMovie has joined #archiveteam
19:04 πŸ”— godane has quit IRC (Read error: Operation timed out)
19:07 πŸ”— victor has joined #archiveteam
19:11 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
19:12 πŸ”— MMovie has joined #archiveteam
19:29 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
19:31 πŸ”— MMovie has joined #archiveteam
19:33 πŸ”— Infreq has quit IRC (Ping timeout: 258 seconds)
19:34 πŸ”— Infreq has joined #archiveteam
19:35 πŸ”— zino_ has joined #archiveteam
19:37 πŸ”— Burak has quit IRC (Ping timeout: 255 seconds)
19:38 πŸ”— schbirid has quit IRC (hub.efnet.us irc.Prison.NET)
19:38 πŸ”— zino has quit IRC (hub.efnet.us irc.Prison.NET)
19:38 πŸ”— vOYtEC has quit IRC (hub.efnet.us irc.Prison.NET)
19:38 πŸ”— achip has quit IRC (hub.efnet.us irc.Prison.NET)
19:42 πŸ”— schbirid2 has joined #archiveteam
19:57 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
19:57 πŸ”— achip has joined #archiveteam
19:58 πŸ”— MMovie has joined #archiveteam
19:58 πŸ”— vOYtEC has joined #archiveteam
20:02 πŸ”— Burak has joined #archiveteam
20:11 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
20:12 πŸ”— ndiddy has joined #archiveteam
20:12 πŸ”— megaminxw has joined #archiveteam
20:13 πŸ”— MMovie has joined #archiveteam
20:29 πŸ”— metalcamp has joined #archiveteam
20:47 πŸ”— Burak has quit IRC (Ping timeout: 255 seconds)
20:48 πŸ”— JesseW has quit IRC (Quit: Leaving.)
20:48 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
20:49 πŸ”— MMovie has joined #archiveteam
20:50 πŸ”— scyther has joined #archiveteam
21:04 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
21:05 πŸ”— MMovie has joined #archiveteam
21:06 πŸ”— Burak has joined #archiveteam
21:22 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
21:23 πŸ”— MMovie has joined #archiveteam
21:26 πŸ”— metalcamp has quit IRC (Ping timeout: 252 seconds)
21:32 πŸ”— bwn has quit IRC (Ping timeout: 246 seconds)
21:39 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
21:40 πŸ”— MMovie has joined #archiveteam
21:43 πŸ”— Tomcat_ has quit IRC (Remote host closed the connection)
21:51 πŸ”— Boppen has joined #archiveteam
21:54 πŸ”— mismatch_ has joined #archiveteam
22:03 πŸ”— schbirid2 has quit IRC (Quit: Leaving)
22:07 πŸ”— Boppen has quit IRC (hub.se irc.du.se)
22:11 πŸ”— bwn has joined #archiveteam
22:25 πŸ”— Boppen has joined #archiveteam
22:27 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
22:29 πŸ”— MMovie has joined #archiveteam
22:30 πŸ”— JesseW has joined #archiveteam
22:30 πŸ”— scyther has quit IRC (Read error: Connection reset by peer)
22:30 πŸ”— bwn has quit IRC (Read error: Operation timed out)
22:34 πŸ”— scyther has joined #archiveteam
22:44 πŸ”— Boppen has quit IRC (Ping timeout: 200 seconds)
22:47 πŸ”— Boppen has joined #archiveteam
22:53 πŸ”— Boppen has quit IRC (hub.se irc.du.se)
22:56 πŸ”— megaminxw has quit IRC (Quit: Leaving.)
22:59 πŸ”— mismatch_ has quit IRC (Ping timeout: 499 seconds)
23:06 πŸ”— rduser has quit IRC (Ping timeout: 260 seconds)
23:06 πŸ”— Rickster has quit IRC (Ping timeout: 260 seconds)
23:08 πŸ”— Famicoman has quit IRC (Ping timeout: 260 seconds)
23:09 πŸ”— Simpbrai_ has quit IRC (Remote host closed the connection)
23:10 πŸ”— bauruine has quit IRC (Ping timeout: 260 seconds)
23:10 πŸ”— mismatch_ has joined #archiveteam
23:10 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
23:11 πŸ”— rduser has joined #archiveteam
23:12 πŸ”— MMovie has joined #archiveteam
23:12 πŸ”— bauruine has joined #archiveteam
23:13 πŸ”— Rickster has joined #archiveteam
23:14 πŸ”— Simpbrai_ has joined #archiveteam
23:27 πŸ”— MMovie has quit IRC (Read error: Operation timed out)
23:28 πŸ”— MMovie has joined #archiveteam
23:43 πŸ”— arkiver BnA-Robin: if you're interested in another project to run you might like FTP
23:43 πŸ”— arkiver restarted today and we don't have a lot of people running it yet
23:46 πŸ”— Boppen has joined #archiveteam
23:48 πŸ”— arkiver SketchCow: What do you think of saving LiveJournal? We can make it a long running project, maybe over a year, so it won't need a lot of resources
23:49 πŸ”— arkiver If you give the go we'll have a project running soon for livejournal
23:54 πŸ”— bwn has joined #archiveteam
23:54 πŸ”— scyther has quit IRC (Quit: Leaving)

irclogger-viewer