#archiveteam-bs 2016-12-09,Fri

↑back Search

Time Nickname Message
00:02 πŸ”— BartoCH has joined #archiveteam-bs
01:11 πŸ”— compu_85 look into Novus plastic polish
01:11 πŸ”— compu_85 it works really well
01:11 πŸ”— compu_85 using it I got some laserdiscs to just about play
01:33 πŸ”— nicolas17 ok, I ripped the Scooter's Magic Castle CD, can't get MIDI to work in dosbox though
01:34 πŸ”— nicolas17 that's playing it from the CD... if I run the installer, it tries to install *into the CD* and fails
01:50 πŸ”— jrwr doing something kinda like docker, but using UML instead, mostly just to see how it would be done, and then we will have a fully usermode AT Warrior that runs on x64 Linux
01:50 πŸ”— jrwr ls
01:57 πŸ”— PovAddict has joined #archiveteam-bs
01:57 πŸ”— tpw_rules . ..
01:57 πŸ”— PovAddict is now known as nicolas18
01:57 πŸ”— nicolas18 stupid internet
01:58 πŸ”— nicolas18 I still can't get midi to work in dosbox :/
01:59 πŸ”— nicolas17 has quit IRC (Read error: Operation timed out)
02:02 πŸ”— nicolas18 is now known as nicolas17
02:02 πŸ”— nicolas17 oh well
02:14 πŸ”— VADemon has joined #archiveteam-bs
02:19 πŸ”— VADemon I don't know if it's ok to use archivebot for this kind of requests, but I've a little website made specifically for scams that may disappear any time soon and will definitely not be saved by waybackbot: http://csxbets.com/ Will be nice if someone would put it in, otherwise I'll make my own copy
02:21 πŸ”— nicolas17 wut
02:24 πŸ”— VADemon not "I've" in "I created" but rather "I found" or was attempted to be screwed over
02:24 πŸ”— VADemon as in*
02:26 πŸ”— nicolas17 http://pastebin.com/5U7vTzxR this?
02:27 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 244 seconds)
02:30 πŸ”— VADemon looks quite the same, yes. the guy was russian, in a hurry and wrote like a real typical asshole and tried to get me on "how we'll screw over these americans while they're sleeping"
02:30 πŸ”— VADemon the domain was registered on november 15th using a popular russian registrar
02:32 πŸ”— VADemon line 278: Greezy: Ρ‹Π·ΠΊΠΊΠ½
02:32 πŸ”— nicolas17 there you go
02:34 πŸ”— Stiletto has joined #archiveteam-bs
02:48 πŸ”— Lord_Nigh has joined #archiveteam-bs
03:08 πŸ”— VADemon has quit IRC (Quit: left4dead)
03:09 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 244 seconds)
03:10 πŸ”— jsp12345 has joined #archiveteam-bs
03:11 πŸ”— joepie91 yipdw: https://twitter.com/Arcaire/status/786720522977640448
03:14 πŸ”— dashcloud nicolas17: there's different CD options you can play with in DOSBOX to handle how a CD is mounted: http://www.dosbox.com/wiki/MOUNT
03:15 πŸ”— nicolas17 dashcloud: setting the CD label fixed "installer says the CD isn't present", and mounting a C: drive fixed "installer tries to install into CD"
03:15 πŸ”— nicolas17 never got MIDI to work though
03:16 πŸ”— dashcloud there's MIDI pages here: http://www.dosbox.com/wiki/Configuration:MIDI and here: http://www.dosbox.com/wiki/MIDI_software_devices
03:16 πŸ”— dashcloud I haven't tried to do any MIDI things in many years (and even then, only in ScummVM)
03:16 πŸ”— nicolas17 nah I gave up ^^ uploading the iso now
03:38 πŸ”— ndiddy has quit IRC (Quit: Leaving)
03:44 πŸ”— jrwr has quit IRC (Remote host closed the connection)
03:57 πŸ”— Lord_Nigh has joined #archiveteam-bs
03:59 πŸ”— nicolas17 https://nicolas17.s3.amazonaws.com/SCOOTCD.iso?Signature=D8%2F6%2F92lFxpZO%2Fd%2FrLXWrHU4kBU%3D&AWSAccessKeyId=AKIAJ4WFWKA3POOZTJSA&Expires=1481277564
04:18 πŸ”— yipdw joepie91: I've never dealt with the Go community, but if that's true that's pretty shitty
04:18 πŸ”— yipdw C++ community is like that, too
04:31 πŸ”— joepie91 yipdw: most language communities are, in my experience
04:31 πŸ”— joepie91 yipdw: only ones I've seen that aren't, so far, are Node.js and Rust
04:31 πŸ”— joepie91 oh, and Lua
04:31 πŸ”— yipdw yes
04:32 πŸ”— nicolas17 yeah
04:32 πŸ”— nicolas17 in nodejs they won't tell you "why are you using X? you should be using Y" because they are assholes
04:32 πŸ”— yipdw fortunately, with C++ there's sites like cppreference and tools like the compiler explorer that mean that I don't need to actually talk to people
04:32 πŸ”— nicolas17 they will tell you that because X really did get replaced by Y, and that happens often :P
04:32 πŸ”— yipdw if I want to see if something works, I can just read it and do it
04:35 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 633 seconds)
04:35 πŸ”— Lord_Nigh has joined #archiveteam-bs
04:39 πŸ”— Start has joined #archiveteam-bs
04:42 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
04:50 πŸ”— joepie91 nicolas17: well, I wouldn't say "replaced"
04:50 πŸ”— joepie91 nicolas17: rather "superseded" (in the sense that it does the thing better) or "what you're trying to do is a bad idea because <technical reason>"
04:50 πŸ”— joepie91 things very rarely really get *replaced* in JS
04:50 πŸ”— joepie91 even if it may seem like that to people who just chase the latest framework from hackernews
04:50 πŸ”— joepie91 :p
04:54 πŸ”— Lord_Nigh has joined #archiveteam-bs
05:39 πŸ”— Sk1d has quit IRC (Ping timeout: 194 seconds)
05:45 πŸ”— Sk1d has joined #archiveteam-bs
06:26 πŸ”— BartoCH has quit IRC (Ping timeout: 260 seconds)
06:40 πŸ”— ravetcofx has quit IRC (Read error: Connection reset by peer)
06:40 πŸ”— BartoCH has joined #archiveteam-bs
06:46 πŸ”— Start has quit IRC (Read error: Connection reset by peer)
06:47 πŸ”— Start has joined #archiveteam-bs
06:47 πŸ”— ravetcofx has joined #archiveteam-bs
06:50 πŸ”— kristian_ has quit IRC (Quit: Leaving)
07:15 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
07:16 πŸ”— Lord_Nigh has joined #archiveteam-bs
07:21 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 250 seconds)
07:23 πŸ”— ravetcofx has quit IRC (Read error: Operation timed out)
07:30 πŸ”— Lord_Nigh has joined #archiveteam-bs
07:31 πŸ”— nicolas17 has quit IRC (Quit: oh hey it's 4am *passes out*)
07:33 πŸ”— GE has joined #archiveteam-bs
08:19 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 632 seconds)
08:45 πŸ”— Lord_Nigh has joined #archiveteam-bs
08:48 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
09:23 πŸ”— Lord_Nigh has joined #archiveteam-bs
09:42 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
10:10 πŸ”— will has quit IRC (Quit: Goodbye)
10:12 πŸ”— GE has quit IRC (Quit: zzz)
10:13 πŸ”— will has joined #archiveteam-bs
10:43 πŸ”— Lord_Nigh has joined #archiveteam-bs
10:48 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
10:56 πŸ”— Lord_Nigh has joined #archiveteam-bs
11:01 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 250 seconds)
11:09 πŸ”— BlueMaxim has quit IRC (Quit: Leaving)
11:27 πŸ”— SN4T14_ has joined #archiveteam-bs
11:29 πŸ”— SN4T14_ has quit IRC (Client Quit)
11:45 πŸ”— i336 has joined #archiveteam-bs
11:45 πŸ”— i336 hi
11:45 πŸ”— Igloo^_^ So, We don't not archive stuff
11:45 πŸ”— Igloo^_^ If that makes sense :)
11:45 πŸ”— i336 cooool. okay :>
11:45 πŸ”— i336 I'll start from the top
11:46 πŸ”— Igloo^_^ It's more a case of the people being @'d making a decision on if it's valuable use of our resources. Which I am not able to do say either way
11:46 πŸ”— i336 that makes absolute sense.
11:46 πŸ”— Igloo^_^ Plead your case :) it'll be noted and the answer will come down
11:46 πŸ”— i336 okay, will do *types*
11:47 πŸ”— trs80 first up, what is ex.ua? a file locker?
11:47 πŸ”— i336 ex.ua was the 11th largest site in Ukraine. the content was somewhere vaguely within the spectrum of bittorrent and DC++, sometimes better than both, with a dash of what.cd thrown in in the form of user accounts that could manage folder lists.
11:47 πŸ”— i336 the site went ukraine-only in 2010, but I accidentally discovered that fuzzing the user agent string let people in, so I happily browsed from australia
11:48 πŸ”— i336 I grabbed a couple TB a few months ago - someone archived pages and pages and pages of FLAC discographies, I found the user profile via google one day
11:48 πŸ”— i336 anyway, on the 1st of this month the site went dark and browsing stopped working.
11:48 πŸ”— i336 so the what.cd-like aspects of the site disappeared.
11:49 πŸ”— i336 :(
11:49 πŸ”— i336 HOWEVER
11:49 πŸ”— i336 good news: the web archive crawler was allowed in!!! it has some percentage of the site content
11:49 πŸ”— i336 bad news: CDX is a puddle on the floor if I request ex.ua&matchType=domain.
11:49 πŸ”— i336 so that's the user profiles linked to folder lists and file lists
11:50 πŸ”— i336 the difference between ex.ua and what.cd is that what.cd was just a list of hashes at the end of the day
11:50 πŸ”— i336 ex.ua is more than that: all the content is still on the servers.
11:50 πŸ”— i336 these servers go down on the 31st. forever.
11:50 πŸ”— i336 https://torrentfreak.com/major-cyberlocker-ex-ua-to-shut-down-amid-police-crackdown-161118/
11:51 πŸ”— i336 this happened 20 days ago - I had no idea, I decided to check the site out randomly, 2 days ago
11:51 πŸ”— i336 so. here's how archiving works
11:51 πŸ”— i336 wget ex.ua/filelist/71339549.xspf
11:52 πŸ”— i336 that ID came from a bookmark from several months back
11:52 πŸ”— GE has joined #archiveteam-bs
11:52 πŸ”— i336 that gives you back an XSPF file with a <title> - very important - and a series of filenames and URLs
11:52 πŸ”— i336 then you just follow the /get/.... URLs in the XSPF
11:52 πŸ”— i336 it 301 Moved's you to the file location **OR** a /view/ link if the file has been silently deleted
11:53 πŸ”— i336 that ^ is the entire crawler. the wayback machine actually has a small archive of the file servers, they're that trivial to access.
11:53 πŸ”— i336 so. the major problem is that the IDs are unknown.
11:53 πŸ”— Igloo^_^ That's not too bad
11:53 πŸ”— Igloo^_^ Just increment from 1 - x on the .xspf
11:53 πŸ”— i336 yup
11:54 πŸ”— Igloo^_^ and follow the links it produces
11:54 πŸ”— i336 x might be really high :P and the other issue is it's kinda slow
11:54 πŸ”— Igloo^_^ Problem is where to store the content.
11:54 πŸ”— Igloo^_^ (and the questionable legality of same)
11:54 πŸ”— i336 I was wondering about *requisitioning* the Petabox? something something https://archive.org/details/whatcdcrawl
11:54 πŸ”— i336 :>
11:54 πŸ”— Igloo^_^ I don't know if the internet archive would be appreciative of 6PB of questionable legality. But that's not my decision
11:55 πŸ”— i336 s/questionable legality/"shh, it will be gone soon"/
11:55 πŸ”— Igloo^_^ (Just to make it clear we're not the internet archive, we're just a team who provide resource to capture the sites before they're gone forever)
11:55 πŸ”— i336 I completely understand your position though. I only just learned the IA has 15PB total.
11:55 πŸ”— i336 (right.)
11:55 πŸ”— i336 although backblaze have 200pb
11:57 πŸ”— Igloo^_^ Ok, So the archiving itself is trivial, we can break it down into a warrior project easy enough. It's just the two questions a) do we want it (probably - typical attitude is archive first questions later) b) where do we put it?!
11:57 πŸ”— i336 I'm impressed you can make the warrior do ex.ua
11:57 πŸ”— i336 lol
11:57 πŸ”— Igloo^_^ If you're happy to hang around then i'm sure we'll get an answer soon enough
11:57 πŸ”— i336 okay, sure. I might see if I can run irssi from my friend's PC for after I put my laptop to sleep
11:58 πŸ”— Igloo^_^ The warrior just takes a string of URLs and follows a set pattern for data. Then syncs them somewhere
11:58 πŸ”— i336 yes, but it's accessing ex.ua, a site known to be a really big file locker
11:59 πŸ”— i336 things did change in 2010 though, it went ukraine-only, so perhaps most ISPs have forgotten about them. I just don't want to cause issues for residential customers
12:00 πŸ”— Igloo^_^ This wouldn't be the first time.
12:00 πŸ”— i336 that's reassuring :]
12:00 πŸ”— i336 also - I'm actually in an annoying/amusing position atm - I was using a friend's VPS but after a bit of a disagreement with systemd and system administration I kinda got booted off. I'd likely have some sort of sketchy proof-of-concept crawler/indexer otherwise.
12:01 πŸ”— i336 (I'm currently working on getting my own VPS, but due to some inane IRL issues that's taking a bit longer than I'd like.)
12:01 πŸ”— Igloo^_^ Leave it with us :)
12:01 πŸ”— i336 awesome to hear :D
12:01 πŸ”— Igloo^_^ No promises
12:02 πŸ”— i336 I understand
12:02 πŸ”— i336 one thing I can recommend is, if you can, bulk-fetch a full dump of the web archive's crawls of ex.ua
12:02 πŸ”— Sanqui i definitely recommend hanging around if you want the project to lift off the ground
12:02 πŸ”— i336 okay, will do... it's 11:02PM right now, but I'll hang around for as long as I can
12:02 πŸ”— i336 (thanks)
12:02 πŸ”— Sanqui i mean, not right now, but in the long term :p
12:02 πŸ”— i336 oh! okay
12:03 πŸ”— i336 also, if you fetch the IA's copy of ex.ua, you can just `grep -o 'ex.ua/[0-9]\+'` from all of the HTML
12:03 πŸ”— i336 that would very likely give you a very good idea of the upper bounds you're dealing with
12:03 πŸ”— i336 although you could determine those easily enough
12:04 πŸ”— i336 bruteforcing sounds easier though, since what's in IA is stable, and having it available doesn't provide all that much right now
12:04 πŸ”— i336 also, if you can let me loose on a decent machine, I can help :D
12:04 πŸ”— i336 (I'd be using my own VPS right now if I could)
12:05 πŸ”— Igloo^_^ It's the storage, I could start grabbing it at a decent rate (i've just tested a few bits) but i need the disk space
12:05 πŸ”— i336 hmm.
12:05 πŸ”— i336 one of my friends has about 6TB free right now and I could borrow that, but that would be decidedly temporary
12:06 πŸ”— i336 I'm very curious what would happen if the IA woke up tomorrow (in a manner of speaking) and found a few hundred TB missing...
12:07 πŸ”— Sanqui i also recommend creating an article on the wiki
12:07 πŸ”— Sanqui describe the project, structure, approximate size, possible action etc
12:08 πŸ”— Sanqui basically what you did here, but in a more lasting place
12:08 πŸ”— i336 okay then...
12:08 πŸ”— i336 and link to it from the front page?
12:08 πŸ”— Lord_Nigh has joined #archiveteam-bs
12:09 πŸ”— Igloo^_^ Put it in the at risk bit
12:09 πŸ”— Igloo^_^ yahoosucks is the secret =]
12:09 πŸ”— i336 thanks!
12:10 πŸ”— i336 should I title the page "ex.ua"?
12:10 πŸ”— Igloo^_^ yup
12:10 πŸ”— ranma ukraine, eh?
12:10 πŸ”— i336 yup
12:11 πŸ”— Sanqui it doesn't have to be tidy, but it's much better to have an information dump on the wiki than lost in irc logs :)
12:11 πŸ”— i336 of course, doing that now. thanks for the directive
12:16 πŸ”— Igloo^_^ No problem
12:18 πŸ”— arkiver i336: IA has 32 PB
12:19 πŸ”— vitzli has joined #archiveteam-bs
12:22 πŸ”— i336 arkiver: oh okay! very interesting. that's awesome :)
12:22 πŸ”— arkiver 105 PB if you add free space, backups, other stuf
12:23 πŸ”— arkiver stuff*
12:23 πŸ”— arkiver I had a look at the site
12:23 πŸ”— arkiver what day is it shutting down exactly?
12:24 πŸ”— Igloo^_^ 31st dec
12:24 πŸ”— Igloo^_^ 22 days
12:24 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 250 seconds)
12:26 πŸ”— arkiver We can do a project to at least archive the xspf files, so we have the metadata of what was on the site
12:26 πŸ”— arkiver I'll also support the actual files, but we won't just be queueing whatever we can find due to the size
12:27 πŸ”— arkiver the plan is that people can send lists of files they want to have preserved and then we'll preserve them
12:28 πŸ”— Lord_Nigh has joined #archiveteam-bs
12:29 πŸ”— i336 arkiver: thank you so much. the wiki page is 99% done
12:32 πŸ”— i336 arkiver, Igloo^_^, ranma, Sanqui: http://www.archiveteam.org/index.php?title=Ex.ua
12:32 πŸ”— i336 suggestions welcome
12:33 πŸ”— i336 adding to front page now
12:33 πŸ”— i336 cool
12:33 πŸ”— i336 okay I can't edit the front page and I can't find anything named "at risk"
12:33 πŸ”— i336 I think someone else needs to do that
12:34 πŸ”— arkiver yeah, the front page isn't maintained very well
12:34 πŸ”— i336 ah. okay
12:34 πŸ”— Sanqui It's included from this page: http://www.archiveteam.org/index.php?title=Current_Projects
12:34 πŸ”— i336 arkiver: also, another note - when the /get/ URLs give you the 301 Moved, the URL you get sent to can be sent HEAD and you get back a valid Content-Length.
12:34 πŸ”— arkiver http://tracker.archiveteam.org gives a better idea of running/upcoming projects
12:34 πŸ”— i336 thanks
12:34 πŸ”— Sanqui but yeah, not maintained
12:34 πŸ”— i336 right
12:35 πŸ”— Sanqui I'll add a project template to the page
12:35 πŸ”— i336 okay then
12:37 πŸ”— arkiver I think the wikipage should be a little different
12:37 πŸ”— arkiver I'll edit it a bit
12:38 πŸ”— i336 sure thing - I have no idea how to wikicode, and I've no idea what I'm doing in this situation either
12:39 πŸ”— i336 ooh, the page has an infobox :>
12:40 πŸ”— i336 I'm wondering if there's anyone out there with a very large pile of diskspace who might not mind holding these kinds of files indefinitely.
12:40 πŸ”— Igloo^_^ 6PB is a very very very expensive setup
12:41 πŸ”— i336 I can't deny that :/
12:41 πŸ”— JSharp___ has joined #archiveteam-bs
12:44 πŸ”— i336 I think I might go out and look for disk space
12:45 πŸ”— i336 should ex.ua have its own IRC channel? might be a tad early to ask and/or presumptive, but I figure I might as well ask in advance
12:49 πŸ”— i336 also, do you think if I mention this on /r/datahoarder and /hoard/ ex.ua might get wind and kill the API access?
12:49 πŸ”— i336 should we archive the XSPFs *then* see what we can get, or take the risk?
12:50 πŸ”— i336 (of making a bit of noise, in the hope it gets us extra diskspace)
12:58 πŸ”— trs80 grab the XSPFs first regardless. do they have size info?
13:01 πŸ”— i336 trs80: unfortunately not - only URL and local filename :/
13:01 πŸ”— i336 I wish they did
13:02 πŸ”— trs80 what about a HEAD on the content URLs (following the redirect if necessary)?
13:03 πŸ”— trs80 Looks like HEAD includes Content-Length:
13:04 πŸ”— i336 yup it does
13:04 πŸ”— i336 but it requires HEAD on every URL listed in the XSPF
13:04 πŸ”— i336 which is annoying
13:04 πŸ”— trs80 yeah. maybe have it as a stage 1.5 grab? (normally stage 1 is grab lists of URLs, stage 2 is grab content)
13:06 πŸ”— i336 that sounds interesting
13:07 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
13:10 πŸ”— arkiver so let's do the XSPF URLs and the /get/ URLs without redirect
13:10 πŸ”— arkiver err
13:10 πŸ”— arkiver without the data they redirect to
13:10 πŸ”— i336 right
13:10 πŸ”— i336 I recommend HEADing the URLs the /get/ URLs point to - this will 1) give us file size and 2) identify dead links
13:10 πŸ”— arkiver then, people can give us lists of /get/ URLs they want to have archived and we'll archive actual file for that too
13:10 πŸ”— i336 right
13:11 πŸ”— arkiver I changed the wikipage
13:12 πŸ”— i336 right. that looks much more professional
13:12 πŸ”— Sanqui ...dutch?
13:12 πŸ”— i336 I think the original text is russian, not du...what Sanqui said
13:13 πŸ”— arkiver uh
13:13 πŸ”— arkiver oops
13:13 πŸ”— Sanqui I think the original wikipage had some useful info that's been omitted
13:14 πŸ”— arkiver like how the XSPF file looked?
13:15 πŸ”— Sanqui a lot - the fact that the site had a catalog that's been partly archived by IA, that the API isn't meant to be public, etc
13:15 πŸ”— i336 it wasn't presented in the most professional style (I tend to turn into a pile of nerves in situations like this, I blame that)
13:15 πŸ”— Sanqui history about the website (going ukraine only in 2010),...
13:21 πŸ”— i336 THE SITE IS FULL OF TORRENTS
13:21 πŸ”— i336 http://web.archive.org/web/20140912032528/http://www.ex.ua/80745202?
13:22 πŸ”— i336 "Toppeht" (I can't type unicode, it looks like that)
13:22 πŸ”— i336 curl -I http://www.ex.ua/torrent/125030659
13:22 πŸ”— i336 Location: http://fs61.www.ex.ua/torrent/beafc19e3c8a355ca8bd0401392649b6/125030659/48.%20Ori%20Uplift%20-%20Uplifting%20Only%20Fan%20Favorites%202013%202014%20(Continuous%20Dj%20Mix%20Part%202).flac.EX.UA.torrent
13:22 πŸ”— Igloo^_^ I must say I love the enthusiasm i336
13:22 πŸ”— i336 we'd need to bruteforce that unfortunately, the XSPF files don't tell us
13:23 πŸ”— i336 oh. I'm just freaking out about this thing going away in ~20 days
13:23 πŸ”— i336 I wanna save the what.cd-like thing :P
13:23 πŸ”— arkiver fixed.
13:23 πŸ”— arkiver look at the wiki again
13:23 πŸ”— arkiver also, let's create a channel for this
13:23 πŸ”— Igloo^_^ #exexbaby ?
13:24 πŸ”— arkiver yes
13:24 πŸ”— i336 lol okay
13:24 πŸ”— Igloo^_^ (stupid channel names are part of the woodwork here)
13:24 πŸ”— i336 (right)
13:37 πŸ”— RichardG_ has joined #archiveteam-bs
13:38 πŸ”— RichardG has quit IRC (Ping timeout: 260 seconds)
14:01 πŸ”— i336 has quit IRC (leaving)
14:02 πŸ”— i336_ has joined #archiveteam-bs
14:13 πŸ”— RichardG_ is now known as RichardG
14:22 πŸ”— Lord_Nigh has joined #archiveteam-bs
14:22 πŸ”— Boltsie has joined #archiveteam-bs
14:32 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 244 seconds)
14:39 πŸ”— Lord_Nigh has joined #archiveteam-bs
15:08 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
15:13 πŸ”— godane i'm starting to upload The Portugal News pdf i got: https://archive.org/details/The_Portugal_News-898
15:13 πŸ”— godane *pdfs
15:17 πŸ”— Lord_Nigh has joined #archiveteam-bs
15:26 πŸ”— Flierp has quit IRC (Quit: Page closed)
15:33 πŸ”— alembic has quit IRC ()
15:34 πŸ”— alembic has joined #archiveteam-bs
15:41 πŸ”— Start has quit IRC (Quit: Disconnected.)
15:54 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
16:04 πŸ”— Lord_Nigh has joined #archiveteam-bs
17:32 πŸ”— tpw_rules https://web.archive.org/web/*/http://media.fukung.net/images/10174/*
17:32 πŸ”— tpw_rules why is the 'to' date way before the 'from' date? and when i click on it it says the page is ont in the archive
17:33 πŸ”— tpw_rules not
17:34 πŸ”— xmc it was fetched exactly once, on 21-jun-2013, and yielded a 500-series error
17:34 πŸ”— xmc https://web-beta.archive.org/web/*/http://media.fukung.net/images/10174/crackers.jpg
17:34 πŸ”— tpw_rules ah okay
17:34 πŸ”— xmc web-beta.archive.org handles these cases better than web.archive.org
17:35 πŸ”— xmc they're in the middle of a big rewrite so,
17:35 πŸ”— tpw_rules shit i bet all the pictures from that site are like that
17:35 πŸ”— tpw_rules to prevent hotlinking
17:35 πŸ”— xmc randomly selected https://web-beta.archive.org/web/20130703155351/http://media.fukung.net/images/10033/b0146902f9402a370a54792116702f75.gif
17:35 πŸ”— tpw_rules okay so some work
17:36 πŸ”— xmc https://web-beta.archive.org/web/*/http://media.fukung.net/images/*
17:36 πŸ”— xmc oh huh, fukung.net is gone
17:36 πŸ”— xmc damn
17:36 πŸ”— tpw_rules yeah
17:36 πŸ”— xmc i didn't know
17:37 πŸ”— tpw_rules i have a collection of 400 or so pics from them i saved a while back
17:38 πŸ”— vitzli has quit IRC (Leaving)
17:43 πŸ”— godane i'm starting to upload more ABC RN Late Night Live
17:47 πŸ”— HCross has quit IRC (Read error: Operation timed out)
17:49 πŸ”— REiN^ has quit IRC (Max SendQ exceeded)
17:50 πŸ”— REiN^ has joined #archiveteam-bs
17:54 πŸ”— Mayonaise has quit IRC (Read error: Operation timed out)
18:19 πŸ”— Aranje has joined #archiveteam-bs
18:45 πŸ”— Lord_Nigh has quit IRC (Ping timeout: 250 seconds)
18:56 πŸ”— Lord_Nigh has joined #archiveteam-bs
19:04 πŸ”— nicolas17 has joined #archiveteam-bs
19:28 πŸ”— Ravenloft has joined #archiveteam-bs
19:36 πŸ”— godane i'm at 1018k items now
19:46 πŸ”— Aranje has quit IRC (Ping timeout: 260 seconds)
19:46 πŸ”— Start has joined #archiveteam-bs
20:22 πŸ”— Lord_Nigh has quit IRC (Read error: Operation timed out)
20:22 πŸ”— Lord_Nigh has joined #archiveteam-bs
20:27 πŸ”— Specular has joined #archiveteam-bs
20:29 πŸ”— Specular the person who runs archive.is seems pretty open via his informal Q&A on the blog. Think if anything happened to the site he'd be responsive to backup collabs?
20:45 πŸ”— alembic I would think so
20:53 πŸ”— VADemon has joined #archiveteam-bs
20:55 πŸ”— ndiddy has joined #archiveteam-bs
20:59 πŸ”— Start has quit IRC (Quit: Disconnected.)
21:16 πŸ”— i336_ has quit IRC (Read error: Operation timed out)
21:35 πŸ”— i336 has joined #archiveteam-bs
21:56 πŸ”— Asparagir has quit IRC (Asparagir)
22:05 πŸ”— Start has joined #archiveteam-bs
22:29 πŸ”— Start has quit IRC (Quit: Disconnected.)
22:36 πŸ”— GE has quit IRC (Remote host closed the connection)
22:43 πŸ”— achip has quit IRC (Ping timeout: 255 seconds)
22:47 πŸ”— BartoCH has quit IRC (Ping timeout: 260 seconds)
22:48 πŸ”— achip has joined #archiveteam-bs
22:52 πŸ”— BartoCH has joined #archiveteam-bs
22:57 πŸ”— Yoshimura has quit IRC (Quit: quit)
23:04 πŸ”— Yoshimura has joined #archiveteam-bs
23:11 πŸ”— BlueMaxim has joined #archiveteam-bs
23:19 πŸ”— Start has joined #archiveteam-bs
23:28 πŸ”— i0npulse has quit IRC (Ping timeout: 244 seconds)
23:28 πŸ”— i0npulse has joined #archiveteam-bs
23:29 πŸ”— krazedkat has quit IRC (Ping timeout: 244 seconds)
23:30 πŸ”— Specular has quit IRC (Ping timeout: 244 seconds)
23:31 πŸ”— espes__ has quit IRC (Ping timeout: 244 seconds)
23:35 πŸ”— i0npulse has quit IRC (Ping timeout: 244 seconds)
23:35 πŸ”— i0npulse has joined #archiveteam-bs
23:37 πŸ”— Specular has joined #archiveteam-bs
23:37 πŸ”— closure has quit IRC (Ping timeout: 244 seconds)
23:38 πŸ”— ravetcofx has joined #archiveteam-bs
23:38 πŸ”— espes__ has joined #archiveteam-bs
23:38 πŸ”— joepie91 Specular: iirc there have been problems with the guy before but I don't recall the specifics
23:39 πŸ”— closure has joined #archiveteam-bs
23:40 πŸ”— i0npulse has quit IRC (Ping timeout: 244 seconds)
23:40 πŸ”— i0npulse has joined #archiveteam-bs
23:41 πŸ”— useretail has quit IRC (Ping timeout: 244 seconds)
23:44 πŸ”— bsmith093 has quit IRC (Ping timeout: 244 seconds)
23:45 πŸ”— useretail has joined #archiveteam-bs
23:45 πŸ”— joepie91 Specular: ref http://archiveteam.org/index.php?title=Archive.is
23:47 πŸ”— pikhq_ has quit IRC (Ping timeout: 244 seconds)
23:47 πŸ”— joepie91 SketchCow: I'm sure you'll love how Wikipedians here have wiki-linked "rage", "paranoia" and "kleptomania" in your quote: https://en.wikipedia.org/wiki/Archive_Team :P
23:47 πŸ”— SketchCow has quit IRC (Ping timeout: 244 seconds)
23:49 πŸ”— i336_ has joined #archiveteam-bs
23:49 πŸ”— i0npulse has quit IRC (Ping timeout: 244 seconds)
23:49 πŸ”— i336_ I understand some of the people here have access to crazy internet bandwidth. I currently have a very big use for a 5gbps or 10gbps link to run an indexer on what's left of ex.ua, which is disappearing for good in about 20 days. I'd need only a few GB of disk space as I would only be using this for indexing efforts and prototyping.
23:49 πŸ”— pikhq has joined #archiveteam-bs
23:49 πŸ”— i336_ if anyone can help please PM me
23:49 πŸ”— i0npulse has joined #archiveteam-bs
23:49 πŸ”— nicolas17 a DigitalOcean VPS maybe?
23:49 πŸ”— bsmith093 has joined #archiveteam-bs
23:49 πŸ”— Medowar0 has quit IRC (Ping timeout: 244 seconds)
23:49 πŸ”— i336_ that sounds awesome, but I unfortunately can't waltz over to the site and grab one right now
23:50 πŸ”— bwn has quit IRC (Ping timeout: 244 seconds)
23:51 πŸ”— zenguy has quit IRC (Ping timeout: 244 seconds)
23:51 πŸ”— Simpbrain has quit IRC (Ping timeout: 244 seconds)
23:51 πŸ”— Frogging has quit IRC (Ping timeout: 244 seconds)
23:53 πŸ”— i336_ yeah, really awesome. but also noteworthily expensive
23:53 πŸ”— nicolas17 their pricing says how much data transfer you get on each droplet size... but afaik nothing actually happens when you reach the "limit"
23:53 πŸ”— Kaz do you *need* 5-10gbps?
23:53 πŸ”— Kaz like are you actually going to be able to saturate it
23:53 πŸ”— i336_ well... tbh, I'm not honestly sure.
23:53 πŸ”— i336_ further explanation: (details in #exexbaby)
23:53 πŸ”— i336_ ex.ua was the 11th biggest site in ukraine. it was a file host with user accounts and public access. kind of like what.cd
23:53 πŸ”— hook54321 has quit IRC (Ping timeout: 244 seconds)
23:54 πŸ”— Medowar_ has joined #archiveteam-bs
23:55 πŸ”— i0npulse has quit IRC (Ping timeout: 244 seconds)
23:55 πŸ”— i336_ the archive team has a project to archive all the XSPF files that form the folder system on the site; these XSPF files are XML filelists
23:55 πŸ”— joepie91 i336_: as in, you're *looking* for a 5gbps-10gbps uplink?
23:56 πŸ”— dashcloud has quit IRC (Ping timeout: 244 seconds)
23:56 πŸ”— purplebot has quit IRC (Ping timeout: 244 seconds)
23:56 πŸ”— i0npulse has joined #archiveteam-bs
23:57 πŸ”— purplebot has joined #archiveteam-bs
23:57 πŸ”— joepie91 i336_: if yes, servercrate claims to offer 10gbps connectivity on their storage VPSes: https://servercrate.com/vps-storage
23:57 πŸ”— closure has quit IRC (Ping timeout: 244 seconds)
23:58 πŸ”— i336_ I'm not 100% sure. my reasoning is going along the lines of "this site is going down for good in 20 days, faster is better!"
23:58 πŸ”— joepie91 I can't vouch for whether it actually does
23:58 πŸ”— joepie91 I know that the network is not *great* but it works
23:59 πŸ”— i336_ I see.
23:59 πŸ”— joepie91 the problem is more likely to be traffic
23:59 πŸ”— Kaz ah, so you want lots of 1gbps links
23:59 πŸ”— joepie91 10gbps unmetered basically costs a fortune :P
23:59 πŸ”— tapedrive I doubt very much that you'll actaully be able to use a connection that fast - the site will probably rate limit you.
23:59 πŸ”— i336_ joepie91: good point. very good point
23:59 πŸ”— Kaz unless there's one hell of a beefy machine pushing it
23:59 πŸ”— i336_ tapedrive: also very good point
23:59 πŸ”— i336_ Kaz: oh yeah
23:59 πŸ”— * i336_ brb
23:59 πŸ”— closure has joined #archiveteam-bs
23:59 πŸ”— Kaz I think we can do that
23:59 πŸ”— i336_ Kaz: you're right, I do want lots of 1gbps links
23:59 πŸ”— i336_ !
23:59 πŸ”— Kaz do you have code?
23:59 πŸ”— Medowar_ has quit IRC (Ping timeout: 244 seconds)

irclogger-viewer