Time |
Nickname |
Message |
00:02
π
|
|
BartoCH has joined #archiveteam-bs |
01:11
π
|
compu_85 |
look into Novus plastic polish |
01:11
π
|
compu_85 |
it works really well |
01:11
π
|
compu_85 |
using it I got some laserdiscs to just about play |
01:33
π
|
nicolas17 |
ok, I ripped the Scooter's Magic Castle CD, can't get MIDI to work in dosbox though |
01:34
π
|
nicolas17 |
that's playing it from the CD... if I run the installer, it tries to install *into the CD* and fails |
01:50
π
|
jrwr |
doing something kinda like docker, but using UML instead, mostly just to see how it would be done, and then we will have a fully usermode AT Warrior that runs on x64 Linux |
01:50
π
|
jrwr |
ls |
01:57
π
|
|
PovAddict has joined #archiveteam-bs |
01:57
π
|
tpw_rules |
. .. |
01:57
π
|
|
PovAddict is now known as nicolas18 |
01:57
π
|
nicolas18 |
stupid internet |
01:58
π
|
nicolas18 |
I still can't get midi to work in dosbox :/ |
01:59
π
|
|
nicolas17 has quit IRC (Read error: Operation timed out) |
02:02
π
|
|
nicolas18 is now known as nicolas17 |
02:02
π
|
nicolas17 |
oh well |
02:14
π
|
|
VADemon has joined #archiveteam-bs |
02:19
π
|
VADemon |
I don't know if it's ok to use archivebot for this kind of requests, but I've a little website made specifically for scams that may disappear any time soon and will definitely not be saved by waybackbot: http://csxbets.com/ Will be nice if someone would put it in, otherwise I'll make my own copy |
02:21
π
|
nicolas17 |
wut |
02:24
π
|
VADemon |
not "I've" in "I created" but rather "I found" or was attempted to be screwed over |
02:24
π
|
VADemon |
as in* |
02:26
π
|
nicolas17 |
http://pastebin.com/5U7vTzxR this? |
02:27
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 244 seconds) |
02:30
π
|
VADemon |
looks quite the same, yes. the guy was russian, in a hurry and wrote like a real typical asshole and tried to get me on "how we'll screw over these americans while they're sleeping" |
02:30
π
|
VADemon |
the domain was registered on november 15th using a popular russian registrar |
02:32
π
|
VADemon |
line 278: Greezy: ΡΠ·ΠΊΠΊΠ½ |
02:32
π
|
nicolas17 |
there you go |
02:34
π
|
|
Stiletto has joined #archiveteam-bs |
02:48
π
|
|
Lord_Nigh has joined #archiveteam-bs |
03:08
π
|
|
VADemon has quit IRC (Quit: left4dead) |
03:09
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 244 seconds) |
03:10
π
|
|
jsp12345 has joined #archiveteam-bs |
03:11
π
|
joepie91 |
yipdw: https://twitter.com/Arcaire/status/786720522977640448 |
03:14
π
|
dashcloud |
nicolas17: there's different CD options you can play with in DOSBOX to handle how a CD is mounted: http://www.dosbox.com/wiki/MOUNT |
03:15
π
|
nicolas17 |
dashcloud: setting the CD label fixed "installer says the CD isn't present", and mounting a C: drive fixed "installer tries to install into CD" |
03:15
π
|
nicolas17 |
never got MIDI to work though |
03:16
π
|
dashcloud |
there's MIDI pages here: http://www.dosbox.com/wiki/Configuration:MIDI and here: http://www.dosbox.com/wiki/MIDI_software_devices |
03:16
π
|
dashcloud |
I haven't tried to do any MIDI things in many years (and even then, only in ScummVM) |
03:16
π
|
nicolas17 |
nah I gave up ^^ uploading the iso now |
03:38
π
|
|
ndiddy has quit IRC (Quit: Leaving) |
03:44
π
|
|
jrwr has quit IRC (Remote host closed the connection) |
03:57
π
|
|
Lord_Nigh has joined #archiveteam-bs |
03:59
π
|
nicolas17 |
https://nicolas17.s3.amazonaws.com/SCOOTCD.iso?Signature=D8%2F6%2F92lFxpZO%2Fd%2FrLXWrHU4kBU%3D&AWSAccessKeyId=AKIAJ4WFWKA3POOZTJSA&Expires=1481277564 |
04:18
π
|
yipdw |
joepie91: I've never dealt with the Go community, but if that's true that's pretty shitty |
04:18
π
|
yipdw |
C++ community is like that, too |
04:31
π
|
joepie91 |
yipdw: most language communities are, in my experience |
04:31
π
|
joepie91 |
yipdw: only ones I've seen that aren't, so far, are Node.js and Rust |
04:31
π
|
joepie91 |
oh, and Lua |
04:31
π
|
yipdw |
yes |
04:32
π
|
nicolas17 |
yeah |
04:32
π
|
nicolas17 |
in nodejs they won't tell you "why are you using X? you should be using Y" because they are assholes |
04:32
π
|
yipdw |
fortunately, with C++ there's sites like cppreference and tools like the compiler explorer that mean that I don't need to actually talk to people |
04:32
π
|
nicolas17 |
they will tell you that because X really did get replaced by Y, and that happens often :P |
04:32
π
|
yipdw |
if I want to see if something works, I can just read it and do it |
04:35
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 633 seconds) |
04:35
π
|
|
Lord_Nigh has joined #archiveteam-bs |
04:39
π
|
|
Start has joined #archiveteam-bs |
04:42
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
04:50
π
|
joepie91 |
nicolas17: well, I wouldn't say "replaced" |
04:50
π
|
joepie91 |
nicolas17: rather "superseded" (in the sense that it does the thing better) or "what you're trying to do is a bad idea because <technical reason>" |
04:50
π
|
joepie91 |
things very rarely really get *replaced* in JS |
04:50
π
|
joepie91 |
even if it may seem like that to people who just chase the latest framework from hackernews |
04:50
π
|
joepie91 |
:p |
04:54
π
|
|
Lord_Nigh has joined #archiveteam-bs |
05:39
π
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
05:45
π
|
|
Sk1d has joined #archiveteam-bs |
06:26
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
06:40
π
|
|
ravetcofx has quit IRC (Read error: Connection reset by peer) |
06:40
π
|
|
BartoCH has joined #archiveteam-bs |
06:46
π
|
|
Start has quit IRC (Read error: Connection reset by peer) |
06:47
π
|
|
Start has joined #archiveteam-bs |
06:47
π
|
|
ravetcofx has joined #archiveteam-bs |
06:50
π
|
|
kristian_ has quit IRC (Quit: Leaving) |
07:15
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
07:16
π
|
|
Lord_Nigh has joined #archiveteam-bs |
07:21
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 250 seconds) |
07:23
π
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
07:30
π
|
|
Lord_Nigh has joined #archiveteam-bs |
07:31
π
|
|
nicolas17 has quit IRC (Quit: oh hey it's 4am *passes out*) |
07:33
π
|
|
GE has joined #archiveteam-bs |
08:19
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 632 seconds) |
08:45
π
|
|
Lord_Nigh has joined #archiveteam-bs |
08:48
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
09:23
π
|
|
Lord_Nigh has joined #archiveteam-bs |
09:42
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
10:10
π
|
|
will has quit IRC (Quit: Goodbye) |
10:12
π
|
|
GE has quit IRC (Quit: zzz) |
10:13
π
|
|
will has joined #archiveteam-bs |
10:43
π
|
|
Lord_Nigh has joined #archiveteam-bs |
10:48
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
10:56
π
|
|
Lord_Nigh has joined #archiveteam-bs |
11:01
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 250 seconds) |
11:09
π
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
11:27
π
|
|
SN4T14_ has joined #archiveteam-bs |
11:29
π
|
|
SN4T14_ has quit IRC (Client Quit) |
11:45
π
|
|
i336 has joined #archiveteam-bs |
11:45
π
|
i336 |
hi |
11:45
π
|
Igloo^_^ |
So, We don't not archive stuff |
11:45
π
|
Igloo^_^ |
If that makes sense :) |
11:45
π
|
i336 |
cooool. okay :> |
11:45
π
|
i336 |
I'll start from the top |
11:46
π
|
Igloo^_^ |
It's more a case of the people being @'d making a decision on if it's valuable use of our resources. Which I am not able to do say either way |
11:46
π
|
i336 |
that makes absolute sense. |
11:46
π
|
Igloo^_^ |
Plead your case :) it'll be noted and the answer will come down |
11:46
π
|
i336 |
okay, will do *types* |
11:47
π
|
trs80 |
first up, what is ex.ua? a file locker? |
11:47
π
|
i336 |
ex.ua was the 11th largest site in Ukraine. the content was somewhere vaguely within the spectrum of bittorrent and DC++, sometimes better than both, with a dash of what.cd thrown in in the form of user accounts that could manage folder lists. |
11:47
π
|
i336 |
the site went ukraine-only in 2010, but I accidentally discovered that fuzzing the user agent string let people in, so I happily browsed from australia |
11:48
π
|
i336 |
I grabbed a couple TB a few months ago - someone archived pages and pages and pages of FLAC discographies, I found the user profile via google one day |
11:48
π
|
i336 |
anyway, on the 1st of this month the site went dark and browsing stopped working. |
11:48
π
|
i336 |
so the what.cd-like aspects of the site disappeared. |
11:49
π
|
i336 |
:( |
11:49
π
|
i336 |
HOWEVER |
11:49
π
|
i336 |
good news: the web archive crawler was allowed in!!! it has some percentage of the site content |
11:49
π
|
i336 |
bad news: CDX is a puddle on the floor if I request ex.ua&matchType=domain. |
11:49
π
|
i336 |
so that's the user profiles linked to folder lists and file lists |
11:50
π
|
i336 |
the difference between ex.ua and what.cd is that what.cd was just a list of hashes at the end of the day |
11:50
π
|
i336 |
ex.ua is more than that: all the content is still on the servers. |
11:50
π
|
i336 |
these servers go down on the 31st. forever. |
11:50
π
|
i336 |
https://torrentfreak.com/major-cyberlocker-ex-ua-to-shut-down-amid-police-crackdown-161118/ |
11:51
π
|
i336 |
this happened 20 days ago - I had no idea, I decided to check the site out randomly, 2 days ago |
11:51
π
|
i336 |
so. here's how archiving works |
11:51
π
|
i336 |
wget ex.ua/filelist/71339549.xspf |
11:52
π
|
i336 |
that ID came from a bookmark from several months back |
11:52
π
|
|
GE has joined #archiveteam-bs |
11:52
π
|
i336 |
that gives you back an XSPF file with a <title> - very important - and a series of filenames and URLs |
11:52
π
|
i336 |
then you just follow the /get/.... URLs in the XSPF |
11:52
π
|
i336 |
it 301 Moved's you to the file location **OR** a /view/ link if the file has been silently deleted |
11:53
π
|
i336 |
that ^ is the entire crawler. the wayback machine actually has a small archive of the file servers, they're that trivial to access. |
11:53
π
|
i336 |
so. the major problem is that the IDs are unknown. |
11:53
π
|
Igloo^_^ |
That's not too bad |
11:53
π
|
Igloo^_^ |
Just increment from 1 - x on the .xspf |
11:53
π
|
i336 |
yup |
11:54
π
|
Igloo^_^ |
and follow the links it produces |
11:54
π
|
i336 |
x might be really high :P and the other issue is it's kinda slow |
11:54
π
|
Igloo^_^ |
Problem is where to store the content. |
11:54
π
|
Igloo^_^ |
(and the questionable legality of same) |
11:54
π
|
i336 |
I was wondering about *requisitioning* the Petabox? something something https://archive.org/details/whatcdcrawl |
11:54
π
|
i336 |
:> |
11:54
π
|
Igloo^_^ |
I don't know if the internet archive would be appreciative of 6PB of questionable legality. But that's not my decision |
11:55
π
|
i336 |
s/questionable legality/"shh, it will be gone soon"/ |
11:55
π
|
Igloo^_^ |
(Just to make it clear we're not the internet archive, we're just a team who provide resource to capture the sites before they're gone forever) |
11:55
π
|
i336 |
I completely understand your position though. I only just learned the IA has 15PB total. |
11:55
π
|
i336 |
(right.) |
11:55
π
|
i336 |
although backblaze have 200pb |
11:57
π
|
Igloo^_^ |
Ok, So the archiving itself is trivial, we can break it down into a warrior project easy enough. It's just the two questions a) do we want it (probably - typical attitude is archive first questions later) b) where do we put it?! |
11:57
π
|
i336 |
I'm impressed you can make the warrior do ex.ua |
11:57
π
|
i336 |
lol |
11:57
π
|
Igloo^_^ |
If you're happy to hang around then i'm sure we'll get an answer soon enough |
11:57
π
|
i336 |
okay, sure. I might see if I can run irssi from my friend's PC for after I put my laptop to sleep |
11:58
π
|
Igloo^_^ |
The warrior just takes a string of URLs and follows a set pattern for data. Then syncs them somewhere |
11:58
π
|
i336 |
yes, but it's accessing ex.ua, a site known to be a really big file locker |
11:59
π
|
i336 |
things did change in 2010 though, it went ukraine-only, so perhaps most ISPs have forgotten about them. I just don't want to cause issues for residential customers |
12:00
π
|
Igloo^_^ |
This wouldn't be the first time. |
12:00
π
|
i336 |
that's reassuring :] |
12:00
π
|
i336 |
also - I'm actually in an annoying/amusing position atm - I was using a friend's VPS but after a bit of a disagreement with systemd and system administration I kinda got booted off. I'd likely have some sort of sketchy proof-of-concept crawler/indexer otherwise. |
12:01
π
|
i336 |
(I'm currently working on getting my own VPS, but due to some inane IRL issues that's taking a bit longer than I'd like.) |
12:01
π
|
Igloo^_^ |
Leave it with us :) |
12:01
π
|
i336 |
awesome to hear :D |
12:01
π
|
Igloo^_^ |
No promises |
12:02
π
|
i336 |
I understand |
12:02
π
|
i336 |
one thing I can recommend is, if you can, bulk-fetch a full dump of the web archive's crawls of ex.ua |
12:02
π
|
Sanqui |
i definitely recommend hanging around if you want the project to lift off the ground |
12:02
π
|
i336 |
okay, will do... it's 11:02PM right now, but I'll hang around for as long as I can |
12:02
π
|
i336 |
(thanks) |
12:02
π
|
Sanqui |
i mean, not right now, but in the long term :p |
12:02
π
|
i336 |
oh! okay |
12:03
π
|
i336 |
also, if you fetch the IA's copy of ex.ua, you can just `grep -o 'ex.ua/[0-9]\+'` from all of the HTML |
12:03
π
|
i336 |
that would very likely give you a very good idea of the upper bounds you're dealing with |
12:03
π
|
i336 |
although you could determine those easily enough |
12:04
π
|
i336 |
bruteforcing sounds easier though, since what's in IA is stable, and having it available doesn't provide all that much right now |
12:04
π
|
i336 |
also, if you can let me loose on a decent machine, I can help :D |
12:04
π
|
i336 |
(I'd be using my own VPS right now if I could) |
12:05
π
|
Igloo^_^ |
It's the storage, I could start grabbing it at a decent rate (i've just tested a few bits) but i need the disk space |
12:05
π
|
i336 |
hmm. |
12:05
π
|
i336 |
one of my friends has about 6TB free right now and I could borrow that, but that would be decidedly temporary |
12:06
π
|
i336 |
I'm very curious what would happen if the IA woke up tomorrow (in a manner of speaking) and found a few hundred TB missing... |
12:07
π
|
Sanqui |
i also recommend creating an article on the wiki |
12:07
π
|
Sanqui |
describe the project, structure, approximate size, possible action etc |
12:08
π
|
Sanqui |
basically what you did here, but in a more lasting place |
12:08
π
|
i336 |
okay then... |
12:08
π
|
i336 |
and link to it from the front page? |
12:08
π
|
|
Lord_Nigh has joined #archiveteam-bs |
12:09
π
|
Igloo^_^ |
Put it in the at risk bit |
12:09
π
|
Igloo^_^ |
yahoosucks is the secret =] |
12:09
π
|
i336 |
thanks! |
12:10
π
|
i336 |
should I title the page "ex.ua"? |
12:10
π
|
Igloo^_^ |
yup |
12:10
π
|
ranma |
ukraine, eh? |
12:10
π
|
i336 |
yup |
12:11
π
|
Sanqui |
it doesn't have to be tidy, but it's much better to have an information dump on the wiki than lost in irc logs :) |
12:11
π
|
i336 |
of course, doing that now. thanks for the directive |
12:16
π
|
Igloo^_^ |
No problem |
12:18
π
|
arkiver |
i336: IA has 32 PB |
12:19
π
|
|
vitzli has joined #archiveteam-bs |
12:22
π
|
i336 |
arkiver: oh okay! very interesting. that's awesome :) |
12:22
π
|
arkiver |
105 PB if you add free space, backups, other stuf |
12:23
π
|
arkiver |
stuff* |
12:23
π
|
arkiver |
I had a look at the site |
12:23
π
|
arkiver |
what day is it shutting down exactly? |
12:24
π
|
Igloo^_^ |
31st dec |
12:24
π
|
Igloo^_^ |
22 days |
12:24
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 250 seconds) |
12:26
π
|
arkiver |
We can do a project to at least archive the xspf files, so we have the metadata of what was on the site |
12:26
π
|
arkiver |
I'll also support the actual files, but we won't just be queueing whatever we can find due to the size |
12:27
π
|
arkiver |
the plan is that people can send lists of files they want to have preserved and then we'll preserve them |
12:28
π
|
|
Lord_Nigh has joined #archiveteam-bs |
12:29
π
|
i336 |
arkiver: thank you so much. the wiki page is 99% done |
12:32
π
|
i336 |
arkiver, Igloo^_^, ranma, Sanqui: http://www.archiveteam.org/index.php?title=Ex.ua |
12:32
π
|
i336 |
suggestions welcome |
12:33
π
|
i336 |
adding to front page now |
12:33
π
|
i336 |
cool |
12:33
π
|
i336 |
okay I can't edit the front page and I can't find anything named "at risk" |
12:33
π
|
i336 |
I think someone else needs to do that |
12:34
π
|
arkiver |
yeah, the front page isn't maintained very well |
12:34
π
|
i336 |
ah. okay |
12:34
π
|
Sanqui |
It's included from this page: http://www.archiveteam.org/index.php?title=Current_Projects |
12:34
π
|
i336 |
arkiver: also, another note - when the /get/ URLs give you the 301 Moved, the URL you get sent to can be sent HEAD and you get back a valid Content-Length. |
12:34
π
|
arkiver |
http://tracker.archiveteam.org gives a better idea of running/upcoming projects |
12:34
π
|
i336 |
thanks |
12:34
π
|
Sanqui |
but yeah, not maintained |
12:34
π
|
i336 |
right |
12:35
π
|
Sanqui |
I'll add a project template to the page |
12:35
π
|
i336 |
okay then |
12:37
π
|
arkiver |
I think the wikipage should be a little different |
12:37
π
|
arkiver |
I'll edit it a bit |
12:38
π
|
i336 |
sure thing - I have no idea how to wikicode, and I've no idea what I'm doing in this situation either |
12:39
π
|
i336 |
ooh, the page has an infobox :> |
12:40
π
|
i336 |
I'm wondering if there's anyone out there with a very large pile of diskspace who might not mind holding these kinds of files indefinitely. |
12:40
π
|
Igloo^_^ |
6PB is a very very very expensive setup |
12:41
π
|
i336 |
I can't deny that :/ |
12:41
π
|
|
JSharp___ has joined #archiveteam-bs |
12:44
π
|
i336 |
I think I might go out and look for disk space |
12:45
π
|
i336 |
should ex.ua have its own IRC channel? might be a tad early to ask and/or presumptive, but I figure I might as well ask in advance |
12:49
π
|
i336 |
also, do you think if I mention this on /r/datahoarder and /hoard/ ex.ua might get wind and kill the API access? |
12:49
π
|
i336 |
should we archive the XSPFs *then* see what we can get, or take the risk? |
12:50
π
|
i336 |
(of making a bit of noise, in the hope it gets us extra diskspace) |
12:58
π
|
trs80 |
grab the XSPFs first regardless. do they have size info? |
13:01
π
|
i336 |
trs80: unfortunately not - only URL and local filename :/ |
13:01
π
|
i336 |
I wish they did |
13:02
π
|
trs80 |
what about a HEAD on the content URLs (following the redirect if necessary)? |
13:03
π
|
trs80 |
Looks like HEAD includes Content-Length: |
13:04
π
|
i336 |
yup it does |
13:04
π
|
i336 |
but it requires HEAD on every URL listed in the XSPF |
13:04
π
|
i336 |
which is annoying |
13:04
π
|
trs80 |
yeah. maybe have it as a stage 1.5 grab? (normally stage 1 is grab lists of URLs, stage 2 is grab content) |
13:06
π
|
i336 |
that sounds interesting |
13:07
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
13:10
π
|
arkiver |
so let's do the XSPF URLs and the /get/ URLs without redirect |
13:10
π
|
arkiver |
err |
13:10
π
|
arkiver |
without the data they redirect to |
13:10
π
|
i336 |
right |
13:10
π
|
i336 |
I recommend HEADing the URLs the /get/ URLs point to - this will 1) give us file size and 2) identify dead links |
13:10
π
|
arkiver |
then, people can give us lists of /get/ URLs they want to have archived and we'll archive actual file for that too |
13:10
π
|
i336 |
right |
13:11
π
|
arkiver |
I changed the wikipage |
13:12
π
|
i336 |
right. that looks much more professional |
13:12
π
|
Sanqui |
...dutch? |
13:12
π
|
i336 |
I think the original text is russian, not du...what Sanqui said |
13:13
π
|
arkiver |
uh |
13:13
π
|
arkiver |
oops |
13:13
π
|
Sanqui |
I think the original wikipage had some useful info that's been omitted |
13:14
π
|
arkiver |
like how the XSPF file looked? |
13:15
π
|
Sanqui |
a lot - the fact that the site had a catalog that's been partly archived by IA, that the API isn't meant to be public, etc |
13:15
π
|
i336 |
it wasn't presented in the most professional style (I tend to turn into a pile of nerves in situations like this, I blame that) |
13:15
π
|
Sanqui |
history about the website (going ukraine only in 2010),... |
13:21
π
|
i336 |
THE SITE IS FULL OF TORRENTS |
13:21
π
|
i336 |
http://web.archive.org/web/20140912032528/http://www.ex.ua/80745202? |
13:22
π
|
i336 |
"Toppeht" (I can't type unicode, it looks like that) |
13:22
π
|
i336 |
curl -I http://www.ex.ua/torrent/125030659 |
13:22
π
|
i336 |
Location: http://fs61.www.ex.ua/torrent/beafc19e3c8a355ca8bd0401392649b6/125030659/48.%20Ori%20Uplift%20-%20Uplifting%20Only%20Fan%20Favorites%202013%202014%20(Continuous%20Dj%20Mix%20Part%202).flac.EX.UA.torrent |
13:22
π
|
Igloo^_^ |
I must say I love the enthusiasm i336 |
13:22
π
|
i336 |
we'd need to bruteforce that unfortunately, the XSPF files don't tell us |
13:23
π
|
i336 |
oh. I'm just freaking out about this thing going away in ~20 days |
13:23
π
|
i336 |
I wanna save the what.cd-like thing :P |
13:23
π
|
arkiver |
fixed. |
13:23
π
|
arkiver |
look at the wiki again |
13:23
π
|
arkiver |
also, let's create a channel for this |
13:23
π
|
Igloo^_^ |
#exexbaby ? |
13:24
π
|
arkiver |
yes |
13:24
π
|
i336 |
lol okay |
13:24
π
|
Igloo^_^ |
(stupid channel names are part of the woodwork here) |
13:24
π
|
i336 |
(right) |
13:37
π
|
|
RichardG_ has joined #archiveteam-bs |
13:38
π
|
|
RichardG has quit IRC (Ping timeout: 260 seconds) |
14:01
π
|
|
i336 has quit IRC (leaving) |
14:02
π
|
|
i336_ has joined #archiveteam-bs |
14:13
π
|
|
RichardG_ is now known as RichardG |
14:22
π
|
|
Lord_Nigh has joined #archiveteam-bs |
14:22
π
|
|
Boltsie has joined #archiveteam-bs |
14:32
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 244 seconds) |
14:39
π
|
|
Lord_Nigh has joined #archiveteam-bs |
15:08
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
15:13
π
|
godane |
i'm starting to upload The Portugal News pdf i got: https://archive.org/details/The_Portugal_News-898 |
15:13
π
|
godane |
*pdfs |
15:17
π
|
|
Lord_Nigh has joined #archiveteam-bs |
15:26
π
|
|
Flierp has quit IRC (Quit: Page closed) |
15:33
π
|
|
alembic has quit IRC () |
15:34
π
|
|
alembic has joined #archiveteam-bs |
15:41
π
|
|
Start has quit IRC (Quit: Disconnected.) |
15:54
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
16:04
π
|
|
Lord_Nigh has joined #archiveteam-bs |
17:32
π
|
tpw_rules |
https://web.archive.org/web/*/http://media.fukung.net/images/10174/* |
17:32
π
|
tpw_rules |
why is the 'to' date way before the 'from' date? and when i click on it it says the page is ont in the archive |
17:33
π
|
tpw_rules |
not |
17:34
π
|
xmc |
it was fetched exactly once, on 21-jun-2013, and yielded a 500-series error |
17:34
π
|
xmc |
https://web-beta.archive.org/web/*/http://media.fukung.net/images/10174/crackers.jpg |
17:34
π
|
tpw_rules |
ah okay |
17:34
π
|
xmc |
web-beta.archive.org handles these cases better than web.archive.org |
17:35
π
|
xmc |
they're in the middle of a big rewrite so, |
17:35
π
|
tpw_rules |
shit i bet all the pictures from that site are like that |
17:35
π
|
tpw_rules |
to prevent hotlinking |
17:35
π
|
xmc |
randomly selected https://web-beta.archive.org/web/20130703155351/http://media.fukung.net/images/10033/b0146902f9402a370a54792116702f75.gif |
17:35
π
|
tpw_rules |
okay so some work |
17:36
π
|
xmc |
https://web-beta.archive.org/web/*/http://media.fukung.net/images/* |
17:36
π
|
xmc |
oh huh, fukung.net is gone |
17:36
π
|
xmc |
damn |
17:36
π
|
tpw_rules |
yeah |
17:36
π
|
xmc |
i didn't know |
17:37
π
|
tpw_rules |
i have a collection of 400 or so pics from them i saved a while back |
17:38
π
|
|
vitzli has quit IRC (Leaving) |
17:43
π
|
godane |
i'm starting to upload more ABC RN Late Night Live |
17:47
π
|
|
HCross has quit IRC (Read error: Operation timed out) |
17:49
π
|
|
REiN^ has quit IRC (Max SendQ exceeded) |
17:50
π
|
|
REiN^ has joined #archiveteam-bs |
17:54
π
|
|
Mayonaise has quit IRC (Read error: Operation timed out) |
18:19
π
|
|
Aranje has joined #archiveteam-bs |
18:45
π
|
|
Lord_Nigh has quit IRC (Ping timeout: 250 seconds) |
18:56
π
|
|
Lord_Nigh has joined #archiveteam-bs |
19:04
π
|
|
nicolas17 has joined #archiveteam-bs |
19:28
π
|
|
Ravenloft has joined #archiveteam-bs |
19:36
π
|
godane |
i'm at 1018k items now |
19:46
π
|
|
Aranje has quit IRC (Ping timeout: 260 seconds) |
19:46
π
|
|
Start has joined #archiveteam-bs |
20:22
π
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
20:22
π
|
|
Lord_Nigh has joined #archiveteam-bs |
20:27
π
|
|
Specular has joined #archiveteam-bs |
20:29
π
|
Specular |
the person who runs archive.is seems pretty open via his informal Q&A on the blog. Think if anything happened to the site he'd be responsive to backup collabs? |
20:45
π
|
alembic |
I would think so |
20:53
π
|
|
VADemon has joined #archiveteam-bs |
20:55
π
|
|
ndiddy has joined #archiveteam-bs |
20:59
π
|
|
Start has quit IRC (Quit: Disconnected.) |
21:16
π
|
|
i336_ has quit IRC (Read error: Operation timed out) |
21:35
π
|
|
i336 has joined #archiveteam-bs |
21:56
π
|
|
Asparagir has quit IRC (Asparagir) |
22:05
π
|
|
Start has joined #archiveteam-bs |
22:29
π
|
|
Start has quit IRC (Quit: Disconnected.) |
22:36
π
|
|
GE has quit IRC (Remote host closed the connection) |
22:43
π
|
|
achip has quit IRC (Ping timeout: 255 seconds) |
22:47
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
22:48
π
|
|
achip has joined #archiveteam-bs |
22:52
π
|
|
BartoCH has joined #archiveteam-bs |
22:57
π
|
|
Yoshimura has quit IRC (Quit: quit) |
23:04
π
|
|
Yoshimura has joined #archiveteam-bs |
23:11
π
|
|
BlueMaxim has joined #archiveteam-bs |
23:19
π
|
|
Start has joined #archiveteam-bs |
23:28
π
|
|
i0npulse has quit IRC (Ping timeout: 244 seconds) |
23:28
π
|
|
i0npulse has joined #archiveteam-bs |
23:29
π
|
|
krazedkat has quit IRC (Ping timeout: 244 seconds) |
23:30
π
|
|
Specular has quit IRC (Ping timeout: 244 seconds) |
23:31
π
|
|
espes__ has quit IRC (Ping timeout: 244 seconds) |
23:35
π
|
|
i0npulse has quit IRC (Ping timeout: 244 seconds) |
23:35
π
|
|
i0npulse has joined #archiveteam-bs |
23:37
π
|
|
Specular has joined #archiveteam-bs |
23:37
π
|
|
closure has quit IRC (Ping timeout: 244 seconds) |
23:38
π
|
|
ravetcofx has joined #archiveteam-bs |
23:38
π
|
|
espes__ has joined #archiveteam-bs |
23:38
π
|
joepie91 |
Specular: iirc there have been problems with the guy before but I don't recall the specifics |
23:39
π
|
|
closure has joined #archiveteam-bs |
23:40
π
|
|
i0npulse has quit IRC (Ping timeout: 244 seconds) |
23:40
π
|
|
i0npulse has joined #archiveteam-bs |
23:41
π
|
|
useretail has quit IRC (Ping timeout: 244 seconds) |
23:44
π
|
|
bsmith093 has quit IRC (Ping timeout: 244 seconds) |
23:45
π
|
|
useretail has joined #archiveteam-bs |
23:45
π
|
joepie91 |
Specular: ref http://archiveteam.org/index.php?title=Archive.is |
23:47
π
|
|
pikhq_ has quit IRC (Ping timeout: 244 seconds) |
23:47
π
|
joepie91 |
SketchCow: I'm sure you'll love how Wikipedians here have wiki-linked "rage", "paranoia" and "kleptomania" in your quote: https://en.wikipedia.org/wiki/Archive_Team :P |
23:47
π
|
|
SketchCow has quit IRC (Ping timeout: 244 seconds) |
23:49
π
|
|
i336_ has joined #archiveteam-bs |
23:49
π
|
|
i0npulse has quit IRC (Ping timeout: 244 seconds) |
23:49
π
|
i336_ |
I understand some of the people here have access to crazy internet bandwidth. I currently have a very big use for a 5gbps or 10gbps link to run an indexer on what's left of ex.ua, which is disappearing for good in about 20 days. I'd need only a few GB of disk space as I would only be using this for indexing efforts and prototyping. |
23:49
π
|
|
pikhq has joined #archiveteam-bs |
23:49
π
|
i336_ |
if anyone can help please PM me |
23:49
π
|
|
i0npulse has joined #archiveteam-bs |
23:49
π
|
nicolas17 |
a DigitalOcean VPS maybe? |
23:49
π
|
|
bsmith093 has joined #archiveteam-bs |
23:49
π
|
|
Medowar0 has quit IRC (Ping timeout: 244 seconds) |
23:49
π
|
i336_ |
that sounds awesome, but I unfortunately can't waltz over to the site and grab one right now |
23:50
π
|
|
bwn has quit IRC (Ping timeout: 244 seconds) |
23:51
π
|
|
zenguy has quit IRC (Ping timeout: 244 seconds) |
23:51
π
|
|
Simpbrain has quit IRC (Ping timeout: 244 seconds) |
23:51
π
|
|
Frogging has quit IRC (Ping timeout: 244 seconds) |
23:53
π
|
i336_ |
yeah, really awesome. but also noteworthily expensive |
23:53
π
|
nicolas17 |
their pricing says how much data transfer you get on each droplet size... but afaik nothing actually happens when you reach the "limit" |
23:53
π
|
Kaz |
do you *need* 5-10gbps? |
23:53
π
|
Kaz |
like are you actually going to be able to saturate it |
23:53
π
|
i336_ |
well... tbh, I'm not honestly sure. |
23:53
π
|
i336_ |
further explanation: (details in #exexbaby) |
23:53
π
|
i336_ |
ex.ua was the 11th biggest site in ukraine. it was a file host with user accounts and public access. kind of like what.cd |
23:53
π
|
|
hook54321 has quit IRC (Ping timeout: 244 seconds) |
23:54
π
|
|
Medowar_ has joined #archiveteam-bs |
23:55
π
|
|
i0npulse has quit IRC (Ping timeout: 244 seconds) |
23:55
π
|
i336_ |
the archive team has a project to archive all the XSPF files that form the folder system on the site; these XSPF files are XML filelists |
23:55
π
|
joepie91 |
i336_: as in, you're *looking* for a 5gbps-10gbps uplink? |
23:56
π
|
|
dashcloud has quit IRC (Ping timeout: 244 seconds) |
23:56
π
|
|
purplebot has quit IRC (Ping timeout: 244 seconds) |
23:56
π
|
|
i0npulse has joined #archiveteam-bs |
23:57
π
|
|
purplebot has joined #archiveteam-bs |
23:57
π
|
joepie91 |
i336_: if yes, servercrate claims to offer 10gbps connectivity on their storage VPSes: https://servercrate.com/vps-storage |
23:57
π
|
|
closure has quit IRC (Ping timeout: 244 seconds) |
23:58
π
|
i336_ |
I'm not 100% sure. my reasoning is going along the lines of "this site is going down for good in 20 days, faster is better!" |
23:58
π
|
joepie91 |
I can't vouch for whether it actually does |
23:58
π
|
joepie91 |
I know that the network is not *great* but it works |
23:59
π
|
i336_ |
I see. |
23:59
π
|
joepie91 |
the problem is more likely to be traffic |
23:59
π
|
Kaz |
ah, so you want lots of 1gbps links |
23:59
π
|
joepie91 |
10gbps unmetered basically costs a fortune :P |
23:59
π
|
tapedrive |
I doubt very much that you'll actaully be able to use a connection that fast - the site will probably rate limit you. |
23:59
π
|
i336_ |
joepie91: good point. very good point |
23:59
π
|
Kaz |
unless there's one hell of a beefy machine pushing it |
23:59
π
|
i336_ |
tapedrive: also very good point |
23:59
π
|
i336_ |
Kaz: oh yeah |
23:59
π
|
* |
i336_ brb |
23:59
π
|
|
closure has joined #archiveteam-bs |
23:59
π
|
Kaz |
I think we can do that |
23:59
π
|
i336_ |
Kaz: you're right, I do want lots of 1gbps links |
23:59
π
|
i336_ |
! |
23:59
π
|
Kaz |
do you have code? |
23:59
π
|
|
Medowar_ has quit IRC (Ping timeout: 244 seconds) |