#archiveteam-bs 2017-06-10,Sat

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***Stiletto has quit IRC () [00:47]
..... (idle for 20mn)
arkiverthanks chfoo yipdw [01:07]
***j08nY has quit IRC (Remote host closed the connection)
BlueMaxim has joined #archiveteam-bs
Stilett0 has joined #archiveteam-bs
Stilett0 has quit IRC (Client Quit)
Stilett0 has joined #archiveteam-bs
Stilett0 is now known as Stiletto
ndiddy has quit IRC ()
[01:10]
ndiddy has joined #archiveteam-bs
schbirid2 has joined #archiveteam-bs
schbirid has quit IRC (Read error: Operation timed out)
[01:28]
REiN^ has quit IRC (Max SendQ exceeded)
BlueMaxim has quit IRC (Read error: Operation timed out)
REiN^ has joined #archiveteam-bs
[01:46]
...... (idle for 25mn)
Stiletto has quit IRC (Read error: Operation timed out)
Stilett0 has joined #archiveteam-bs
[02:12]
.... (idle for 16mn)
SketchCowOK, I'm fixing all the New Computer Express stuff the guy's uploading.
https://archive.org/details/NewComputerExpress000 will be the first one that finishes, I bet
[02:28]
***pizzaiolo has quit IRC (Quit: pizzaiolo) [02:34]
......... (idle for 44mn)
BlueMaxim has joined #archiveteam-bs [03:18]
............... (idle for 1h13mn)
Odd0002 has quit IRC (Remote host closed the connection) [04:31]
...... (idle for 25mn)
Sk1d has quit IRC (Ping timeout: 250 seconds) [04:56]
Sk1d has joined #archiveteam-bs [05:03]
................ (idle for 1h19mn)
fie has quit IRC (Ping timeout: 506 seconds) [06:22]
fie has joined #archiveteam-bs [06:31]
powerArch has quit IRC (Remote host closed the connection) [06:37]
vitzli has joined #archiveteam-bs [06:48]
................. (idle for 1h21mn)
Honno has joined #archiveteam-bs
vitzli has quit IRC (Quit: Leaving)
[08:09]
godanei'm asking the retromags people if they will look at the New Computer Express on IA
i figure they would make a edit version of it and also a smaller size cbz for there site
[08:18]
............ (idle for 57mn)
***SHODAN_UI has joined #archiveteam-bs [09:16]
........ (idle for 39mn)
godaneSketchCow: the pdf derive sucks ass on this one: https://archive.org/details/TNM_The_Apple_Collection_Catalog
there is like no text in the pdf at all
the only derive that works well is normal the jp2.zip files
[09:55]
.... (idle for 18mn)
***j08nY has joined #archiveteam-bs [10:14]
......... (idle for 42mn)
BlueMaxim has quit IRC (Read error: Operation timed out)
BlueMaxim has joined #archiveteam-bs
[10:56]
fie has quit IRC (Quit: Leaving) [11:02]
fie has joined #archiveteam-bs [11:16]
..... (idle for 23mn)
SHODAN_UI has quit IRC (Remote host closed the connection) [11:39]
....... (idle for 34mn)
robinak has joined #archiveteam-bs
robink has quit IRC (Read error: Connection reset by peer)
[12:13]
.......... (idle for 48mn)
C4K3 has quit IRC (Quit: leaving)
C4K3 has joined #archiveteam-bs
BlueMaxim has quit IRC (Read error: Connection reset by peer)
[13:01]
...... (idle for 28mn)
fie has quit IRC (Read error: Operation timed out)
yuitimoth has quit IRC (Remote host closed the connection)
yuitimoth has joined #archiveteam-bs
[13:32]
SHODAN_UI has joined #archiveteam-bs
j08nY has quit IRC (Read error: Operation timed out)
[13:40]
...... (idle for 27mn)
j08nY has joined #archiveteam-bs [14:09]
............. (idle for 1h0mn)
pizzaiolo has joined #archiveteam-bs [15:09]
..... (idle for 21mn)
rocodeWhat is a good upload rate limit on uploading to IA? I noticed some of my grabs are offline, so I need to get them on IA before I get hit by a bus, but I don't want to overload the server with concurrent uploads. Is there a guideline? [15:30]
........... (idle for 52mn)
Kazif you use the s3 interface, it'll throw 429's at you if you go too fast
as for bandwidth, I'm sure newsgrabber was sending in the 500-700mbit/s at it pretty constantly
[16:22]
***dashcloud has joined #archiveteam-bs
godane has quit IRC (Read error: Operation timed out)
[16:29]
JAAI'm grabbing Tanobb now. It's quite slow, probably at least partially because their servers are located in Japan, but well, I'll try to get as much as possible before they shut down. [16:37]
***godane has joined #archiveteam-bs [16:42]
...... (idle for 28mn)
godane has quit IRC (Quit: Leaving.) [17:10]
...... (idle for 27mn)
TheLovina has joined #archiveteam-bs [17:37]
..... (idle for 23mn)
RichardG has quit IRC (Read error: Connection reset by peer) [18:00]
.... (idle for 17mn)
RichardG has joined #archiveteam-bs [18:17]
...... (idle for 25mn)
godane has joined #archiveteam-bs [18:42]
.... (idle for 15mn)
pizzaiolo has quit IRC (Read error: Operation timed out)
Florian_ has joined #archiveteam-bs
Florian_ has quit IRC (Client Quit)
[18:57]
SHODAN_UI has quit IRC (Remote host closed the connection) [19:05]
...... (idle for 27mn)
jrwr has joined #archiveteam-bs [19:32]
..... (idle for 21mn)
odemgtimmc, Kaz as ero was built primarily for reddit we can assume that a very high percentage of the links were posted on reddit, this allows us to grep datasets like this one - http://files.pushshift.io/reddit/ - for eroshare links and download them [19:53]
Kazyeah, that should make it a ton easier to work through [19:54]
odemgshould I get to work pulling the links out of that data? [19:54]
Kazonly seems to go up to april though
probbaly not a bad idea to start working it through, if we do end up grabbing
[19:54]
odemgwe can worry about that afterwards, ero was born when? I know it's only bee around a few years at this point so no need to grab earlier files [19:55]
Kazyou probably know better than me when ero started.. :)
domain is 10 years old
highly doubt it's been used for that long, somehow.
[19:56]
odemg.... whois on the domain say its was regged on 2006-11-30
wut
[19:57]
Kazactually, #nofap is probably the best place if we're actually going to start grabbing [19:57]
***ZexaronS has joined #archiveteam-bs [20:02]
odemgKaz, explain? [20:02]
KazProject channel [20:04]
jrwrI got my Archive Team Warrior Stickers in
they are nice!
[20:06]
***wp494 has quit IRC (Read error: Operation timed out) [20:10]
icedice has joined #archiveteam-bs [20:17]
timmcodemg: Ah, interesting. [20:20]
jrwrhttps://goo.gl/photos/MnPQMLAe1ixzjedV9
Works very well on my mug
[20:21]
***pizzaiolo has joined #archiveteam-bs [20:26]
.... (idle for 16mn)
Pudsey has joined #archiveteam-bs [20:42]
.... (idle for 16mn)
SHODAN_UI has joined #archiveteam-bs [20:58]
pizzaiolo has quit IRC (Ping timeout: 506 seconds) [21:07]
icedice has quit IRC (Quit: Leaving) [21:20]
pizzaiolo has joined #archiveteam-bs
SilSte has quit IRC (Remote host closed the connection)
SilSte has joined #archiveteam-bs
Pudsey has quit IRC (Remote host closed the connection)
[21:28]
jrwrSilSte: Howd [21:40]
***Silas has joined #archiveteam-bs [21:40]
jrwrSo Silas
can you hover over the invaild config icon at the bottom
what does it say
[21:40]
***pizzaiolo has quit IRC (Read error: Operation timed out) [21:41]
Kazjrwr: re cygwin port, I guess it's a good idea if it's stable and works [21:41]
Silasits just a warning about there not being enough video memory to go into fullscreen or seamless [21:41]
jrwrOk [21:41]
Kazthat said, WSL might be better? albeit only win10 support [21:41]
jrwrdamn, that should boot then [21:41]
***pizzaiolo has joined #archiveteam-bs [21:41]
jrwrKaz: true but since its pretty simple on the scripts, and would want it to even work on win7/win8
make a little installer + some shortcuts to some scripts to turn off the warrior and such
[21:42]
Silasi should check my bios to make sure vt-x is enabled in the first place [21:43]
jrwrthe processor doesn't support it [21:43]
Silasoh
:/
[21:43]
jrwrVMware player might work in the this case, it can import the OVA
Or
a 2.99 Euro a month Virtual Machine Instance at scaleway works very well
[21:44]
Silasill try out vmware player [21:44]
jrwrLet us know how it goes, ill be here all night
Kaz: if we keep the docker support, Ill work on a installer for it, wont be too hard at all. ill do some testing today on it, I'm pretty bored right now
[21:45]
***Odd0002 has joined #archiveteam-bs [21:54]
JAASeriously though, the warrior is a ridiculous security risk (which is why I'd never let it anywhere near my machines). I'm not sure about the Docker container, though based on a quick look it seems to be based on a 3-month-old version of phusion/baseimage and probably also has some security issues. [21:56]
SilasI got the warrior running in VMWare Player and set to run whatever ArchiveTeam's Choice is, everything looks good!
Thanks jrwr
[21:57]
jrwrAwesome!
jrwr: Ill have the cygwin version only listen on 127.0.0.1
[21:57]
Odd0002did you just talk to yourself? [21:58]
jrwrI did
I ment JAA
[21:58]
Odd0002ah [21:58]
JAAWell, it still needs an internet connection... [21:58]
jrwrYa
and it is running random code from the internet
:)
[21:58]
JAATrue. It also auto-updates scripts and fetches the wget-lua source via HTTP without a checksum check. :-| (Ping JensRex, you implemented that, didn't you? What happened with that?)
As I said, ridiculous security risk.
[22:00]
jrwrit is a botnet after all [22:01]
JAAThat's true, but it could be improved a lot. [22:02]
jrwrYes
I've though about doing it, making a new Virtual Machine, but I think that should be up to the project managers, as I have no say around here
[22:02]
***decay has quit IRC (Read error: Operation timed out)
decay has joined #archiveteam-bs
[22:03]
jrwrwow, that was easy.. almost too easy [22:12]
***Silas has quit IRC (Quit: help)
Silas has joined #archiveteam-bs
Silas has quit IRC (Client Quit)
[22:21]
..... (idle for 21mn)
DFJustinwe've had problems with people using cygwin in the past because of the case-insensitive filesystem and certain filenames being illegal on windows
that's one of the reasons the warrior was created in the first place
to have a consistent no-surprises environment
[22:45]
***Ravenloft has joined #archiveteam-bs [22:47]
Ravenlofthttp://www.os2museum.com/wp/rich-heimlichs-patch-set-overview/ [22:48]
DFJustinif you're not willing to run the warrior and you don't have access to a linux system then it's better to just let someone else do it rather than cause problems [22:48]
***SHODAN_UI has quit IRC (Remote host closed the connection) [22:48]
DFJustinwe usually don't have a shortage of warriors running [22:48]
jrwrThis is true
also wget-lua hates cygwin atm
jrwr is just trying to be helpful
[22:50]
DFJustinyeah it's just that if it creates a mess and an admin has to clean it up that far outweighs the benefit of 1 more client running
usually the rate limiter is the site being archived or our staging servers rather than number of people crawling
[22:55]
***Silas has joined #archiveteam-bs
Ravenloft has quit IRC ()
Ravenloft has joined #archiveteam-bs
[22:56]
jrwrYa i provided a staging server for pixiv
it was crazy the amount of traffic inbound I was getting, poor FOS just cant keep up
[23:09]
joepie91jrwr: there's not really such a thing as "the project managers" :P
stuff gets done when somebody decides to do it
[23:10]
Silasi love the fact there's a leaderboard
im a sucker for stat tracking lol
[23:11]
jrwrIts a loose collection of people, There are people with access to infra and they tend to help when a need arises, those are the true "managers" of AT
like I would save arkiver has been a amazing "manager" of savepixiv
[23:12]
Ravenloftthats what she said [23:14]
***j08nY has quit IRC (Remote host closed the connection)
j08nY has joined #archiveteam-bs
[23:19]
..... (idle for 20mn)
JAAWhat the hell, wpull is suddenly ignoring my --reject-regex options. O.o [23:40]
***Silas has quit IRC (Quit: Page closed) [23:46]
JAAOk, looks like it never worked, but now I seriously wonder why.
Ooh, you can only have one --reject-regex option. Well, that's... unexpected.
[23:48]
***Honno has quit IRC (Read error: Operation timed out) [23:59]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)