Time |
Nickname |
Message |
00:22
π
|
|
Start has quit IRC (Quit: Disconnected.) |
00:25
π
|
|
Start has joined #archiveteam-bs |
00:26
π
|
|
Medowar has quit IRC (Ping timeout: 244 seconds) |
00:29
π
|
|
Medowar has joined #archiveteam-bs |
00:40
π
|
|
ndozzle has joined #archiveteam-bs |
00:42
π
|
|
ndozzle is now known as ndiddy |
00:48
π
|
|
ndizzle has quit IRC (Read error: Operation timed out) |
01:53
π
|
|
kristian_ has quit IRC (Quit: Leaving) |
02:51
π
|
|
ndiddy has quit IRC (Read error: Connection reset by peer) |
03:37
π
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
03:46
π
|
|
ravetcofx has joined #archiveteam-bs |
04:07
π
|
|
BlueMaxim has joined #archiveteam-bs |
04:09
π
|
|
nightpool has joined #archiveteam-bs |
04:25
π
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
04:33
π
|
|
Sk1d has joined #archiveteam-bs |
05:20
π
|
|
Aranje has quit IRC (Quit: Three sheets to the wind) |
05:42
π
|
|
nightpool has quit IRC (what the water wants is hurricanes) |
05:43
π
|
|
nightpool has joined #archiveteam-bs |
06:08
π
|
|
nightpool has quit IRC (what the water wants is hurricanes) |
06:08
π
|
|
Start has quit IRC (Quit: Disconnected.) |
06:19
π
|
|
Start has joined #archiveteam-bs |
06:56
π
|
|
ravetcofx has quit IRC (Ping timeout: 506 seconds) |
08:00
π
|
|
GE has joined #archiveteam-bs |
08:50
π
|
|
GE has quit IRC (Remote host closed the connection) |
09:54
π
|
|
GE has joined #archiveteam-bs |
10:32
π
|
|
brayden_ has joined #archiveteam-bs |
10:32
π
|
|
swebb sets mode: +o brayden_ |
10:33
π
|
|
GE has quit IRC (Quit: zzz) |
10:37
π
|
|
brayden has quit IRC (Read error: Operation timed out) |
10:56
π
|
|
tfgbd_znc has quit IRC (Read error: Connection reset by peer) |
11:03
π
|
|
signius has quit IRC (Read error: Operation timed out) |
11:27
π
|
|
signius has joined #archiveteam-bs |
11:39
π
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
11:51
π
|
|
brayden_ has quit IRC (Read error: Operation timed out) |
11:54
π
|
|
kristian_ has joined #archiveteam-bs |
12:27
π
|
|
GE has joined #archiveteam-bs |
12:51
π
|
|
VADemon has joined #archiveteam-bs |
13:03
π
|
|
brayden_ has joined #archiveteam-bs |
13:03
π
|
|
swebb sets mode: +o brayden_ |
14:14
π
|
|
VADemon has quit IRC (left4dead) |
14:21
π
|
|
Stiletto has quit IRC (Read error: Connection reset by peer) |
14:21
π
|
|
Stiletto has joined #archiveteam-bs |
14:25
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
14:36
π
|
|
Start has quit IRC (Quit: Disconnected.) |
14:38
π
|
|
BartoCH has joined #archiveteam-bs |
14:50
π
|
|
nicba1010 has joined #archiveteam-bs |
14:50
π
|
nicba1010 |
joepie91: I got into python :) |
14:50
π
|
joepie91 |
uh oh :P |
14:51
π
|
nicba1010 |
I did some scraping scripts |
14:51
π
|
nicba1010 |
https://github.com/Nicba1010/Scraping-Scripts |
14:51
π
|
nicba1010 |
wall.alphacoders.com.py is my favourite I wanna make a torrent of all these wallpapers |
14:52
π
|
nicba1010 |
I'll gladly adapt them to the framework you guys have. Just need to figure it out |
14:52
π
|
nicba1010 |
I was in the process of making a server for concurrent file downloading but seems you guys already have it |
14:57
π
|
nicba1010 |
joepie91: BTW a guy donated 15 bucks to me for helping archive a website XD |
14:59
π
|
joepie91 |
nicba1010: did you read http://archiveteam.org/index.php?title=Dev yet? |
15:02
π
|
nicba1010 |
Woot thank you :) I will |
15:03
π
|
Aoede |
Poor will, getting pinged all the day |
15:05
π
|
nicba1010 |
joes name is will> |
15:05
π
|
nicba1010 |
?* |
15:06
π
|
Aoede |
No, some other user. My irc client highlights usernames |
15:06
π
|
Aoede |
So when someone types "will", I assume he gets pinged :P |
15:07
π
|
nicba1010 |
oh poor guy |
15:18
π
|
|
ravetcofx has joined #archiveteam-bs |
15:25
π
|
nicba1010 |
How, lets say legal would it be to archive wall.alphacoders.com |
15:28
π
|
xmc |
? |
15:28
π
|
nicba1010 |
If I made the script compatible with seesaw |
15:28
π
|
nicba1010 |
would it be archived |
15:29
π
|
Kaz |
if the content is at risk of disappearing, maybe |
15:29
π
|
nicba1010 |
Or do you just archive sites that are about to shut down |
15:30
π
|
nicba1010 |
I don't have any proof, I just like hoarding data |
15:30
π
|
nicba1010 |
They have a lot of great wallpapers |
15:32
π
|
nicba1010 |
And how come you dont use python 3? |
15:33
π
|
xmc |
how come you ask so many questions |
15:35
π
|
nicba1010 |
I'm at my college programming 1 class and I am reaaaaly bored |
15:36
π
|
xmc |
ok that explains a thing or two |
15:36
π
|
nicba1010 |
What are those things if you dont mind me asking? |
15:37
π
|
xmc |
hahahahaha |
15:37
π
|
nicba1010 |
I was hoping to get some criticism |
15:38
π
|
nicba1010 |
I'm quite accepting of it TBH. I like to be criticized, especially my code |
15:38
π
|
nicba1010 |
Or are you talking about this script pantyhoseplaza.com.py |
15:40
π
|
yipdw |
seesaw *does* work under Python 3, as does the warrior. the code behind the VM image supports both because Python 2 installs are still out in the wild |
15:40
π
|
nicba1010 |
Thx yipdw |
15:41
π
|
|
t2t2 has quit IRC (Read error: Operation timed out) |
15:42
π
|
yipdw |
if alphacoders isn't going anywhere they may or may not be too happy about many machines downloading things. it depends on how much load they can take and how much their admins watch over things |
15:43
π
|
xmc |
if it's not going away and it's just a bunch of wallpapers gathered from around the web, it's a pretty darn low priority for us |
15:43
π
|
yipdw |
but in general, if a place isn't going anywhere, pointing the warrior at it isn't a thing; we only did that with e.g. puu.sh and a few other places. nothing frequent |
15:43
π
|
xmc |
normally the only thing we point warrior at, that isn't going away, is url shorteners, because they're a big hole in the web |
15:44
π
|
nicba1010 |
Okay, I'll keep it archived on my end |
15:45
π
|
nicba1010 |
If it ever goes byebye I'll be here :P |
15:46
π
|
Kaz |
nicba1010: reminder that for archives to be useful for injecting into the wayback machine, you'll need to have the site saved as WARCs, not just downloading the images etc |
15:48
π
|
nicba1010 |
I'll make an effort on winter break. In the meanwhile I'm just gonna bundle it into a torrent |
15:49
π
|
|
t2t2 has joined #archiveteam-bs |
16:10
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
16:23
π
|
Medowar0 |
nicba1010: alphacoders is not on the verge of disappearing. I am one of the uploaders and sometimes in contact with David (site owner) and the site is doing great, he works and lives off of it fulltime. |
16:31
π
|
Kaz |
Medowar0: I guess it'd also be correct to assume that alphacoders does not hold the only copies of the images, too? |
16:31
π
|
Kaz |
So most of the data isn't at any real risk even if the site did go down |
16:32
π
|
|
BartoCH has joined #archiveteam-bs |
17:15
π
|
Medowar0 |
correct |
17:18
π
|
Medowar0 |
basically all is collected wallpapers from somewhere else. There is a separate collection of everything that is original content on the site, which may get archived, but for the rest, not really necessary |
17:36
π
|
godane |
i'm at 952k items |
17:37
π
|
|
Aoede has quit IRC (Quit: "reboot") |
17:38
π
|
JW_work |
nicba1010: if you are looking for things to do, #urlteam can certainly use your energy |
17:40
π
|
|
Aoede has joined #archiveteam-bs |
18:13
π
|
|
JW_work has quit IRC (Quit: Leaving.) |
18:27
π
|
|
ndiddy has joined #archiveteam-bs |
18:56
π
|
|
RichardG has quit IRC (Ping timeout: 250 seconds) |
19:00
π
|
|
RichardG has joined #archiveteam-bs |
19:31
π
|
|
GE has quit IRC (Ping timeout: 255 seconds) |
19:36
π
|
|
GE has joined #archiveteam-bs |
19:43
π
|
|
antomati_ is now known as antomatic |
20:03
π
|
|
VADemon has joined #archiveteam-bs |
20:18
π
|
|
altlabel_ has quit IRC (Quit: aaand it's gone) |
20:19
π
|
|
JW_work has joined #archiveteam-bs |
20:32
π
|
|
GE has quit IRC (Remote host closed the connection) |
20:49
π
|
godane |
so fake passport spam in the review of this item: https://archive.org/details/ERIC_ED497499 |
20:52
π
|
|
GE has joined #archiveteam-bs |
20:55
π
|
JW_work |
godane: send it to info@archive |
21:01
π
|
godane |
its sent now |
21:20
π
|
|
JW_work has quit IRC (Quit: Leaving.) |
21:23
π
|
|
JW_work has joined #archiveteam-bs |
21:27
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
22:04
π
|
Kaz |
'we add between 13 and 15 terabytes of content per day' |
22:04
π
|
Kaz |
from https://blog.archive.org/2016/10/25/20000-hard-drives-on-a-mission/ |
22:04
π
|
|
GE has quit IRC (Quit: zzz) |
22:05
π
|
Kaz |
so we're currently.. a large amount |
22:08
π
|
yipdw |
on a per-day basis, not too big |
22:24
π
|
|
BlueMaxim has joined #archiveteam-bs |
22:28
π
|
|
VADemon has quit IRC (Quit: left4dead) |
22:34
π
|
|
BartoCH has joined #archiveteam-bs |
22:50
π
|
xmc |
rebooting gitorious because it's been offline for apparently two weeks by my monitoring |
22:50
π
|
xmc |
this is why you don't want me as your sysadmin |
22:52
π
|
JW_work |
xmc: that's a good sign that demand is dropping off (which is expected, and welcomed) |
22:53
π
|
JW_work |
as long as there's a copy stored elsewhere (which there is, right)? 2 weeks downtime is probably fine. |
22:53
π
|
xmc |
yes, i've given a dd image to the EU software preservation people |
22:54
π
|
xmc |
and i keep meaning to make a script to, in some reasonable manner, split it into smaller chunks and put into archive.org |
22:58
π
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
22:58
π
|
JW_work |
nicba1010: that's something you might try and work on, maybe |
23:01
π
|
|
BartoCH has joined #archiveteam-bs |
23:01
π
|
JW_work |
If you divided it up into 1000 project chunks, that would average out to about 42 GB per chunk and 120 chunks… |
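A rough sketch of the arithmetic in the message above. The totals used here (about 120,000 projects and about 5,040 GB) are assumptions back-calculated from the "42 GB per chunk" and "120 chunks" figures; they are not stated in the log. The function names are hypothetical, and nothing here reads the actual disk image.

```python
import math


def chunk_plan(total_projects, total_bytes, projects_per_chunk):
    """Return (number_of_chunks, average_bytes_per_chunk).

    Assumes projects are spread evenly, which real repositories will not be;
    this is only the back-of-the-envelope average.
    """
    chunks = math.ceil(total_projects / projects_per_chunk)
    return chunks, total_bytes / chunks


def split_into_chunks(items, size):
    """Split a list of project names into consecutive groups of `size`."""
    return [items[i:i + size] for i in range(0, len(items), size)]


# Assumed totals (hypothetical, inferred from the figures in the chat).
ASSUMED_PROJECTS = 120_000
ASSUMED_TOTAL_BYTES = 5_040 * 10**9  # about 5 TB, decimal gigabytes

chunks, avg_bytes = chunk_plan(ASSUMED_PROJECTS, ASSUMED_TOTAL_BYTES, 1000)
print(chunks, "chunks, about", avg_bytes / 10**9, "GB each")

# Grouping a small made-up project list the same way.
groups = split_into_chunks(["proj-%d" % n for n in range(2500)], 1000)
print([len(g) for g in groups])
```

Under those assumed totals, the plan comes out to 120 chunks averaging 42 GB, matching the figures quoted in the chat; actual chunk sizes would vary with repository size.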
23:02
π
|
JW_work |
"I want to thank the small and extremely hard-working individuals at Internet Archive who maintain and evolve the compute and storage infrastructure that enables us to pursue our mission and service our patrons." - IA's servers are maintained by gnomes! I knew it! |
23:04
π
|
|
nicba1010 has quit IRC (Ping timeout: 268 seconds) |
23:33
π
|
|
Start has joined #archiveteam-bs |