Time |
Nickname |
Message |
00:44
🔗
|
|
kristian_ has quit IRC (Quit: Leaving) |
00:47
🔗
|
|
Simpbrain has quit IRC (Remote host closed the connection) |
00:53
🔗
|
|
BlueMaxim has joined #archiveteam |
00:55
🔗
|
|
Feld0 has quit IRC (Quit: Pixel Wavelength is best pony.) |
01:13
🔗
|
|
Feld0 has joined #archiveteam |
01:16
🔗
|
|
QBcrusher has quit IRC (Ping timeout: 244 seconds) |
01:20
🔗
|
|
tfgbd_znc has joined #archiveteam |
01:21
🔗
|
|
tfgbd_znc has quit IRC (Client Quit) |
01:25
🔗
|
|
tfgbd_znc has joined #archiveteam |
01:38
🔗
|
|
kyounko|2 has joined #archiveteam |
01:40
🔗
|
|
kyounko has quit IRC (Ping timeout: 260 seconds) |
02:02
🔗
|
|
yan has quit IRC (Read error: Operation timed out) |
02:12
🔗
|
|
yan has joined #archiveteam |
02:18
🔗
|
|
DiscantX has quit IRC (Read error: Operation timed out) |
02:34
🔗
|
|
ndiddy has quit IRC (Read error: Connection reset by peer) |
03:14
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
03:16
🔗
|
|
BartoCH has joined #archiveteam |
03:21
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
03:36
🔗
|
|
VADemon has quit IRC (Read error: Operation timed out) |
03:49
🔗
|
|
BartoCH has joined #archiveteam |
03:55
🔗
|
|
brayden has quit IRC (Read error: Connection reset by peer) |
03:56
🔗
|
|
brayden has joined #archiveteam |
04:33
🔗
|
|
jrwr has quit IRC (Remote host closed the connection) |
04:33
🔗
|
|
jrwr has joined #archiveteam |
04:36
🔗
|
|
i336__ has quit IRC (Read error: Operation timed out) |
04:51
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
04:54
🔗
|
|
Start has joined #archiveteam |
05:02
🔗
|
|
jrwr has quit IRC (Remote host closed the connection) |
05:04
🔗
|
|
Sk1d has quit IRC (Ping timeout: 250 seconds) |
05:11
🔗
|
|
Sk1d has joined #archiveteam |
05:37
🔗
|
|
Honno has joined #archiveteam |
06:33
🔗
|
|
pizzaiolo has quit IRC (Remote host closed the connection) |
06:42
🔗
|
|
Meeh has joined #archiveteam |
06:42
🔗
|
|
Ymgve has joined #archiveteam |
06:42
🔗
|
|
Jogie has joined #archiveteam |
06:42
🔗
|
|
alfie has joined #archiveteam |
06:42
🔗
|
|
PurpleSym has joined #archiveteam |
06:42
🔗
|
|
toddf has joined #archiveteam |
06:42
🔗
|
|
altlabel has joined #archiveteam |
06:42
🔗
|
|
PotcFdk has joined #archiveteam |
06:42
🔗
|
|
irc.homelien.no sets mode: +o altlabel |
06:50
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
06:51
🔗
|
|
buchno has joined #archiveteam |
06:54
🔗
|
buchno |
Wanted to notify that the popular Swedish movie rating website and community Filmtipset.se is scheduled to close 2017-03-31 unless someone else volunteers to maintain it. |
06:57
🔗
|
|
DiscantX has joined #archiveteam |
07:02
🔗
|
|
buchno has quit IRC (Quit: Page closed) |
07:15
🔗
|
|
Aranje has quit IRC (Quit: Three sheets to the wind) |
07:29
🔗
|
|
maelstrom has quit IRC (Quit: Leaving) |
08:19
🔗
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
08:24
🔗
|
|
ravetcofx has joined #archiveteam |
08:42
🔗
|
|
Rondom has quit IRC (Remote host closed the connection) |
08:43
🔗
|
|
Rondom has joined #archiveteam |
08:44
🔗
|
|
midas1 has quit IRC (Ping timeout: 250 seconds) |
08:48
🔗
|
|
ravetcofx has quit IRC (Read error: Operation timed out) |
08:53
🔗
|
|
fie has quit IRC (Read error: Connection reset by peer) |
08:54
🔗
|
|
atomotic has joined #archiveteam |
09:05
🔗
|
|
Honno has quit IRC (Ping timeout: 370 seconds) |
09:09
🔗
|
|
midas1 has joined #archiveteam |
09:15
🔗
|
|
fie has joined #archiveteam |
09:20
🔗
|
|
Rondom has quit IRC (Remote host closed the connection) |
09:20
🔗
|
|
Rondom has joined #archiveteam |
09:45
🔗
|
|
Igloo has quit IRC (Read error: Operation timed out) |
09:46
🔗
|
|
PepsiMax has quit IRC (Ping timeout: 244 seconds) |
09:51
🔗
|
|
Igloo has joined #archiveteam |
09:52
🔗
|
|
PepsiMax has joined #archiveteam |
10:12
🔗
|
|
QBcrusher has joined #archiveteam |
10:41
🔗
|
|
victor has quit IRC (Ping timeout: 260 seconds) |
10:53
🔗
|
|
victor has joined #archiveteam |
10:54
🔗
|
|
fie has quit IRC (Ping timeout: 245 seconds) |
11:05
🔗
|
|
i336__ has joined #archiveteam |
11:36
🔗
|
|
Honno has joined #archiveteam |
12:25
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
13:35
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
13:46
🔗
|
|
pizzaiolo has joined #archiveteam |
13:59
🔗
|
|
atomotic has joined #archiveteam |
14:05
🔗
|
|
sep332_ has joined #archiveteam |
14:05
🔗
|
|
sep332_ is now known as sep332 |
14:09
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
14:32
🔗
|
|
kristian_ has joined #archiveteam |
14:56
🔗
|
|
Start has joined #archiveteam |
14:57
🔗
|
|
Start has quit IRC (Client Quit) |
14:58
🔗
|
|
i336__ has quit IRC (Read error: Operation timed out) |
16:00
🔗
|
|
Honno has quit IRC (Ping timeout: 370 seconds) |
16:04
🔗
|
|
Aranje has joined #archiveteam |
16:04
🔗
|
|
stefan_ has joined #archiveteam |
16:06
🔗
|
stefan_ |
Hi! The archives on https://gitorious.org/ are giving 404 errors. Is this a known problem? |
16:14
🔗
|
rocode |
chronomex --^ |
16:25
🔗
|
|
Honno has joined #archiveteam |
16:36
🔗
|
|
edsu has quit IRC (Quit: leaving) |
16:39
🔗
|
|
rocode has quit IRC (Read error: Operation timed out) |
16:52
🔗
|
Kaz |
xmc* ^ |
16:54
🔗
|
|
yan has quit IRC (Ping timeout: 506 seconds) |
17:04
🔗
|
|
DiscantX has quit IRC (Ping timeout: 244 seconds) |
17:07
🔗
|
|
yan has joined #archiveteam |
17:32
🔗
|
|
vitzli has joined #archiveteam |
17:47
🔗
|
|
fie has joined #archiveteam |
17:49
🔗
|
|
schbirid has joined #archiveteam |
17:49
🔗
|
xmc |
stefan_ Kaz thanks for the heads-up, i'll poke it later today |
17:53
🔗
|
|
kristian_ has quit IRC (Quit: Leaving) |
18:04
🔗
|
|
Pudsey has joined #archiveteam |
18:18
🔗
|
|
Pudsey has quit IRC (Remote host closed the connection) |
18:21
🔗
|
|
stefan_ has quit IRC (Quit: Page closed) |
19:01
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
19:04
🔗
|
|
nwf has joined #archiveteam |
19:10
🔗
|
|
nwf__ has quit IRC (Read error: Operation timed out) |
19:15
🔗
|
|
Yoshimura has joined #archiveteam |
19:17
🔗
|
|
rocode has joined #archiveteam |
19:19
🔗
|
|
SketchCow has quit IRC (Read error: Connection reset by peer) |
19:19
🔗
|
|
SketchCow has joined #archiveteam |
19:45
🔗
|
|
atomotic has joined #archiveteam |
19:52
🔗
|
|
pizzaiol1 has joined #archiveteam |
19:55
🔗
|
|
pizzaiolo has quit IRC (Read error: Operation timed out) |
20:22
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
20:41
🔗
|
|
maelstrom has joined #archiveteam |
21:01
🔗
|
|
pizzaiol1 has quit IRC (Ping timeout: 250 seconds) |
21:15
🔗
|
|
pizzaiolo has joined #archiveteam |
21:59
🔗
|
|
kristian_ has joined #archiveteam |
21:59
🔗
|
|
sep332_ has joined #archiveteam |
22:00
🔗
|
|
sep332 has quit IRC (Read error: Operation timed out) |
22:02
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
22:07
🔗
|
|
Honno has quit IRC (Ping timeout: 370 seconds) |
22:07
🔗
|
|
ndiddy has joined #archiveteam |
22:50
🔗
|
|
FrankJ has joined #archiveteam |
22:55
🔗
|
FrankJ |
Hello, I have a question. I have downloaded a WARC file for the Hyves Archive project team. The WARC file is around 50GB in size. I've extracted the WARC.gz file with 7-zip and now I'm looking for tools to extract the information inside the WARC files (found the wiki page about the WARC ecosystem). I hope there are tools that can simulate the websites inside the WARC file in a browser. I've found the webarchiveplayer, but that program crashed |
22:56
🔗
|
wp494 |
IT'S! FUCKING! HAPPENING! |
22:56
🔗
|
wp494 |
https://twitter.com/TheRegister/status/818592023452057600 |
22:58
🔗
|
yipdw |
FrankJ: what's the crash? |
22:59
🔗
|
yipdw |
(FYI, you don't need to gunzip the WARC -- webarchiveplayer will handle that) |
23:02
🔗
|
rocode |
Fuck me. Yahoo just Yahoo'd. |
23:02
🔗
|
xmc |
^ |
23:02
🔗
|
FrankJ |
Ah okay. The crash is that it takes around an hour (or a few hours) and then the process is completely gone from the task manager. It's running on a Windows Server 2016 machine with 10GB of RAM. With smaller WARC archive files, it works fine. |
23:03
🔗
|
yipdw |
you might have better luck with the gzipped version |
23:03
🔗
|
yipdw |
if you can get a crash log, that'd help -- I'm not sure where that'd come from on Windows, unfortunately |
23:03
🔗
|
xmc |
yeah, leave it gzipped |
23:04
🔗
|
FrankJ |
I've tried the gz version before, but it's the same thing... I think webarchiveplayer can't handle the 50GB files? Or is that not the reason? |
23:05
🔗
|
FrankJ |
Its memory usage grows very slowly the whole time before the process is gone |
23:05
🔗
|
yipdw |
I guess it depends on the crash reason |
23:05
🔗
|
yipdw |
I can't remember if the WARC index for webarchiveplayer is in-memory or on-disk |
23:06
🔗
|
FrankJ |
Ah okay, I think it's not safed on the disk. It's reading the WARC archive at 12MB/s and it's cached into memory.... |
23:06
🔗
|
FrankJ |
saved* |
23:06
🔗
|
yipdw |
code says it's on-disk |
23:07
🔗
|
yipdw |
must be something else going on |
23:07
🔗
|
yipdw |
which Hyves archive? |
23:08
🔗
|
FrankJ |
20131122005844 |
23:11
🔗
|
FrankJ |
Also tried on my laptop with W10 and 8GB of RAM and enough storage, but the same happened.. I have downloaded the file again and tried another Hyves archive... and both the .gz and the extracted WARC |
23:13
🔗
|
arkiver |
let's move this to #archiveteam-bs |
23:14
🔗
|
arkiver |
This channel is more for announcements |
23:14
🔗
|
|
kristian_ has quit IRC (Quit: Leaving) |
23:14
🔗
|
FrankJ |
Ah okay, thx |
23:22
🔗
|
|
nicolas17 has joined #archiveteam |
23:36
🔗
|
wp494 |
here's a hot announcement: |
23:36
🔗
|
wp494 |
[16:56:37] <wp494> IT'S! FUCKING! HAPPENING! |
23:36
🔗
|
wp494 |
[16:56:37] <wp494> https://twitter.com/TheRegister/status/818592023452057600 |
23:37
🔗
|
arkiver |
<wp494>[16:56:37] <wp494> IT'S! FUCKING! HAPPENING! |
23:37
🔗
|
arkiver |
<wp494>[16:56:37] <wp494> https://twitter.com/TheRegister/status/818592023452057600 |
23:37
🔗
|
arkiver |
superhot |
23:37
🔗
|
rocode |
Not enough sirens and lingering shots on war torn landscape tbh |
23:37
🔗
|
nicolas17 |
"I thought the name was the only thing nearing an asset." lol |