Time |
Nickname |
Message |
00:24
π
|
|
DarkStar1 has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) |
00:34
π
|
|
kbtoo_ has quit IRC (Read error: Connection reset by peer) |
00:36
π
|
|
kbtoo has joined #archiveteam-bs |
00:40
π
|
|
Darkstar has joined #archiveteam-bs |
00:40
π
|
|
wp494 has quit IRC (Read error: Operation timed out) |
00:40
π
|
|
wp494 has joined #archiveteam-bs |
00:52
π
|
|
julientm has joined #archiveteam-bs |
01:47
π
|
|
Despatche has joined #archiveteam-bs |
01:59
π
|
|
Exairnous has quit IRC (Ping timeout: 246 seconds) |
02:03
π
|
|
fuzy802 has joined #archiveteam-bs |
02:06
π
|
|
fuzzy8021 has quit IRC (Ping timeout: 252 seconds) |
02:13
π
|
|
fuzy802 is now known as fuzzy8021 |
02:14
π
|
|
SimpBrain has quit IRC (Read error: Connection reset by peer) |
02:14
π
|
|
SimpBrain has joined #archiveteam-bs |
02:28
π
|
|
user3434 has joined #archiveteam-bs |
02:32
π
|
|
user3434 has quit IRC (Quit: Page closed) |
02:36
π
|
|
PhrackD has quit IRC (Read error: Connection reset by peer) |
02:41
π
|
|
PhrackD has joined #archiveteam-bs |
02:44
π
|
|
Odd0002_ has joined #archiveteam-bs |
02:48
π
|
|
Odd0002 has quit IRC (Ping timeout: 615 seconds) |
02:48
π
|
|
Odd0002_ is now known as Odd0002 |
03:02
π
|
|
SimpBrain has quit IRC (Read error: Connection reset by peer) |
03:02
π
|
|
SimpBrain has joined #archiveteam-bs |
03:41
π
|
|
julientm has quit IRC (Ping timeout: 615 seconds) |
03:42
π
|
|
Despatche has quit IRC (Remote host closed the connection) |
04:00
π
|
|
Stiletto has quit IRC (Ping timeout: 360 seconds) |
04:00
π
|
|
Stilett0 has joined #archiveteam-bs |
04:17
π
|
|
julientm has joined #archiveteam-bs |
04:37
π
|
|
qw3rty117 has joined #archiveteam-bs |
04:39
π
|
|
robogoat has quit IRC (Read error: Operation timed out) |
04:43
π
|
|
qw3rty116 has quit IRC (Read error: Operation timed out) |
04:46
π
|
|
robogoat has joined #archiveteam-bs |
04:55
π
|
|
odemgi has quit IRC (Ping timeout: 246 seconds) |
04:56
π
|
|
ndiddy has quit IRC () |
04:58
π
|
|
odemgi has joined #archiveteam-bs |
05:01
π
|
|
odemg has quit IRC (Ping timeout: 615 seconds) |
05:08
π
|
|
odemg has joined #archiveteam-bs |
05:43
π
|
|
Exairnous has joined #archiveteam-bs |
05:53
π
|
|
bsmith093 has quit IRC (Ping timeout: 616 seconds) |
05:53
π
|
|
julientm has quit IRC (Remote host closed the connection) |
05:55
π
|
|
bsmith093 has joined #archiveteam-bs |
05:59
π
|
|
kyonko has quit IRC (Read error: Connection reset by peer) |
06:00
π
|
|
kyonko has joined #archiveteam-bs |
06:05
π
|
|
SimpBrain has quit IRC (Remote host closed the connection) |
06:06
π
|
|
SimpBrain has joined #archiveteam-bs |
06:07
π
|
|
bsmith093 has quit IRC (Read error: Operation timed out) |
06:10
π
|
|
bsmith093 has joined #archiveteam-bs |
06:21
π
|
|
killsushi has quit IRC (Quit: Leaving) |
06:25
π
|
|
deevious has joined #archiveteam-bs |
06:35
π
|
|
SumTingWo has joined #archiveteam-bs |
06:47
π
|
|
SumTingWo has quit IRC (Read error: Connection reset by peer) |
06:51
π
|
|
julientm has joined #archiveteam-bs |
06:55
π
|
|
julientre has joined #archiveteam-bs |
07:08
π
|
|
Exairnous has quit IRC (Ping timeout: 255 seconds) |
07:11
π
|
|
S1mpbrain has joined #archiveteam-bs |
07:11
π
|
|
SimpBrain has quit IRC (Read error: Connection reset by peer) |
07:13
π
|
|
julientre has quit IRC (Quit: Leaving) |
07:17
π
|
|
julientm1 has joined #archiveteam-bs |
07:24
π
|
|
julientm has quit IRC (Remote host closed the connection) |
07:24
π
|
|
julientm1 has quit IRC (Leaving) |
07:25
π
|
|
julientm has joined #archiveteam-bs |
07:34
π
|
|
deevious has quit IRC (Quit: deevious) |
08:11
π
|
|
deevious has joined #archiveteam-bs |
08:40
π
|
|
Exairnous has joined #archiveteam-bs |
09:33
π
|
|
julientm has quit IRC (Remote host closed the connection) |
09:35
π
|
|
julientm1 has joined #archiveteam-bs |
09:41
π
|
|
wp494 has quit IRC (Ping timeout: 492 seconds) |
09:43
π
|
|
wp494 has joined #archiveteam-bs |
09:52
π
|
|
julientm1 is now known as julientm |
11:00
π
|
|
julientm has quit IRC (Remote host closed the connection) |
11:29
π
|
|
kyonko has quit IRC (Read error: Connection reset by peer) |
12:10
π
|
|
julientm has joined #archiveteam-bs |
12:15
π
|
lenary |
SketchCow: those pdfs of Spare Rib came back up, so I'm grabbing them now |
12:22
π
|
|
julientm has quit IRC (Remote host closed the connection) |
12:35
π
|
|
S1mpbrain has quit IRC (Remote host closed the connection) |
12:35
π
|
|
S1mpbrain has joined #archiveteam-bs |
13:13
π
|
|
deevious has quit IRC (Quit: deevious) |
13:35
π
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
13:50
π
|
|
S1mpbrain has quit IRC (Read error: Operation timed out) |
13:57
π
|
|
SimpBrain has joined #archiveteam-bs |
14:27
π
|
|
deevious has joined #archiveteam-bs |
15:08
π
|
|
ndiddy has joined #archiveteam-bs |
15:21
π
|
|
deevious has quit IRC (Ping timeout: 252 seconds) |
15:29
π
|
|
deevious has joined #archiveteam-bs |
15:54
π
|
|
julientm has joined #archiveteam-bs |
15:57
π
|
|
SimpBrain has quit IRC (Read error: Connection reset by peer) |
15:58
π
|
|
SimpBrain has joined #archiveteam-bs |
16:00
π
|
|
deevious1 has joined #archiveteam-bs |
16:01
π
|
|
deevious has quit IRC (Read error: Connection reset by peer) |
16:01
π
|
|
deevious1 is now known as deevious |
16:09
π
|
|
julientm has quit IRC (Remote host closed the connection) |
16:11
π
|
|
Czechball has joined #archiveteam-bs |
16:11
π
|
Czechball |
So here I am |
16:11
π
|
JAA |
Czechball: Yes, it's possible to run all our projects outside of the warrior as well. |
16:11
π
|
JAA |
The instructions for that are normally in the code repository, which is linked on each project's wiki page. |
16:11
π
|
Czechball |
yeah I noticed that |
16:12
π
|
Czechball |
I thought maybe you have a Warrior distro ready for real hardware |
16:12
π
|
JAA |
There is a Docker image. |
16:13
π
|
Czechball |
Alright, I'll check that out |
16:13
π
|
Czechball |
I could probably even run some scripts on my RaspberryPi |
16:24
π
|
|
godane has joined #archiveteam-bs |
16:28
π
|
|
bitBaron has joined #archiveteam-bs |
16:36
π
|
|
SimpBrain has quit IRC (Read error: Operation timed out) |
16:40
π
|
|
SimpBrain has joined #archiveteam-bs |
17:02
π
|
|
Stilett0 is now known as Stiletto |
17:29
π
|
|
omarroth has joined #archiveteam-bs |
17:39
π
|
|
julientm has joined #archiveteam-bs |
17:40
π
|
|
Ravenloft has quit IRC (Read error: Connection reset by peer) |
17:47
π
|
|
turnkit has joined #archiveteam-bs |
17:56
π
|
|
julientm has quit IRC (Remote host closed the connection) |
18:01
π
|
|
julientm has joined #archiveteam-bs |
18:02
π
|
|
Moder112 has joined #archiveteam-bs |
18:04
π
|
Moder112 |
well |
18:04
π
|
Moder112 |
I tried splitting the megawarc I mentioned with those tools but unfortunately it resulted in the creation of either thousands of improperly named warc files |
18:05
π
|
Moder112 |
or a directory structure exactly the same as the original site but impossible to browse |
18:06
π
|
Moder112 |
I got the json metadata file for the archive, is it possible to split it into the files listed in the json? |
18:06
π
|
Moder112 |
or do I need to code a program from scratch to do that |
18:17
π
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) |
18:33
π
|
Moder112 |
I thought I could use this |
18:33
π
|
Moder112 |
https://github.com/alard/megawarc |
18:33
π
|
Moder112 |
but the tar file from the specific archive can't be downloaded |
18:34
π
|
Moder112 |
or I mean |
18:34
π
|
Moder112 |
It can be |
18:34
π
|
Moder112 |
but it saves as a 0b file |
18:35
π
|
Moder112 |
which I guess would make sense as the tar is for non-warc files |
18:36
π
|
Moder112 |
but the script really doesn't like that |
18:40
π
|
|
wp494 has quit IRC (Read error: Operation timed out) |
18:41
π
|
|
wp494 has joined #archiveteam-bs |
18:43
π
|
|
Czechball has quit IRC (Leaving) |
19:04
π
|
|
bitBaron has joined #archiveteam-bs |
19:08
π
|
JAA |
Moder112: I've never used it, but 'megawarc restore' should split the megawarc back up into the original files. But that'll write out all of the WARCs, not just the one(s) you want. |
19:11
π
|
JAA |
Moder112: Oh, forgot to mention, you can also only download the relevant part of the megawarc from IA with a range request. |
19:13
π
|
|
Oddly has joined #archiveteam-bs |
19:16
π
|
JAA |
For example, to download the third WARC contained in https://archive.org/download/archiveteam_tumblr20181218030723 : |
19:16
π
|
JAA |
curl -L -H 'Range: bytes=46559344-46567731' https://archive.org/download/archiveteam_tumblr20181218030723/tumblr_20181218030723.megawarc.warc.gz > tumblr-tumblr-blog_obssesivebloodykisses-20181218-025918.warc.gz |
19:17
π
|
JAA |
The numbers in the range header come from the JSON record for that file: "offset":46559344,"size":8388 |
19:17
π
|
JAA |
The second number is offset + size - 1. |
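As a rough sketch of the arithmetic JAA describes, the range value can be computed from the two JSON fields. The helper name `range_for` is made up for illustration; the offset and size are the ones quoted above.

```shell
# Build an HTTP Range value for one record inside a megawarc.
# $1 = "offset", $2 = "size" (both in bytes, taken from the JSON record).
# Byte ranges are inclusive, so the last byte is offset + size - 1.
range_for() {
  printf 'bytes=%d-%d' "$1" "$(( $1 + $2 - 1 ))"
}

range_header="Range: $(range_for 46559344 8388)"
echo "$range_header"
```

The resulting string is what would be passed to `curl -H`, as in the command above.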
19:20
π
|
JAA |
You could also extract that file from the megawarc using this: tail -c+46559345 tumblr_20181218030723.megawarc.warc.gz | head -c8388 > tumblr-tumblr-blog_obssesivebloodykisses-20181218-025918.warc.gz |
19:20
π
|
JAA |
Where the number for tail is the offset + 1 and the one for head is the size. |
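The same tail/head approach can be wrapped in a small function so the offset + 1 step is not done by hand. This is only a sketch: the name `extract_record` and its argument order are assumptions, and it relies on `head -c`, which the command above already uses.

```shell
# Copy one record out of a locally downloaded megawarc.
# $1 = megawarc file, $2 = zero-based "offset", $3 = "size", $4 = output file.
# tail -c +N counts from 1, hence offset + 1; head -c keeps exactly "size" bytes.
extract_record() {
  tail -c "+$(( $2 + 1 ))" "$1" | head -c "$3" > "$4"
}

# Usage with the values quoted above (file names as in the example):
# extract_record tumblr_20181218030723.megawarc.warc.gz 46559344 8388 out.warc.gz
```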
19:45
π
|
Moder112 |
well |
19:46
π
|
Moder112 |
unfortunately I tried using the megawarc program |
19:46
π
|
Moder112 |
and it crashes on me |
19:48
π
|
Moder112 |
I'm gonna try out the partial download solution tomorrow |
19:49
π
|
Moder112 |
I'll come back to say how it went tomorrow |
19:49
π
|
Moder112 |
thanks a lot |
20:00
π
|
|
omarroth has quit IRC (Ping timeout: 506 seconds) |
20:01
π
|
|
Moder112 has quit IRC (Quit: Page closed) |
20:01
π
|
|
Stilett0 has joined #archiveteam-bs |
20:05
π
|
|
Stiletto has quit IRC (Ping timeout: 615 seconds) |
20:22
π
|
|
kiska1 has quit IRC (Read error: Operation timed out) |
20:23
π
|
|
kiska1 has joined #archiveteam-bs |
20:31
π
|
|
kiska1 has quit IRC (Ping timeout (120 seconds)) |
20:31
π
|
|
kiska1 has joined #archiveteam-bs |
20:40
π
|
|
Despatche has joined #archiveteam-bs |
20:54
π
|
SketchCow |
lenary: Excellent. |
20:55
π
|
lenary |
SketchCow: how do I get the warcs somewhere safer than my spinning rust? |
20:55
π
|
SketchCow |
Just upload into IA's opensource collection, a script puts them in a safe place. |
20:55
π
|
lenary |
I haven’t attempted the other ephemera yet. Need to get Helena up and running |
20:55
π
|
lenary |
Oh neat |
20:59
π
|
|
Oddly has quit IRC (Quit: Leaving) |
21:00
π
|
|
BlueMax has joined #archiveteam-bs |
21:42
π
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
21:43
π
|
|
SimpBrain has quit IRC (Remote host closed the connection) |
21:47
π
|
|
SimpBrain has joined #archiveteam-bs |
21:57
π
|
|
wyatt8740 has joined #archiveteam-bs |
22:23
π
|
|
SimpBrain has quit IRC (Remote host closed the connection) |
22:26
π
|
|
SimpBrain has joined #archiveteam-bs |
23:00
π
|
|
Despatche has quit IRC (Quit: Connection reset by deer) |
23:02
π
|
|
fuzzy8021 has quit IRC (Read error: Connection reset by peer) |
23:03
π
|
|
fuzzy8021 has joined #archiveteam-bs |
23:25
π
|
dashcloud |
godane: I found a Hauppauge PVR-250 & an ASUS PVR-416 in my collection, and both seem to work |