Time |
Nickname |
Message |
00:24
🔗
|
|
Ryz has joined #webroasting |
02:33
🔗
|
|
kiska has quit IRC (Ping timeout: 622 seconds) |
02:38
🔗
|
|
logchfoo2 starts logging #webroasting at Wed Mar 11 02:38:31 2020 |
02:38
🔗
|
|
logchfoo2 has joined #webroasting |
02:50
🔗
|
|
atphoenix has quit IRC (Read error: Operation timed out) |
02:51
🔗
|
|
atphoenix has joined #webroasting |
07:36
🔗
|
|
kiska has joined #webroasting |
08:49
🔗
|
|
kiska has quit IRC (Read error: Operation timed out) |
08:54
🔗
|
|
kiska has joined #webroasting |
09:18
🔗
|
|
Craigle has quit IRC (Quit: Ping timeout (120 seconds)) |
09:27
🔗
|
|
themadpro has joined #webroasting |
10:22
🔗
|
|
themadpro has quit IRC (Quit: This computer has gone to sleep) |
11:00
🔗
|
|
themadpro has joined #webroasting |
11:15
🔗
|
Jeroen |
@wessel151 We do have to upload all our WARCs sometime to a staging server before actually uploading it. |
11:15
🔗
|
Jeroen |
One that both of us can upload data to and read them. |
11:15
🔗
|
Jeroen |
And correct any missing or broken WARCs. |
11:18
🔗
|
Jeroen |
And I wonder what the maximum WARC size is that the IA accepts for the Wayback Machine. |
11:19
🔗
|
|
themadpro has quit IRC (Quit: This computer has gone to sleep) |
11:21
🔗
|
eythian |
Reminds me, I have a warc for this project I want to see about getting into the WBM |
11:24
🔗
|
eythian |
https://archive.org/details/homepages.inspire.net.nz_2019-04-18_inspire-user-homepages specifically this |
13:18
🔗
|
|
themadpro has joined #webroasting |
13:57
🔗
|
wessel151 |
Jeroen: we can use my home server for that |
13:58
🔗
|
wessel151 |
JAA: wast the best way to upload IA |
13:59
🔗
|
wessel151 |
have over 700 GB and its still growing |
14:00
🔗
|
JAA |
https://archive.org/services/docs/api/internetarchive/ |
14:02
🔗
|
JAA |
Jeroen: As far as I know, the WBM doesn't care how large the WARC file is. However, the derive process may time out if there are too many records in it. I generally use 5 GiB WARCs. The DPoS projects are usually uploaded as 50 GiB megawarcs, but that has caused problems occasionally before when the records were very small (e.g. TinyPic). |
14:11
🔗
|
wessel151 |
Jeroen: grap-site has 5 GB WARC by deflate so no need to worry |
14:57
🔗
|
|
themadpro has quit IRC (Quit: This computer has gone to sleep) |
16:13
🔗
|
|
Craigle has joined #webroasting |
17:00
🔗
|
|
ahmetone has joined #webroasting |
18:04
🔗
|
|
ahmetone has quit IRC (Quit: This computer has gone to sleep) |
18:10
🔗
|
|
ahmetone has joined #webroasting |
22:31
🔗
|
|
tech234a has quit IRC (Remote host closed the connection) |
22:40
🔗
|
|
tech234a has joined #webroasting |
22:50
🔗
|
|
Ctrl-S___ has quit IRC (Remote host closed the connection) |
22:50
🔗
|
|
hook54321 has quit IRC (Remote host closed the connection) |
22:53
🔗
|
|
Ctrl-S___ has joined #webroasting |
22:53
🔗
|
|
hook54321 has joined #webroasting |
23:50
🔗
|
|
ahmetone has quit IRC (Quit: Leaving) |