Time |
Nickname |
Message |
00:07
🔗
|
|
Ghost_of_ has joined #archiveteam |
00:09
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
00:13
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
00:15
🔗
|
|
BubuAnabe has joined #archiveteam |
00:54
🔗
|
|
Start has joined #archiveteam |
01:01
🔗
|
X1011 |
Hi, I have recently learned that the recordings of a local radio show I often listen to will be taken offline soon, as the show is moving to a different station. They are in mp3 at http://frania.wedo810.com . What would be the best way for me to archive them? Should I use ArchiveBot? |
01:09
🔗
|
|
nickname has quit IRC (Ping timeout: 240 seconds) |
01:17
🔗
|
|
Ravenloft has joined #archiveteam |
01:34
🔗
|
dashcloud |
archivebot is a good choice, but if you want a copy for yourself, grabbing them with wpull or some other file downloader would be better |
01:48
🔗
|
BubuAnabe |
can someone run ig ere6eh6hx0igqj75w94bzclr5 .*\/?page_id=25686(&+.*) in archivebot, as it get stucked int that page please |
01:49
🔗
|
BubuAnabe |
I think the regex is fine, but check it anyway |
01:49
🔗
|
BubuAnabe |
Thanks! |
01:52
🔗
|
|
_desu___ has quit IRC (Ping timeout: 260 seconds) |
01:54
🔗
|
|
_desu___ has joined #archiveteam |
02:10
🔗
|
|
schbirid2 has joined #archiveteam |
02:13
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
02:13
🔗
|
|
philpem has quit IRC (Ping timeout: 260 seconds) |
02:35
🔗
|
|
K4k_ has joined #archiveteam |
02:39
🔗
|
|
K4k_ has quit IRC (Ping timeout: 252 seconds) |
02:42
🔗
|
|
Stiletto has quit IRC () |
02:51
🔗
|
|
Stiletto has joined #archiveteam |
03:10
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
03:12
🔗
|
|
Start has joined #archiveteam |
03:18
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
03:22
🔗
|
|
BubuAnabe has joined #archiveteam |
03:25
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
03:27
🔗
|
|
BubuAnabe has joined #archiveteam |
03:34
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
03:46
🔗
|
|
nertzy2 has joined #archiveteam |
03:47
🔗
|
|
Ghost_of_ has quit IRC (Quit: Leaving) |
03:48
🔗
|
|
BubuAnabe has joined #archiveteam |
04:10
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
04:14
🔗
|
|
BubuAnabe has joined #archiveteam |
04:15
🔗
|
|
BubuAnabe has quit IRC (Client Quit) |
04:17
🔗
|
|
nertzy2 has quit IRC (Quit: This computer has gone to sleep) |
04:19
🔗
|
|
BubuAnabe has joined #archiveteam |
04:19
🔗
|
|
megaminxw has quit IRC (Quit: Leaving.) |
04:20
🔗
|
|
megaminxw has joined #archiveteam |
04:20
🔗
|
|
megaminxw has quit IRC (Client Quit) |
04:22
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
04:24
🔗
|
|
BubuAnabe has joined #archiveteam |
04:57
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
05:00
🔗
|
|
BubuAnabe has joined #archiveteam |
05:35
🔗
|
|
megaminxw has joined #archiveteam |
05:37
🔗
|
|
JesseW has joined #archiveteam |
05:53
🔗
|
|
chazchaz has quit IRC (Read error: Operation timed out) |
06:00
🔗
|
|
chazchaz has joined #archiveteam |
06:11
🔗
|
|
K4k_ has joined #archiveteam |
06:16
🔗
|
|
K4k_ has quit IRC (Ping timeout: 260 seconds) |
06:31
🔗
|
|
BubuAnabe has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) |
07:00
🔗
|
|
JesseW has quit IRC (Leaving.) |
07:11
🔗
|
|
JesseW has joined #archiveteam |
07:23
🔗
|
|
turnkit|2 has left Once you know what it is you want to be true, instinct is a very useful device for enabling you to know that it is |
07:46
🔗
|
|
logan has quit IRC (Read error: Operation timed out) |
07:47
🔗
|
|
PepsiMax has joined #archiveteam |
07:52
🔗
|
|
godane has quit IRC (Read error: Operation timed out) |
08:08
🔗
|
|
JesseW has quit IRC (Leaving.) |
08:39
🔗
|
|
FAMAS has joined #archiveteam |
09:02
🔗
|
|
FAMAS has quit IRC (Quit: http://chat.efnet.org (Ping timeout)) |
09:28
🔗
|
|
nertzy2 has joined #archiveteam |
09:51
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
09:52
🔗
|
arkiver |
SketchCow: chfoo: can you please create a rsync target on FOS for musicbrainz? |
09:54
🔗
|
|
dashcloud has joined #archiveteam |
09:58
🔗
|
|
nertzy2 has quit IRC (Quit: This computer has gone to sleep) |
10:41
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
10:47
🔗
|
|
zer0rest has joined #archiveteam |
11:02
🔗
|
|
atomotic has joined #archiveteam |
11:07
🔗
|
|
zer0rest has quit IRC (Read error: Connection reset by peer) |
11:10
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
11:13
🔗
|
|
dashcloud has joined #archiveteam |
11:18
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
11:18
🔗
|
|
signius has quit IRC (Ping timeout: 260 seconds) |
11:21
🔗
|
|
signius has joined #archiveteam |
11:35
🔗
|
|
zer0rest has joined #archiveteam |
11:38
🔗
|
|
zer0rest has quit IRC (Read error: Connection reset by peer) |
11:52
🔗
|
|
zer0rest has joined #archiveteam |
12:15
🔗
|
|
godane has joined #archiveteam |
12:20
🔗
|
|
K4k_ has joined #archiveteam |
12:21
🔗
|
|
zer0rest has quit IRC (Ping timeout: 260 seconds) |
12:31
🔗
|
|
zer0rest has joined #archiveteam |
12:35
🔗
|
|
zer0rest has quit IRC (Ping timeout: 260 seconds) |
12:54
🔗
|
|
zer0rest has joined #archiveteam |
12:55
🔗
|
|
K4k_ has quit IRC (Read error: Operation timed out) |
12:55
🔗
|
|
VADemon has joined #archiveteam |
13:13
🔗
|
|
atomotic has joined #archiveteam |
13:15
🔗
|
|
K4k_ has joined #archiveteam |
13:22
🔗
|
|
zer0rest has quit IRC (Ping timeout: 260 seconds) |
13:30
🔗
|
|
K4k_ has quit IRC (Ping timeout: 252 seconds) |
13:35
🔗
|
|
K4k_ has joined #archiveteam |
13:43
🔗
|
|
WinterFox has quit IRC (Remote host closed the connection) |
14:06
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
14:07
🔗
|
|
vitzli has joined #archiveteam |
14:11
🔗
|
|
REiN^ has quit IRC () |
14:18
🔗
|
|
K4k__ has joined #archiveteam |
14:19
🔗
|
|
K4k_ has quit IRC (Read error: Operation timed out) |
14:36
🔗
|
|
philpem has joined #archiveteam |
14:42
🔗
|
|
nertzy2 has joined #archiveteam |
15:13
🔗
|
|
nertzy2 has quit IRC (Quit: This computer has gone to sleep) |
15:23
🔗
|
|
FAMAS has joined #archiveteam |
15:31
🔗
|
|
zer0rest has joined #archiveteam |
15:49
🔗
|
|
FAMAS has quit IRC (Quit: http://chat.efnet.org (EOF)) |
15:51
🔗
|
|
megaminxw has quit IRC (Quit: Leaving.) |
15:58
🔗
|
Sketchcow |
Done |
15:59
🔗
|
arkiver |
SketchCow: thanks! It's in the chfoo dir right? |
16:00
🔗
|
Sketchcow |
Yes |
16:00
🔗
|
Sketchcow |
musicbrainz/ |
16:00
🔗
|
arkiver |
ok, we'll start the grab soon |
16:00
🔗
|
HCross |
are the the scripts on GitHub? |
16:01
🔗
|
arkiver |
not yet |
16:02
🔗
|
Sketchcow |
What are you grabbing, again? The linked items from Musicbrainz entries? |
16:09
🔗
|
|
xhades has quit IRC (Ping timeout: 260 seconds) |
16:11
🔗
|
arkiver |
The external links linked to from musicbrainz |
16:11
🔗
|
arkiver |
For example http://musicbrainz.org/artist/f27ec8db-af05-4f36-916e-3d57f91ecf5e |
16:12
🔗
|
arkiver |
Right under "External links" there's a large list of links |
16:12
🔗
|
arkiver |
Those will all be grabbed |
16:15
🔗
|
Sketchcow |
GOt it |
16:15
🔗
|
Sketchcow |
Yeah, good choice |
16:22
🔗
|
|
MMovie1 has joined #archiveteam |
16:22
🔗
|
phuzion |
arkiver: mind throwing me a ping once the scripts go up on github? |
16:23
🔗
|
|
MMovie has quit IRC (Read error: Operation timed out) |
16:27
🔗
|
arkiver |
phuzion: will do |
16:40
🔗
|
|
vitzli has quit IRC (Leaving) |
16:42
🔗
|
Atluxity |
could we archive all external links from all wikipedia? |
16:44
🔗
|
|
MMovie has joined #archiveteam |
16:44
🔗
|
|
Ravenloft has quit IRC (Ping timeout: 252 seconds) |
16:44
🔗
|
|
MMovie1 has quit IRC (Read error: Operation timed out) |
16:46
🔗
|
Sketchcow |
IA does that. |
16:46
🔗
|
Sketchcow |
Like, REALLY does that. Constantly. |
16:47
🔗
|
Atluxity |
great |
16:47
🔗
|
|
RichardG_ is now known as RichardG |
16:52
🔗
|
arkiver |
https://archive.org/details/NO404 |
16:52
🔗
|
arkiver |
They also do wordpress and GDELT |
16:57
🔗
|
|
nertzy2 has joined #archiveteam |
17:07
🔗
|
|
Morbus has joined #archiveteam |
17:22
🔗
|
|
zer0rest has quit IRC (Ping timeout: 260 seconds) |
17:23
🔗
|
|
zer0rest has joined #archiveteam |
17:29
🔗
|
|
nertzy2 has quit IRC (Quit: This computer has gone to sleep) |
17:57
🔗
|
|
zer0rest has quit IRC (Read error: Connection reset by peer) |
17:58
🔗
|
|
scyther has joined #archiveteam |
18:11
🔗
|
|
xhades has joined #archiveteam |
18:16
🔗
|
|
zer0rest has joined #archiveteam |
18:16
🔗
|
|
bentpins has quit IRC (Read error: Operation timed out) |
18:20
🔗
|
|
K4k__ has quit IRC (Quit: WeeChat 1.3) |
18:29
🔗
|
|
nertzy2 has joined #archiveteam |
18:36
🔗
|
JW_work |
Regarding IA's scraping of Wikipedia outlinks — does that cover all the language editions? And what about the other wikimedia projects (like wiktionary, wikisource, etc.)? And what about links on talk pages? |
18:37
🔗
|
JW_work |
(I can send these questions in to info@ if that would be a better place to ask them.) |
18:45
🔗
|
|
zer0rest has quit IRC (Read error: Connection reset by peer) |
18:51
🔗
|
|
zer0rest has joined #archiveteam |
18:51
🔗
|
|
zer0rest has quit IRC (Client Quit) |
18:51
🔗
|
|
zer0rest has joined #archiveteam |
19:02
🔗
|
|
nertzy2 has quit IRC (Quit: This computer has gone to sleep) |
19:06
🔗
|
JW_work |
I see that Nemo_bis ( and SJ Klein) already asked about multi-language stuff (back in 2013). Sadly, there doesn't seem to have been an answer (yet). :-) |
19:37
🔗
|
|
zer0rest has quit IRC (Ping timeout: 260 seconds) |
19:37
🔗
|
Nemo_bis |
JW_work: so you probably saw that IA is indeed crawling links from multiple languages. I don't know how many exactly. |
19:38
🔗
|
Nemo_bis |
https://www.mediawiki.org/wiki/Talk:Archived_Pages |
19:38
🔗
|
JW_work |
ah, cool |
19:47
🔗
|
|
slyphic is now known as slyphic|a |
20:07
🔗
|
|
brayden has quit IRC (Read error: Connection reset by peer) |
20:14
🔗
|
|
brayden has joined #archiveteam |
20:14
🔗
|
|
swebb sets mode: +o brayden |
20:21
🔗
|
|
nertzy2 has joined #archiveteam |
20:32
🔗
|
|
JW_work has quit IRC (Read error: Connection reset by peer) |
20:47
🔗
|
|
zer0rest has joined #archiveteam |
20:58
🔗
|
|
nertzy2 has quit IRC (Quit: This computer has gone to sleep) |
21:19
🔗
|
|
nertzy2 has joined #archiveteam |
21:31
🔗
|
|
Ghost_of_ has joined #archiveteam |
21:33
🔗
|
|
lbft_ has quit IRC (Read error: Operation timed out) |
21:52
🔗
|
|
Ravenloft has joined #archiveteam |
22:04
🔗
|
|
sigkell has quit IRC (Ping timeout: 260 seconds) |
22:04
🔗
|
|
sigkell has joined #archiveteam |
22:05
🔗
|
|
Rickster has quit IRC (Ping timeout: 260 seconds) |
22:05
🔗
|
|
Muad-Dib has quit IRC (Ping timeout: 260 seconds) |
22:07
🔗
|
|
Rickster has joined #archiveteam |
22:18
🔗
|
|
zer0rest has quit IRC (Read error: Connection reset by peer) |
22:18
🔗
|
|
zer0rest has joined #archiveteam |
22:18
🔗
|
|
zer0rest has quit IRC (Client Quit) |
22:30
🔗
|
|
scyther has quit IRC (Read error: Connection reset by peer) |
22:33
🔗
|
|
nertzy2 has quit IRC (Quit: This computer has gone to sleep) |
22:38
🔗
|
|
chazchaz has quit IRC (Ping timeout: 369 seconds) |
22:40
🔗
|
|
chazchaz has joined #archiveteam |
22:41
🔗
|
|
lbft has joined #archiveteam |
22:42
🔗
|
|
Ghost_of_ has quit IRC (Quit: Leaving) |
23:00
🔗
|
|
JW_work has joined #archiveteam |
23:13
🔗
|
|
WinterFox has joined #archiveteam |
23:15
🔗
|
|
megaminxw has joined #archiveteam |
23:22
🔗
|
|
schbirid2 has quit IRC (Quit: Leaving) |
23:56
🔗
|
|
Ghost_of_ has joined #archiveteam |