Time |
Nickname |
Message |
00:06
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
00:06
🔗
|
|
wp494 has joined #archiveteam-ot |
00:19
🔗
|
|
kiskaLap has quit IRC (Leaving) |
00:19
🔗
|
|
kiskaLap has joined #archiveteam-ot |
00:27
🔗
|
|
kiskaLap has quit IRC (Leaving) |
00:27
🔗
|
|
kiskaLap has joined #archiveteam-ot |
00:50
🔗
|
|
ta9le has quit IRC (Quit: Connection closed for inactivity) |
01:04
🔗
|
|
dxrt_ has quit IRC (Ping timeout: 268 seconds) |
01:08
🔗
|
|
dxrt_ has joined #archiveteam-ot |
01:08
🔗
|
|
dxrt sets mode: +o dxrt_ |
02:15
🔗
|
|
Flashfire has joined #archiveteam-ot |
02:27
🔗
|
|
redlizard has quit IRC (Read error: Connection reset by peer) |
03:56
🔗
|
|
odemg has quit IRC (Ping timeout: 268 seconds) |
04:07
🔗
|
|
odemg has joined #archiveteam-ot |
06:36
🔗
|
|
kiska3 has joined #archiveteam-ot |
06:39
🔗
|
|
kiska has quit IRC (Ping timeout: 252 seconds) |
08:54
🔗
|
|
ivan has quit IRC (Read error: Operation timed out) |
08:54
🔗
|
|
JAA has quit IRC (Read error: Operation timed out) |
08:54
🔗
|
|
JAA has joined #archiveteam-ot |
08:54
🔗
|
|
ivan has joined #archiveteam-ot |
08:55
🔗
|
|
jspiros has quit IRC (Read error: Operation timed out) |
08:55
🔗
|
|
bakJAA sets mode: +o JAA |
08:55
🔗
|
|
SketchCow has quit IRC (Read error: Operation timed out) |
09:00
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
09:00
🔗
|
|
wp494 has joined #archiveteam-ot |
09:03
🔗
|
|
schbirid has joined #archiveteam-ot |
09:18
🔗
|
|
BlueMaxim has joined #archiveteam-ot |
09:23
🔗
|
|
BlueMax has quit IRC (Read error: Operation timed out) |
09:29
🔗
|
|
betamax has quit IRC (Read error: Operation timed out) |
09:32
🔗
|
|
betamax has joined #archiveteam-ot |
09:54
🔗
|
|
ta9le has joined #archiveteam-ot |
09:59
🔗
|
|
jspiros has joined #archiveteam-ot |
10:29
🔗
|
|
Flashfire has quit IRC (Bye) |
10:38
🔗
|
|
dxrt has quit IRC (Remote host closed the connection) |
10:38
🔗
|
|
dxrt has joined #archiveteam-ot |
10:59
🔗
|
|
vitzli has joined #archiveteam-ot |
11:04
🔗
|
|
t2t2 has quit IRC (Read error: Operation timed out) |
11:05
🔗
|
|
t2t2 has joined #archiveteam-ot |
14:16
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
15:30
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
15:30
🔗
|
|
schbirid has joined #archiveteam-ot |
16:11
🔗
|
|
jut_ has joined #archiveteam-ot |
16:14
🔗
|
|
jut has quit IRC (Ping timeout: 252 seconds) |
16:29
🔗
|
|
BlueMaxim has quit IRC (Leaving) |
17:50
🔗
|
|
icedice has joined #archiveteam-ot |
17:51
🔗
|
icedice |
Does anybody here know if HGST HDDs are still good after the Western Digital merger? |
17:57
🔗
|
|
icedice has quit IRC (Quit: Leaving) |
17:58
🔗
|
|
icedice has joined #archiveteam-ot |
18:09
🔗
|
schbirid |
discussion reliability benefits for single/few hard drives is idiotic |
18:09
🔗
|
schbirid |
pick any you like |
18:09
🔗
|
schbirid |
you have backups so why worry about it |
18:25
🔗
|
|
SketchCow has joined #archiveteam-ot |
18:54
🔗
|
|
godane has quit IRC (Ping timeout: 252 seconds) |
20:29
🔗
|
|
MrRadar has quit IRC (Read error: Operation timed out) |
20:30
🔗
|
|
Soni has quit IRC (Ping timeout: 264 seconds) |
20:35
🔗
|
|
arkiver has quit IRC (Ping timeout: 360 seconds) |
20:37
🔗
|
|
arkiver has joined #archiveteam-ot |
20:37
🔗
|
|
Soni has joined #archiveteam-ot |
20:41
🔗
|
|
Stiletto has quit IRC (Ping timeout: 360 seconds) |
20:42
🔗
|
|
Stilett0 has joined #archiveteam-ot |
20:44
🔗
|
|
dxrt- has joined #archiveteam-ot |
20:44
🔗
|
|
chirlu` has quit IRC (Excess Flood) |
20:45
🔗
|
|
dxrt has quit IRC (Ping timeout: 360 seconds) |
20:49
🔗
|
|
MrRadar has joined #archiveteam-ot |
20:50
🔗
|
|
chirlu has joined #archiveteam-ot |
20:53
🔗
|
|
astrid has quit IRC (Read error: Operation timed out) |
20:53
🔗
|
|
zino has quit IRC (Read error: Connection reset by peer) |
20:53
🔗
|
|
zino has joined #archiveteam-ot |
20:55
🔗
|
|
MrRadar has quit IRC (Read error: Connection reset by peer) |
20:55
🔗
|
|
MrRadar_ has joined #archiveteam-ot |
20:59
🔗
|
|
astrid has joined #archiveteam-ot |
21:00
🔗
|
|
schbirid has quit IRC (Ping timeout: 1212 seconds) |
21:03
🔗
|
|
icedice has quit IRC (Ping timeout: 252 seconds) |
22:27
🔗
|
|
nightpool has joined #archiveteam-ot |
22:39
🔗
|
hook54321 |
Anyone know a grep pattern/command/whatever that I could use to extract a all URLs from a text document? |
22:48
🔗
|
JAA |
hook54321: That's a fairly difficult problem, especially when it also involves stuff like "check out this page (http://domain/file.html)" where you'd want to not include the closing parenthesis. |
22:49
🔗
|
JAA |
But if there's whitespace around the URLs, try grep -o 'http\S*' file |
22:49
🔗
|
JAA |
(Which would obviously also find things that aren't URLs.) |
22:50
🔗
|
JAA |
http URLs* |
22:52
🔗
|
JAA |
Here's another simple one that would extract URLs with any scheme and be a bit more strict about it: grep -o '\<[a-z]\+://\S*' |
23:01
🔗
|
hook54321 |
JAA: With that second pattern there's still " after them (and some other characters after ") |
23:01
🔗
|
hook54321 |
on most of them at least |
23:02
🔗
|
JAA |
Yeah, that's the "difficult" part I mentioned above. |
23:06
🔗
|
JAA |
hook54321: grep -Po '(^|\s)([^a-z]?)\K[a-z]+://\S*(?=\2(\s|$))' file |
23:06
🔗
|
JAA |
That still won't deal with parentheses correctly though. |
23:06
🔗
|
JAA |
It does handle quotes around URLs though. |
23:11
🔗
|
hook54321 |
ok, thanks |