Time |
Nickname |
Message |
00:08
🔗
|
|
Coderjoe has quit IRC (Read error: Connection reset by peer) |
00:09
🔗
|
|
Coderjoe has joined #archiveteam |
00:32
🔗
|
|
Start-mob has joined #archiveteam |
00:35
🔗
|
|
Start-mob has quit IRC (Remote host closed the connection) |
00:36
🔗
|
SketchCow |
Yeah, grooveshark took it in the pants |
00:39
🔗
|
|
mistym_ has quit IRC (Remote host closed the connection) |
00:46
🔗
|
|
kyan has joined #archiveteam |
00:53
🔗
|
* |
SketchCow is in Atlanta now |
00:53
🔗
|
|
kyan has quit IRC (Remote host closed the connection) |
01:02
🔗
|
|
wacky has quit IRC (Read error: Operation timed out) |
01:02
🔗
|
|
wacky has joined #archiveteam |
01:10
🔗
|
SketchCow |
Progress on FOS |
01:10
🔗
|
SketchCow |
Halo uploads are still ridiculous, of course |
01:11
🔗
|
SketchCow |
But we're well out of the woods of any other non-halo project getting honked |
01:12
🔗
|
|
schbirid2 has joined #archiveteam |
01:13
🔗
|
|
ikreymer has joined #archiveteam |
01:14
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
01:17
🔗
|
ikreymer |
hi archiveteam, a serious question/quick survey: would you sign-up for a personal web archiving service if you had the option? |
01:19
🔗
|
|
primus104 has quit IRC (Leaving.) |
01:21
🔗
|
SketchCow |
HI ILYA |
01:21
🔗
|
SketchCow |
Ilya the question is if we'd sign up for YOURS |
01:22
🔗
|
SketchCow |
In general we trust nobody, anywhere, ever, and this policy has worked out well. |
01:22
🔗
|
joepie91_ |
heh |
01:22
🔗
|
ikreymer |
you trust brewster ;) |
01:23
🔗
|
joepie91_ |
ikreymer: "trust" vs "delegate appropriate and strictly necessary amounts of responsibility" |
01:23
🔗
|
joepie91_ |
:) |
01:23
🔗
|
joepie91_ |
see also http://archiveteam.org/index.php?title=INTERNETARCHIVE.BAK |
01:26
🔗
|
ikreymer |
in that case, i'm not asking for your trust either :) |
01:26
🔗
|
ikreymer |
i'm aware of the .bak project, yes |
01:28
🔗
|
SketchCow |
Also, trying to survey these monkeys, you might as well yell out your phone number at a ball game |
01:29
🔗
|
SketchCow |
Let's put it this way - we're never going to endorse a personal web archiving service, I doubt any here will use it, and the best you could hope for is maybe we'd help you distribute an open source version |
01:29
🔗
|
SketchCow |
You're coming into the EMT waiting/smoking room asking if anyone wants to buy a first aid kit |
01:34
🔗
|
ikreymer |
i wasn't asking for endorsements or selling anything, just curious about your opinions. thanks for sharing |
01:55
🔗
|
dashcloud |
yes, I have signed up for one- pinboard is awesome (for two reasons: the guy behind it is amazing, and the site is marvelous in its simplicity) |
01:58
🔗
|
|
mistym has joined #archiveteam |
02:38
🔗
|
|
kyan has joined #archiveteam |
02:57
🔗
|
|
Muffin has joined #archiveteam |
03:00
🔗
|
|
Stiletto has joined #archiveteam |
03:14
🔗
|
|
dPhoenix has quit IRC (Remote host closed the connection) |
03:41
🔗
|
SketchCow |
huggy hugg hug |
03:45
🔗
|
|
Ymgve has quit IRC () |
04:14
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
04:14
🔗
|
|
dashcloud has joined #archiveteam |
04:19
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
05:56
🔗
|
|
VADemon has quit IRC (Read error: Connection reset by peer) |
06:08
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
06:13
🔗
|
|
ikreymer has quit IRC () |
06:33
🔗
|
|
Start_ has joined #archiveteam |
06:33
🔗
|
|
Start has quit IRC (Read error: Connection reset by peer) |
06:39
🔗
|
|
SimpBrain has joined #archiveteam |
06:39
🔗
|
|
mistym has joined #archiveteam |
06:55
🔗
|
|
scyther has joined #archiveteam |
07:57
🔗
|
|
primus104 has joined #archiveteam |
08:12
🔗
|
|
dashcloud has quit IRC (Ping timeout: 265 seconds) |
08:13
🔗
|
|
rejon has quit IRC (Read error: Operation timed out) |
08:14
🔗
|
|
Muffin has quit IRC () |
08:16
🔗
|
|
dashcloud has joined #archiveteam |
08:18
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
08:18
🔗
|
|
mistym has joined #archiveteam |
08:35
🔗
|
|
schbirid2 has quit IRC (Quit: Leaving) |
08:39
🔗
|
|
primus104 has quit IRC (Leaving.) |
08:43
🔗
|
|
scyther has quit IRC (Leaving) |
08:50
🔗
|
|
schbirid has joined #archiveteam |
08:56
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
08:58
🔗
|
|
schbirid has joined #archiveteam |
09:05
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
09:31
🔗
|
|
Start_ has quit IRC (Disconnected.) |
10:06
🔗
|
|
mistym has joined #archiveteam |
10:20
🔗
|
|
mistym has quit IRC (Read error: Operation timed out) |
10:33
🔗
|
|
insane_al has joined #archiveteam |
11:07
🔗
|
|
mistym has joined #archiveteam |
11:15
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
11:18
🔗
|
|
dashcloud has joined #archiveteam |
11:21
🔗
|
|
mistym has quit IRC (Read error: Operation timed out) |
11:43
🔗
|
|
SimpBrain has quit IRC (Read error: Connection reset by peer) |
11:54
🔗
|
|
habi has joined #archiveteam |
11:54
🔗
|
|
habi has left |
11:55
🔗
|
|
Ymgve has joined #archiveteam |
12:26
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
12:36
🔗
|
|
primus104 has joined #archiveteam |
12:46
🔗
|
|
primus104 has quit IRC (Leaving.) |
12:57
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
12:59
🔗
|
|
dashcloud has joined #archiveteam |
13:00
🔗
|
|
kyan has quit IRC (Quit: This computer has gone to sleep) |
13:43
🔗
|
|
primus104 has joined #archiveteam |
14:40
🔗
|
|
habi has joined #archiveteam |
14:45
🔗
|
|
habi has left |
15:01
🔗
|
|
primus104 has quit IRC (Leaving.) |
15:04
🔗
|
|
VADemon has joined #archiveteam |
15:14
🔗
|
|
nertzy has joined #archiveteam |
15:27
🔗
|
ersi |
Infreq: Depends on how open it is, if it's completely open and not fucking about; maybe yes |
15:28
🔗
|
ersi |
I meant ikreymer, but he isn't here so I accidentally tabbed Infreq >_> |
15:39
🔗
|
|
nertzy has quit IRC (This computer has gone to sleep) |
15:41
🔗
|
|
www2 has quit IRC (Read error: Operation timed out) |
15:48
🔗
|
|
habi has joined #archiveteam |
16:01
🔗
|
|
habi has quit IRC (Quit: Leaving.) |
16:03
🔗
|
|
Start has joined #archiveteam |
16:05
🔗
|
|
Stiletto has quit IRC (Ping timeout: 306 seconds) |
16:09
🔗
|
|
Stiletto has joined #archiveteam |
16:17
🔗
|
|
Stiletto has quit IRC (Ping timeout: 306 seconds) |
16:17
🔗
|
|
aaaaaaaaa has joined #archiveteam |
16:23
🔗
|
|
Stiletto has joined #archiveteam |
16:38
🔗
|
|
Smiley has quit IRC (Quit: http://www.milkme.co.uk - You'll never understand.) |
16:42
🔗
|
|
T31M has quit IRC (Quit: Leaving) |
16:49
🔗
|
|
Smiley has joined #archiveteam |
16:59
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
17:01
🔗
|
|
dashcloud has joined #archiveteam |
17:04
🔗
|
|
mistym has joined #archiveteam |
17:12
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
17:15
🔗
|
|
dashcloud has joined #archiveteam |
17:20
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
17:21
🔗
|
|
Smiley has quit IRC (http://www.milkme.co.uk - You'll never understand.) |
17:21
🔗
|
|
Smiley has joined #archiveteam |
17:22
🔗
|
|
dashcloud has joined #archiveteam |
17:23
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
17:26
🔗
|
|
dashcloud has joined #archiveteam |
17:54
🔗
|
|
primus104 has joined #archiveteam |
18:04
🔗
|
|
kyan has joined #archiveteam |
18:16
🔗
|
|
habi has joined #archiveteam |
18:17
🔗
|
|
signius has quit IRC (Ping timeout: 265 seconds) |
18:18
🔗
|
|
primus104 has quit IRC (Leaving.) |
18:20
🔗
|
|
kyan has quit IRC (Quit: Leaving) |
18:24
🔗
|
|
www2 has joined #archiveteam |
18:32
🔗
|
|
signius has joined #archiveteam |
18:53
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
19:02
🔗
|
|
habi has left |
19:10
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
19:16
🔗
|
|
dashcloud has joined #archiveteam |
19:16
🔗
|
|
www2 has quit IRC (Remote host closed the connection) |
19:28
🔗
|
|
primus104 has joined #archiveteam |
19:35
🔗
|
|
ehea617 has joined #archiveteam |
19:38
🔗
|
ehea617 |
Hey, I'm trying to create a wget script that downloads a website. I ran into an issue where the downloaded page refuses to load when javascript is enabled in the web browser. Is there a way to make sure wget doesn't download any javascript and automatically remove any <script> tags in an html document? |
19:42
🔗
|
|
Smiley has quit IRC (http://www.milkme.co.uk - You'll never understand.) |
19:42
🔗
|
|
Smiley has joined #archiveteam |
19:53
🔗
|
|
insane_al has quit IRC (Leaving) |
19:54
🔗
|
|
mistym has joined #archiveteam |
19:54
🔗
|
|
SN4T14_ has joined #archiveteam |
19:59
🔗
|
|
SN4T14 has quit IRC (Read error: Operation timed out) |
20:01
🔗
|
|
mistym has quit IRC (Read error: Operation timed out) |
20:30
🔗
|
|
Coderjoe has quit IRC (Read error: Operation timed out) |
20:31
🔗
|
schbirid |
ehea617: wget does not execute any js |
20:31
🔗
|
schbirid |
it only downloads js files if linked |
20:32
🔗
|
ehea617 |
I know |
20:32
🔗
|
ehea617 |
The issue is that if the page is opened when you have an active internet connection it crashes |
20:33
🔗
|
Sanqui |
that is a complex issue; I don't think wget can (or should) parse html and change the page source |
20:33
🔗
|
schbirid |
i have no idea what you are describing |
20:33
🔗
|
Sanqui |
you should find another tool, or write a script yourself (for example using python and beautifulsoup) |
20:33
🔗
|
schbirid |
you can make wget ignore certain URLs if needed |
20:34
🔗
|
Sanqui |
schbirid: i think they want to strip away <script> tags |
20:34
🔗
|
Sanqui |
ehea617: maybe you could just disable js in your browser? |
20:34
🔗
|
ehea617 |
that fixes it but I'm trying to make it work with js enabled |
20:35
🔗
|
Sanqui |
you need to modify the page source then; wget won't help you |
20:35
🔗
|
Sanqui |
wget's job is just to download stuff |
20:38
🔗
|
ehea617 |
does it not convert links to js? |
20:39
🔗
|
ehea617 |
I mean does it not convert the links to javascript to the script's location on the file system? |
20:40
🔗
|
schbirid |
if you use --convert-links it will update references to external files to the downloaded copies (if they were downloaded, you might need --page-requisites and --span-hosts) |
20:41
🔗
|
ehea617 |
I have those and it is converting links for everything fine except for links to javascript |
20:51
🔗
|
augusztin |
ehea617: if the script tags are manually constructed, then there is nothing wget can do |
20:51
🔗
|
augusztin |
you need to make your own script which will modify the resulting webpage yourself |
20:55
🔗
|
|
Coderjoe has joined #archiveteam |
21:19
🔗
|
|
ehea617 has quit IRC (Ping timeout: 240 seconds) |
21:43
🔗
|
|
BlueMaxim has joined #archiveteam |
21:43
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
21:45
🔗
|
|
RichardG has joined #archiveteam |
21:45
🔗
|
|
w0rp has joined #archiveteam |
21:47
🔗
|
|
w0rp has quit IRC (Client Quit) |
21:51
🔗
|
|
w0rp has joined #archiveteam |
21:56
🔗
|
|
w0rp has quit IRC (Quit: GONE) |
21:57
🔗
|
|
w0rp has joined #archiveteam |
22:05
🔗
|
|
w0rp has quit IRC (Quit: GONE) |
22:12
🔗
|
|
w0rp has joined #archiveteam |
22:18
🔗
|
|
bsmith093 has quit IRC (Read error: Operation timed out) |
22:31
🔗
|
|
nertzy has joined #archiveteam |
23:02
🔗
|
|
nertzy has quit IRC (This computer has gone to sleep) |