Time |
Nickname |
Message |
02:24
π
|
Dec-31-99 |
Why is there a "Tele2" part of the Warrior dashboard? |
02:24
π
|
vantec |
Because that is a project. |
02:24
π
|
vantec |
Part of the swipnet project. |
02:24
π
|
Dec-31-99 |
Why is Swipnet and Tele2 on separate dashboards, I'm saying. |
02:25
π
|
vantec |
Different pipelines and code. |
02:26
π
|
Dec-31-99 |
Code? |
02:27
π
|
vantec |
https://github.com/ArchiveTeam/tele2-grab/commits/master |
02:28
π
|
vantec |
You can read alll the changes there between swipnet and tele2 |
02:28
π
|
vantec |
*all |
02:29
π
|
Dec-31-99 |
Ah, okay. I also installed cygwin, like suggested by many. :) |
02:37
π
|
Dec-31-99 |
Are there any URL changes with the Tele2 tracker? |
02:37
π
|
vantec |
URL changes as far as what? |
02:38
π
|
Dec-31-99 |
Are the crawled URL names (e.g. home.swipnet.se) in the Tele2 tracker different from the ones in the Swipnet tracker? That's what I'm asking. |
02:39
π
|
vantec |
Yes, they use http://home.tele2.se/ |
02:40
π
|
Dec-31-99 |
Cool! |
02:43
π
|
Dec-31-99 |
Doesn't look like much to crawl...: http://preview.tinyurl.com/n2xtglf |
02:44
π
|
Dec-31-99 |
I only got 255 results |
02:45
π
|
aaaaaaaaa |
their scraping is a little more thorough, and doesn't have safe search on |
02:46
π
|
Dec-31-99 |
Dang, I'll get more than 250,000 results without SafeSearch? O_O |
02:48
π
|
vantec |
Well the internet is only good for 2 things |
02:49
π
|
Dec-31-99 |
What are they? |
02:50
π
|
vantec |
youtube and https://www.youtube.com/watch?v=j6eFNRKEROw |
02:50
π
|
aaaaaaaaa |
vantec: I was hoping you would link to that. |
02:51
π
|
Dec-31-99 |
I tried with SafeSearch off. Still gave 255 results. |
06:28
π
|
EG |
fuck me Rotab just fuck me |
06:28
π
|
EG |
Rotab: just put it in my butt |
06:29
π
|
EG |
my man pussy is ready for that whole lot of man Rotab |
06:48
π
|
xmc |
this eg troll is chasing me around irc |
06:48
π
|
xmc |
joke's on him, I'm going to watch tv and drink scotch |
06:48
π
|
xmc |
ding my disconnected screen session all you want, fool |
09:08
π
|
schbirid |
http://www.zara.com/il/en/kids/baby-boy-(3-months---3-years)/t-shirts/striped-%22sheriff%22-t-shirt-c271048p2096022.html please |
09:13
π
|
midas |
oh god no, i found it |
09:22
π
|
midas |
schbirid: i have the warc file, just need to grab the actual data |
09:23
π
|
midas |
for now, i have saved the entire page just to be sure it doesnt get lost |
09:23
π
|
schbirid |
oh this wasnt archivebot's channel =) |
09:23
π
|
schbirid |
never thought about regional "fences" for archiving |
09:24
π
|
midas |
and a bloody redirect page |
09:24
π
|
midas |
not sure if it grabbed it correctly |
09:25
π
|
midas |
(archivebot that is) |
10:35
π
|
godane |
the jay severin show is starting to get uploaded: https://archive.org/details/the-jay-severin-show-02-25-2013 |
10:36
π
|
godane |
i'm also uploading 2005 bootlegs of pear jam |
10:36
π
|
godane |
and msnbc.com videos for 2004-09 |
10:37
π
|
Smiley |
pear jam/ |
10:37
π
|
Smiley |
yum yum |
10:49
π
|
godane |
there are also rnc speeches in this item: https://archive.org/details/msnbc.com-video-2004-09-02 |
10:50
π
|
ohhdemgir |
godane, those pj from the site i linked a while back (open dir of torrents) |
10:50
π
|
ohhdemgir |
? |
10:50
π
|
godane |
yes |
10:51
π
|
ohhdemgir |
I need to update from there it's been awhile |
10:52
π
|
godane |
ohhdemgir: http://www.guitars101.com/forums/f145/ |
10:52
π
|
godane |
they have tons of bootlegs |
10:52
π
|
ohhdemgir |
:D |
10:53
π
|
godane |
i already got that topic a week ago |
10:53
π
|
godane |
i'm grabbing all filefactory links since i have premum |
10:58
π
|
ohhdemgir |
stuffing them into ia? |
11:14
π
|
godane |
yes |
11:16
π
|
godane |
looks like there are beatles boot legs too |
11:41
π
|
godane |
where is my welcome to the scene item |
11:43
π
|
godane |
found it: https://archive.org/details/welcometothescene_version2.0_xvid |
11:44
π
|
godane |
i'm going to re uploaded so SketchCow can get a collection of it |
11:44
π
|
godane |
i'm also my get all 3 formats for each episode |
11:48
π
|
Rotab |
haha, welcome to the scene |
11:49
π
|
godane |
i have to at least upload season 1 |
11:49
π
|
godane |
that is not fully there |
12:18
π
|
godane |
i'm uploading the www.guitars101.com lossess audio bootlegs topics pages |
12:33
π
|
godane |
uploaded: https://archive.org/details/www.guitars101.com-topics-lossless-audio-bootlegs-20140817 |
12:58
π
|
godane |
good news everyone |
12:58
π
|
godane |
i found a way to brute force the older nbc clips |
12:58
π
|
godane |
i added -T 1 -t 1 |
13:02
π
|
godane |
now i just need a way to force a try no matter what when i get this error: wget: unable to resolve host address Γ’ΒΒmsnbc.vo.llnwd.netΓ’ΒΒ |
13:34
π
|
godane |
I NEED HELP |
13:35
π
|
godane |
i don't know how to force wget to just keep retry if there is a resolve hosting problem |
13:35
π
|
midas |
you can set the retry to 9000 or something with 5 second intervals |
13:37
π
|
godane |
how? |
13:39
π
|
midas |
-T 5 -t 9000 |
13:39
π
|
midas |
it will keep trying forever |
13:39
π
|
midas |
or atleast for 9000x5 seconds |
13:41
π
|
godane |
problem with that is i need just a retry on unresolve host error |
13:41
π
|
godane |
not if file is not there |
13:41
π
|
godane |
the msnbc.vo.llnwd.net/l1/video/flash path as long wait time as it is |
13:41
π
|
godane |
if file is not there |
13:41
π
|
midas |
maybe with --retry-connrefused ? |
13:41
π
|
godane |
i have that in it |
13:44
π
|
midas |
--dns-timeout specified in seconds will abort the grab |
13:44
π
|
midas |
for that file |
13:44
π
|
midas |
maybe with a output file for your grab and some grep to figure out which files failed? |
13:46
π
|
godane |
i have a out file |
13:46
π
|
godane |
i add --dns-timeout 5 to my script |
13:52
π
|
godane |
anyways i'm starting to upload 13 new episodes of diy tryin |
13:53
π
|
godane |
a revision 3 series |
15:23
π
|
godane |
looks like there is no full logs of jsmess on badcheese.com: |
15:23
π
|
godane |
http://badcheese.com/~steve/atlogs/?chan=jsmess&day=2014-06-02 |
16:52
π
|
yipdw |
ooh yes |
16:52
π
|
yipdw |
mistym: not sure if you saw this but http://chrisclee.com/axent-wear-kickstarter-shoot/ |
16:53
π
|
yipdw |
we can almost throw money at yuumei |
16:53
π
|
mistym |
yipdw: Oh cool! I didn't, thanks! |
18:19
π
|
EG|G |
xmc: fuck it harder |
18:21
π
|
yipdw |
xmc: nice, ban without kick |
18:21
π
|
yipdw |
I like it |
18:21
π
|
* |
xmc nods quietly |
19:18
π
|
SketchCow |
Salutations, bastards |
19:30
π
|
Smiley |
hey SketchCow |
19:30
π
|
Smiley |
idea: some kind of "info for press" page on the wiki? |
19:30
π
|
xmc |
the term of art is "press kit" |
19:31
π
|
Smiley |
you offering? :D |
19:31
π
|
* |
xmc hides |
19:31
π
|
SketchCow |
Probably planning |
19:31
π
|
SketchCow |
Started to work on it |
19:31
π
|
SketchCow |
Needed help, you guys are punks |
19:31
π
|
Smiley |
:D |
19:31
π
|
Smiley |
just we seem to be being noticed more |
19:32
π
|
SketchCow |
No, we've been hot shit for years |
19:32
π
|
SketchCow |
Like Jay Z |
19:32
π
|
Smiley |
:D |
19:32
π
|
Smiley |
no one noticed him until he put a ring on it |
20:45
π
|
godane |
i got this fun error: |
20:45
π
|
godane |
<Error><Code>AccessDenied</Code><Message>Access Denied</Message><Resource>This item has been taken offline</Resource><RequestId>63d82d14-fdcd-414b-996b-c86410635bbf</RequestId></Error> |
20:45
π
|
godane |
i have access to it back now |
22:33
π
|
dashcloud |
I did figure out how to download just the pictures from the shareware cd collection, and if you're interested, this is how I did it: ia list -l $each | ia download -v -c -o --format='JPEG' --format='PNG' $each |
22:34
π
|
SketchCow |
That'll do it |
22:47
π
|
dashcloud |
good thing IA has such good metadata- it helped me solve a problem I wasn't sure how to solve otherwise |
23:15
π
|
DFJustin |
there are some GIFs in there too I think |