Time |
Nickname |
Message |
00:04
π
|
|
dashcloud has joined #archiveteam |
00:19
π
|
|
wvdp___ has joined #archiveteam |
00:22
π
|
|
Stilett0 has joined #archiveteam |
00:23
π
|
|
Stiletto has quit IRC (Ping timeout: 306 seconds) |
00:24
π
|
|
JesseW has joined #archiveteam |
00:27
π
|
|
godane has quit IRC (Ping timeout: 258 seconds) |
00:29
π
|
|
godane has joined #archiveteam |
00:38
π
|
|
godane has quit IRC (Quit: Leaving.) |
00:38
π
|
|
godane has joined #archiveteam |
00:43
π
|
|
philpem has quit IRC (Ping timeout: 252 seconds) |
00:45
π
|
|
Stilett0 is now known as Stiletto |
01:05
π
|
|
JesseW has quit IRC (Ping timeout: 600 seconds) |
01:13
π
|
|
Start has joined #archiveteam |
01:21
π
|
|
JesseW has joined #archiveteam |
01:25
π
|
|
primus104 has quit IRC (Leaving.) |
01:40
π
|
JesseW |
Whatever happened to http://archiveteam.org/index.php?title=FlickrFckr ? |
01:49
π
|
|
Start has quit IRC (Quit: Disconnected.) |
01:55
π
|
chfoo |
arkiver: i can only give logs for 2015-08-02 and 2015-08-07. the rest of the logs are on the old tracker |
02:02
π
|
chfoo |
i checked redis and i don't know why it has settings stored as an item |
02:27
π
|
S[h]O[r]T |
yipdw how do i compile wget-lua from our repo |
02:28
π
|
yipdw |
S[h]O[r]T: autoconf; ./configure --prefix=PREFIX; make install |
02:28
π
|
yipdw |
alternatively one of us can update the tarball but that will take some time |
02:28
π
|
yipdw |
you can also try wpull if you're starting a new project |
02:31
π
|
S[h]O[r]T |
im trying to get downloaders up for blip grab and that 5.18 error is stopping me :( |
02:31
π
|
S[h]O[r]T |
getting problems configuring...trying to fix that |
02:32
π
|
aaaaaaaaa |
try adding the following to the get-wget-lua script |
02:32
π
|
aaaaaaaaa |
sed -e "s/\(item \)\([0-9]\)/\1\.\2/" ./doc/wget.texi > ./doc/wget.texi.tmp && mv ./doc/wget.texi.tmp ./doc/wget.texi |
02:32
π
|
aaaaaaaaa |
at around line 38 |
02:35
π
|
S[h]O[r]T |
wget-lua successfully built. |
02:36
π
|
S[h]O[r]T |
aaaaaaaaa is a useful name ive now learned |
02:45
π
|
|
robink has quit IRC (Ping timeout: 492 seconds) |
02:55
π
|
|
Start has joined #archiveteam |
03:01
π
|
|
JesseW has quit IRC (Read error: Operation timed out) |
03:10
π
|
|
robink has joined #archiveteam |
04:10
π
|
|
aaaaaaaaa has quit IRC (Leaving) |
04:31
π
|
|
JesseW has joined #archiveteam |
04:42
π
|
|
xk_id has joined #archiveteam |
04:52
π
|
|
xk_id has quit IRC (Remote host closed the connection) |
05:07
π
|
|
brayden_ has joined #archiveteam |
05:07
π
|
|
brayden has quit IRC (Read error: Connection reset by peer) |
05:12
π
|
|
godane has quit IRC (Leaving.) |
05:13
π
|
|
godane has joined #archiveteam |
06:21
π
|
|
xk_id has joined #archiveteam |
06:39
π
|
|
JesseW has quit IRC (Read error: Operation timed out) |
06:55
π
|
|
bassiexp_ has joined #archiveteam |
07:11
π
|
|
bassiexp_ has quit IRC (Quit: Page closed) |
07:36
π
|
|
bentpins has joined #archiveteam |
07:37
π
|
bentpins |
Any thought on soundcloud? http://thump.vice.com/en_au/article/the-great-soundcloud-purge-of-2015-has-begun |
08:12
π
|
godane |
i'm grabbing the first 100k urls rss feeds |
08:14
π
|
godane |
after that i can then give you guys a mp3 list |
08:15
π
|
godane |
after 200 users rss urls i got 94 mp3 urls |
08:16
π
|
arkiver |
godane: from what? |
08:21
π
|
godane |
each url has a rss feed: http://feeds.soundcloud.com/users/soundcloud:users:648/sounds.rss |
08:21
π
|
godane |
and a number |
08:21
π
|
godane |
so it easly brute forceible |
08:23
π
|
|
schbirid has joined #archiveteam |
08:25
π
|
godane |
there are m4a files also: http://feeds.soundcloud.com/users/soundcloud:users:2/sounds.rss |
08:26
π
|
godane |
code for getting mp3 urls in web archive: zcat *.warc.gz | grep url= | sed 's|.* url="||g' | sed 's|" .*||g' |
08:57
π
|
bentpins |
Good stuff |
09:01
π
|
arkiver |
I see |
09:11
π
|
|
primus104 has joined #archiveteam |
09:22
π
|
arkiver |
SketchCow: how the situation is right now we are likely ot able to get blip saved 100% before the deadline |
09:22
π
|
arkiver |
SketchCow: I see you've been in contact with someone from blip, can you please ask him if blip's shutdown can be delayed by two weeks? |
09:33
π
|
|
primus104 has quit IRC (Leaving.) |
10:55
π
|
|
nmnn has joined #archiveteam |
11:04
π
|
|
xk_id has quit IRC (Remote host closed the connection) |
11:52
π
|
|
Ungstein has joined #archiveteam |
11:53
π
|
|
Ungstein has quit IRC (Client Quit) |
11:54
π
|
|
primus104 has joined #archiveteam |
12:02
π
|
|
Ungstein has joined #archiveteam |
12:35
π
|
schbirid |
does anyone know a decent twitter scraper for selected accounts that will grab their timeline, tweets and images (:orig!) without requiring you to submit your blood type and food preferences for OAuth twitter access? |
12:35
π
|
schbirid |
for running daily or something |
12:40
π
|
bentpins |
WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD |
12:41
π
|
xmc |
what is your quest |
12:41
π
|
bentpins |
prices are a bit old http://www.archiveteam.org/index.php?title=Storage_Media |
12:42
π
|
xmc |
yahoosucks |
12:42
π
|
bentpins |
cheers |
12:42
π
|
xmc |
<3 |
12:42
π
|
ersi |
bentpins: Thanks for updating them :) |
12:42
π
|
ersi |
and welcome~ |
12:44
π
|
xmc |
^ |
12:45
π
|
|
sivoais has quit IRC (Read error: Operation timed out) |
12:45
π
|
|
espes__ has quit IRC (Read error: Operation timed out) |
12:46
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
12:48
π
|
|
PurpleSym has joined #archiveteam |
12:54
π
|
|
dashcloud has joined #archiveteam |
12:57
π
|
|
sivoais has joined #archiveteam |
13:05
π
|
SketchCow |
OK, I have two coding requests, if possible. |
13:05
π
|
SketchCow |
The first, is a piece of code that, given a Wiki, pulls every external reference of that Wiki out and submits it to archivebot or Internet Archive. |
13:07
π
|
SketchCow |
The second is can wait |
13:07
π
|
ersi |
What kind of wiki? (MediaWiki?) |
13:07
π
|
|
SketchCow sets mode: +oooo beardicus BlueMaxim Cameron_D chfoo |
13:07
π
|
|
SketchCow sets mode: +oooo dashcloud db48x dcmorton DFJustin |
13:07
π
|
|
SketchCow sets mode: +oo ersi Famicoman |
13:08
π
|
SketchCow |
Yes, Mediawiki. |
13:09
π
|
ersi |
Hm~ |
13:09
π
|
ersi |
I guess we can use the wikiteam dump scrips and then suck out the URLs from the dump |
13:18
π
|
|
espes__ has joined #archiveteam |
13:22
π
|
PurpleSym |
If the wiki is still online thereβs https://www.mediawiki.org/wiki/Help:Linksearch |
13:22
π
|
arkiver |
SketchCow: I'll write a bit of code for that |
13:26
π
|
|
wvdp_ has joined #archiveteam |
13:27
π
|
SketchCow |
I didn't know about :Linksearch |
13:27
π
|
SketchCow |
That's very nice. It might be useful for the script. |
13:32
π
|
|
wvdp___ has quit IRC (Read error: Operation timed out) |
13:39
π
|
|
nmnn has quit IRC (Ping timeout: 483 seconds) |
13:40
π
|
|
Stiletto has quit IRC (Read error: Operation timed out) |
13:41
π
|
|
Stiletto has joined #archiveteam |
13:50
π
|
|
expr_ has joined #archiveteam |
13:52
π
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
13:52
π
|
bentpins |
https://news.ycombinator.com/item?id=10064565 |
13:57
π
|
SketchCow |
Fuck THAT guy and his manuals |
14:08
π
|
bentpins |
The guy who runs the store? |
14:10
π
|
|
rogal has joined #archiveteam |
14:12
π
|
rogal |
hi! After some time I'm reloading my project of archiving ownlog.com blog service. I'm an author of ownlog-grab scripts on archiveteam's github. Tracker for this project is also ready |
14:13
π
|
rogal |
What's the next step? I suppose I should have some permissions to upload items to the tracker - and I need rsync account created for this project |
14:14
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
14:14
π
|
|
rogal has joined #archiveteam |
14:27
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
14:29
π
|
|
rogal has joined #archiveteam |
14:39
π
|
|
chfoo has quit IRC (Ping timeout: 258 seconds) |
14:43
π
|
|
Stiletto has quit IRC () |
14:54
π
|
|
xk_id has joined #archiveteam |
15:14
π
|
|
SimpBrain has joined #archiveteam |
15:15
π
|
|
Stiletto has joined #archiveteam |
15:22
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
15:22
π
|
|
rogal has joined #archiveteam |
15:26
π
|
|
chfoo has joined #archiveteam |
15:29
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
15:30
π
|
|
rogal has joined #archiveteam |
15:42
π
|
|
Stiletto has quit IRC () |
15:45
π
|
|
Froggypwn has quit IRC (Ping timeout: 606 seconds) |
15:46
π
|
|
Froggypwn has joined #archiveteam |
15:53
π
|
|
Stiletto has joined #archiveteam |
15:56
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
15:56
π
|
|
rogal has joined #archiveteam |
16:37
π
|
|
nmnn has joined #archiveteam |
16:37
π
|
|
xk_id has quit IRC (Read error: Connection reset by peer) |
16:40
π
|
|
chfoo0 has joined #archiveteam |
16:41
π
|
|
bamboo has joined #archiveteam |
16:41
π
|
bamboo |
hi |
16:41
π
|
bamboo |
anyone here working on blingee |
16:46
π
|
|
chfoo has quit IRC (Ping timeout: 483 seconds) |
16:46
π
|
bamboo |
i'd like to try scraping the stamps, which are stored as swfs |
16:47
π
|
bamboo |
bit of a process to get at them |
16:49
π
|
bamboo |
they're all stored as swfs |
16:52
π
|
|
xk_id has joined #archiveteam |
16:57
π
|
|
chfoo0 is now known as chfoo |
16:58
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
16:58
π
|
|
rogal has joined #archiveteam |
17:16
π
|
garyrh |
I am! |
17:16
π
|
garyrh |
bamboo, Do you have an example stamp/swf url? |
17:16
π
|
|
chfoo0 has joined #archiveteam |
17:19
π
|
|
nmnn has quit IRC (Ping timeout: 483 seconds) |
17:22
π
|
|
nertzy has joined #archiveteam |
17:23
π
|
|
chfoo has quit IRC (Ping timeout: 483 seconds) |
17:27
π
|
|
JesseW has joined #archiveteam |
17:27
π
|
bamboo |
trying to get one, i don't htink i can generate them programatically |
17:28
π
|
bamboo |
i was going to scrape their search pages http://blingee.com/stamp/embedded_list?query=cat |
17:28
π
|
bamboo |
which pass an encrypted string back to the main blingee editor (flash app) which i decompiled and am lookin through |
17:28
π
|
bamboo |
they're AES encrypted |
17:28
π
|
bamboo |
the key appears to be "rAI1P8bpXoReutED8XOTT0lh26MWhWz87IH4t39LjJp3wxLkEHDKE2Er" |
17:32
π
|
garyrh |
From what I've seen, you can access stamps via http://blingee.com/stamp/view/$ID and then search the html for the bigbox div. |
17:32
π
|
garyrh |
For example, http://blingee.com/stamp/view/4906955 and http://image.blingee.com/images18/content/output/000/000/000/04a/670662943_920758.gif |
17:32
π
|
garyrh |
Not sure if that works for all of them though. |
17:35
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
17:36
π
|
|
rogal has joined #archiveteam |
17:36
π
|
|
expr_ has quit IRC (My Mac has gone to sleep. ZZZzzzβ¦) |
17:40
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
17:40
π
|
|
rogal has joined #archiveteam |
17:43
π
|
|
aaaaaaaaa has joined #archiveteam |
17:48
π
|
bamboo |
the gifs are useless though, they all have that checkerboard pattern |
17:49
π
|
bamboo |
i decrypted this thing finally lol |
17:49
π
|
bamboo |
http://image.blingee.com/images19/content/output/000/000/000/083/856589260_1244670.swf |
17:49
π
|
bamboo |
this is what the app is actually using |
17:49
π
|
bamboo |
they have transparency |
17:49
π
|
bamboo |
you can't generate the swf url from the gif alas |
17:50
π
|
bamboo |
the swf stickers actually have full alpha transparency |
17:51
π
|
garyrh |
ah |
17:52
π
|
bamboo |
i think it would be feasible to scrape search, decode these strings, and grab the swfs |
17:52
π
|
bamboo |
i'll see if there's something else we can scrape |
17:53
π
|
|
JesseW has quit IRC (Leaving.) |
17:53
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
17:53
π
|
bamboo |
it seems like the archive bot has a lot of blingee captured already |
17:54
π
|
|
rogal has joined #archiveteam |
17:54
π
|
bamboo |
but these stamps are valuable, other gif-stamp sites exist but don't have the range |
17:54
π
|
bamboo |
alarming: the top stamp names are in korean |
17:54
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
17:55
π
|
|
rogal has joined #archiveteam |
17:55
π
|
|
nmnn has joined #archiveteam |
17:56
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
17:56
π
|
|
Start has quit IRC (Quit: Disconnected.) |
17:56
π
|
|
rogal has joined #archiveteam |
17:57
π
|
garyrh |
bamboo, do you know if there are swf urls for the actual blingees? or is it just gifs? |
17:57
π
|
bamboo |
i just pasted a swf url |
17:58
π
|
garyrh |
I mean non-stamps, like http://blingee.com/blingee/view/1 |
17:58
π
|
bamboo |
ah no, i think the final output is a gif |
17:59
π
|
bamboo |
lol 3000 cat swfs |
18:04
π
|
|
nmnn has quit IRC (Ping timeout: 483 seconds) |
18:06
π
|
bamboo |
wonder how you could get a list of stamp tags |
18:06
π
|
bamboo |
ah the stamp pages themselves have them |
18:12
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
18:14
π
|
|
rogal has joined #archiveteam |
18:19
π
|
|
rogal has quit IRC (Read error: Connection reset by peer) |
18:20
π
|
lexicon |
meta-meta2-B /buffer +1 |
18:20
π
|
lexicon |
sorry |
18:26
π
|
bamboo |
the funny thing is a lot of these stamps have that gradient watermark pattern on them, idgi |
18:30
π
|
|
brayden_ has quit IRC (Read error: Connection reset by peer) |
18:31
π
|
|
primus104 has quit IRC (Leaving.) |
18:33
π
|
|
chfoo0 has quit IRC (Ping timeout: 483 seconds) |
18:43
π
|
|
nmnn has joined #archiveteam |
18:46
π
|
|
JesseW has joined #archiveteam |
19:08
π
|
|
godane has quit IRC (Quit: Leaving.) |
19:15
π
|
|
primus104 has joined #archiveteam |
19:27
π
|
|
aliz has quit IRC (Ping timeout: 252 seconds) |
19:27
π
|
|
nmnn has quit IRC (Ping timeout: 483 seconds) |
19:34
π
|
|
habi has joined #archiveteam |
19:40
π
|
|
habi has left |
19:43
π
|
|
yan has joined #archiveteam |
19:46
π
|
|
yan has quit IRC (Client Quit) |
20:07
π
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
20:15
π
|
|
nertzy has joined #archiveteam |
20:30
π
|
|
philpem has joined #archiveteam |
20:32
π
|
|
dashcloud has quit IRC (Remote host closed the connection) |
20:34
π
|
|
dashcloud has joined #archiveteam |
20:39
π
|
|
bentpins has quit IRC (Quit: Leaving) |
20:50
π
|
|
JesseW has quit IRC (Leaving.) |
21:10
π
|
|
PurpleSym has quit IRC (Remote host closed the connection) |
21:17
π
|
|
JesseW has joined #archiveteam |
21:19
π
|
|
SimpBrain has quit IRC (Leaving) |
21:19
π
|
bamboo |
welp i'm fetching 188 pages of "cat" swfs |
21:20
π
|
bamboo |
my friend wrote something to dump pngs out of the swfs |
21:21
π
|
garyrh |
Great! |
21:21
π
|
bamboo |
would it be funny to merge the pngs into an apng |
21:27
π
|
|
aliz has joined #archiveteam |
21:29
π
|
aaaaaaaaa |
That would be one massive collage |
21:29
π
|
aaaaaaaaa |
oh, oops read that wrong |
21:32
π
|
|
aliz has quit IRC (Remote host closed the connection) |
21:37
π
|
|
JesseW has quit IRC (Ping timeout: 600 seconds) |
21:43
π
|
|
godane has joined #archiveteam |
22:05
π
|
|
chfoo has joined #archiveteam |
22:17
π
|
|
nertzy has quit IRC (Quit: This computer has gone to sleep) |
22:21
π
|
|
nertzy has joined #archiveteam |
22:42
π
|
arkiver |
bamboo: scripts for blingee for a warrior project are (hopefully) ready tomorrow |
22:44
π
|
arkiver |
if you have anything you think we should know about, please write something about it here http://archiveteam.org/index.php?title=Blingee |
22:47
π
|
bamboo |
oh nice |
22:47
π
|
bamboo |
WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD |
22:47
π
|
garyrh |
yahoosucks |
22:47
π
|
arkiver |
yahoosucks |
22:47
π
|
arkiver |
heh |
22:47
π
|
bamboo |
lol |
22:48
π
|
bamboo |
i should probably incorporate my thing into yours, i made something to focus on the swfs |
22:49
π
|
garyrh |
I'm almost done with the scripts: https://github.com/garyrh/blingee-grab |
22:49
π
|
bamboo |
would need to move it over to lua, presumably |
22:49
π
|
garyrh |
You could do it in Lua, or in Python and just call it from the Lua script. |
22:49
π
|
bamboo |
cool i'll have a look later |
22:51
π
|
|
dcmorton has quit IRC (Quit: ZNC - http://znc.in) |
22:54
π
|
|
dcmorton has joined #archiveteam |
23:00
π
|
|
wvdp___ has joined #archiveteam |
23:01
π
|
bamboo |
my swf scraper is here https://github.com/julescarbon/blingee-stamp |
23:01
π
|
bamboo |
written in javascript because it was ready-to-hand, hope that's cool |
23:06
π
|
|
wvdp_ has quit IRC (Read error: Operation timed out) |
23:26
π
|
|
aaaaaaaa_ has joined #archiveteam |
23:26
π
|
|
aaaaaaaaa has quit IRC (Read error: Connection reset by peer) |
23:27
π
|
|
aaaaaaaa_ is now known as aaaaaaaaa |
23:36
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
23:39
π
|
|
dashcloud has joined #archiveteam |
23:40
π
|
|
Start has joined #archiveteam |
23:46
π
|
|
dcmorton_ has joined #archiveteam |
23:50
π
|
|
BlueMaxim has joined #archiveteam |