#archiveteam 2015-08-15,Sat

↑back Search

Time Nickname Message
00:04 πŸ”— dashcloud has joined #archiveteam
00:19 πŸ”— wvdp___ has joined #archiveteam
00:22 πŸ”— Stilett0 has joined #archiveteam
00:23 πŸ”— Stiletto has quit IRC (Ping timeout: 306 seconds)
00:24 πŸ”— JesseW has joined #archiveteam
00:27 πŸ”— godane has quit IRC (Ping timeout: 258 seconds)
00:29 πŸ”— godane has joined #archiveteam
00:38 πŸ”— godane has quit IRC (Quit: Leaving.)
00:38 πŸ”— godane has joined #archiveteam
00:43 πŸ”— philpem has quit IRC (Ping timeout: 252 seconds)
00:45 πŸ”— Stilett0 is now known as Stiletto
01:05 πŸ”— JesseW has quit IRC (Ping timeout: 600 seconds)
01:13 πŸ”— Start has joined #archiveteam
01:21 πŸ”— JesseW has joined #archiveteam
01:25 πŸ”— primus104 has quit IRC (Leaving.)
01:40 πŸ”— JesseW Whatever happened to http://archiveteam.org/index.php?title=FlickrFckr ?
01:49 πŸ”— Start has quit IRC (Quit: Disconnected.)
01:55 πŸ”— chfoo arkiver: i can only give logs for 2015-08-02 and 2015-08-07. the rest of the logs are on the old tracker
02:02 πŸ”— chfoo i checked redis and i don't know why it has settings stored as an item
02:27 πŸ”— S[h]O[r]T yipdw how do i compile wget-lua from our repo
02:28 πŸ”— yipdw S[h]O[r]T: autoconf; ./configure --prefix=PREFIX; make install
02:28 πŸ”— yipdw alternatively one of us can update the tarball but that will take some time
02:28 πŸ”— yipdw you can also try wpull if you're starting a new project
02:31 πŸ”— S[h]O[r]T im trying to get downloaders up for blip grab and that 5.18 error is stopping me :(
02:31 πŸ”— S[h]O[r]T getting problems configuring...trying to fix that
02:32 πŸ”— aaaaaaaaa try adding the following to the get-wget-lua script
02:32 πŸ”— aaaaaaaaa sed -e "s/\(item \)\([0-9]\)/\1\.\2/" ./doc/wget.texi > ./doc/wget.texi.tmp && mv ./doc/wget.texi.tmp ./doc/wget.texi
02:32 πŸ”— aaaaaaaaa at around line 38
02:35 πŸ”— S[h]O[r]T wget-lua successfully built.
02:36 πŸ”— S[h]O[r]T aaaaaaaaa is a useful name ive now learned
02:45 πŸ”— robink has quit IRC (Ping timeout: 492 seconds)
02:55 πŸ”— Start has joined #archiveteam
03:01 πŸ”— JesseW has quit IRC (Read error: Operation timed out)
03:10 πŸ”— robink has joined #archiveteam
04:10 πŸ”— aaaaaaaaa has quit IRC (Leaving)
04:31 πŸ”— JesseW has joined #archiveteam
04:42 πŸ”— xk_id has joined #archiveteam
04:52 πŸ”— xk_id has quit IRC (Remote host closed the connection)
05:07 πŸ”— brayden_ has joined #archiveteam
05:07 πŸ”— brayden has quit IRC (Read error: Connection reset by peer)
05:12 πŸ”— godane has quit IRC (Leaving.)
05:13 πŸ”— godane has joined #archiveteam
06:21 πŸ”— xk_id has joined #archiveteam
06:39 πŸ”— JesseW has quit IRC (Read error: Operation timed out)
06:55 πŸ”— bassiexp_ has joined #archiveteam
07:11 πŸ”— bassiexp_ has quit IRC (Quit: Page closed)
07:36 πŸ”— bentpins has joined #archiveteam
07:37 πŸ”— bentpins Any thought on soundcloud? http://thump.vice.com/en_au/article/the-great-soundcloud-purge-of-2015-has-begun
08:12 πŸ”— godane i'm grabbing the first 100k urls rss feeds
08:14 πŸ”— godane after that i can then give you guys a mp3 list
08:15 πŸ”— godane after 200 users rss urls i got 94 mp3 urls
08:16 πŸ”— arkiver godane: from what?
08:21 πŸ”— godane each url has a rss feed: http://feeds.soundcloud.com/users/soundcloud:users:648/sounds.rss
08:21 πŸ”— godane and a number
08:21 πŸ”— godane so it easly brute forceible
08:23 πŸ”— schbirid has joined #archiveteam
08:25 πŸ”— godane there are m4a files also: http://feeds.soundcloud.com/users/soundcloud:users:2/sounds.rss
08:26 πŸ”— godane code for getting mp3 urls in web archive: zcat *.warc.gz | grep url= | sed 's|.* url="||g' | sed 's|" .*||g'
08:57 πŸ”— bentpins Good stuff
09:01 πŸ”— arkiver I see
09:11 πŸ”— primus104 has joined #archiveteam
09:22 πŸ”— arkiver SketchCow: how the situation is right now we are likely ot able to get blip saved 100% before the deadline
09:22 πŸ”— arkiver SketchCow: I see you've been in contact with someone from blip, can you please ask him if blip's shutdown can be delayed by two weeks?
09:33 πŸ”— primus104 has quit IRC (Leaving.)
10:55 πŸ”— nmnn has joined #archiveteam
11:04 πŸ”— xk_id has quit IRC (Remote host closed the connection)
11:52 πŸ”— Ungstein has joined #archiveteam
11:53 πŸ”— Ungstein has quit IRC (Client Quit)
11:54 πŸ”— primus104 has joined #archiveteam
12:02 πŸ”— Ungstein has joined #archiveteam
12:35 πŸ”— schbirid does anyone know a decent twitter scraper for selected accounts that will grab their timeline, tweets and images (:orig!) without requiring you to submit your blood type and food preferences for OAuth twitter access?
12:35 πŸ”— schbirid for running daily or something
12:40 πŸ”— bentpins WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
12:41 πŸ”— xmc what is your quest
12:41 πŸ”— bentpins prices are a bit old http://www.archiveteam.org/index.php?title=Storage_Media
12:42 πŸ”— xmc yahoosucks
12:42 πŸ”— bentpins cheers
12:42 πŸ”— xmc <3
12:42 πŸ”— ersi bentpins: Thanks for updating them :)
12:42 πŸ”— ersi and welcome~
12:44 πŸ”— xmc ^
12:45 πŸ”— sivoais has quit IRC (Read error: Operation timed out)
12:45 πŸ”— espes__ has quit IRC (Read error: Operation timed out)
12:46 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
12:48 πŸ”— PurpleSym has joined #archiveteam
12:54 πŸ”— dashcloud has joined #archiveteam
12:57 πŸ”— sivoais has joined #archiveteam
13:05 πŸ”— SketchCow OK, I have two coding requests, if possible.
13:05 πŸ”— SketchCow The first, is a piece of code that, given a Wiki, pulls every external reference of that Wiki out and submits it to archivebot or Internet Archive.
13:07 πŸ”— SketchCow The second is can wait
13:07 πŸ”— ersi What kind of wiki? (MediaWiki?)
13:07 πŸ”— SketchCow sets mode: +oooo beardicus BlueMaxim Cameron_D chfoo
13:07 πŸ”— SketchCow sets mode: +oooo dashcloud db48x dcmorton DFJustin
13:07 πŸ”— SketchCow sets mode: +oo ersi Famicoman
13:08 πŸ”— SketchCow Yes, Mediawiki.
13:09 πŸ”— ersi Hm~
13:09 πŸ”— ersi I guess we can use the wikiteam dump scrips and then suck out the URLs from the dump
13:18 πŸ”— espes__ has joined #archiveteam
13:22 πŸ”— PurpleSym If the wiki is still online there’s https://www.mediawiki.org/wiki/Help:Linksearch
13:22 πŸ”— arkiver SketchCow: I'll write a bit of code for that
13:26 πŸ”— wvdp_ has joined #archiveteam
13:27 πŸ”— SketchCow I didn't know about :Linksearch
13:27 πŸ”— SketchCow That's very nice. It might be useful for the script.
13:32 πŸ”— wvdp___ has quit IRC (Read error: Operation timed out)
13:39 πŸ”— nmnn has quit IRC (Ping timeout: 483 seconds)
13:40 πŸ”— Stiletto has quit IRC (Read error: Operation timed out)
13:41 πŸ”— Stiletto has joined #archiveteam
13:50 πŸ”— expr_ has joined #archiveteam
13:52 πŸ”— BlueMaxim has quit IRC (Read error: Connection reset by peer)
13:52 πŸ”— bentpins https://news.ycombinator.com/item?id=10064565
13:57 πŸ”— SketchCow Fuck THAT guy and his manuals
14:08 πŸ”— bentpins The guy who runs the store?
14:10 πŸ”— rogal has joined #archiveteam
14:12 πŸ”— rogal hi! After some time I'm reloading my project of archiving ownlog.com blog service. I'm an author of ownlog-grab scripts on archiveteam's github. Tracker for this project is also ready
14:13 πŸ”— rogal What's the next step? I suppose I should have some permissions to upload items to the tracker - and I need rsync account created for this project
14:14 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
14:14 πŸ”— rogal has joined #archiveteam
14:27 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
14:29 πŸ”— rogal has joined #archiveteam
14:39 πŸ”— chfoo has quit IRC (Ping timeout: 258 seconds)
14:43 πŸ”— Stiletto has quit IRC ()
14:54 πŸ”— xk_id has joined #archiveteam
15:14 πŸ”— SimpBrain has joined #archiveteam
15:15 πŸ”— Stiletto has joined #archiveteam
15:22 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
15:22 πŸ”— rogal has joined #archiveteam
15:26 πŸ”— chfoo has joined #archiveteam
15:29 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
15:30 πŸ”— rogal has joined #archiveteam
15:42 πŸ”— Stiletto has quit IRC ()
15:45 πŸ”— Froggypwn has quit IRC (Ping timeout: 606 seconds)
15:46 πŸ”— Froggypwn has joined #archiveteam
15:53 πŸ”— Stiletto has joined #archiveteam
15:56 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
15:56 πŸ”— rogal has joined #archiveteam
16:37 πŸ”— nmnn has joined #archiveteam
16:37 πŸ”— xk_id has quit IRC (Read error: Connection reset by peer)
16:40 πŸ”— chfoo0 has joined #archiveteam
16:41 πŸ”— bamboo has joined #archiveteam
16:41 πŸ”— bamboo hi
16:41 πŸ”— bamboo anyone here working on blingee
16:46 πŸ”— chfoo has quit IRC (Ping timeout: 483 seconds)
16:46 πŸ”— bamboo i'd like to try scraping the stamps, which are stored as swfs
16:47 πŸ”— bamboo bit of a process to get at them
16:49 πŸ”— bamboo they're all stored as swfs
16:52 πŸ”— xk_id has joined #archiveteam
16:57 πŸ”— chfoo0 is now known as chfoo
16:58 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
16:58 πŸ”— rogal has joined #archiveteam
17:16 πŸ”— garyrh I am!
17:16 πŸ”— garyrh bamboo, Do you have an example stamp/swf url?
17:16 πŸ”— chfoo0 has joined #archiveteam
17:19 πŸ”— nmnn has quit IRC (Ping timeout: 483 seconds)
17:22 πŸ”— nertzy has joined #archiveteam
17:23 πŸ”— chfoo has quit IRC (Ping timeout: 483 seconds)
17:27 πŸ”— JesseW has joined #archiveteam
17:27 πŸ”— bamboo trying to get one, i don't htink i can generate them programatically
17:28 πŸ”— bamboo i was going to scrape their search pages http://blingee.com/stamp/embedded_list?query=cat
17:28 πŸ”— bamboo which pass an encrypted string back to the main blingee editor (flash app) which i decompiled and am lookin through
17:28 πŸ”— bamboo they're AES encrypted
17:28 πŸ”— bamboo the key appears to be "rAI1P8bpXoReutED8XOTT0lh26MWhWz87IH4t39LjJp3wxLkEHDKE2Er"
17:32 πŸ”— garyrh From what I've seen, you can access stamps via http://blingee.com/stamp/view/$ID and then search the html for the bigbox div.
17:32 πŸ”— garyrh For example, http://blingee.com/stamp/view/4906955 and http://image.blingee.com/images18/content/output/000/000/000/04a/670662943_920758.gif
17:32 πŸ”— garyrh Not sure if that works for all of them though.
17:35 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
17:36 πŸ”— rogal has joined #archiveteam
17:36 πŸ”— expr_ has quit IRC (My Mac has gone to sleep. ZZZzzz…)
17:40 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
17:40 πŸ”— rogal has joined #archiveteam
17:43 πŸ”— aaaaaaaaa has joined #archiveteam
17:48 πŸ”— bamboo the gifs are useless though, they all have that checkerboard pattern
17:49 πŸ”— bamboo i decrypted this thing finally lol
17:49 πŸ”— bamboo http://image.blingee.com/images19/content/output/000/000/000/083/856589260_1244670.swf
17:49 πŸ”— bamboo this is what the app is actually using
17:49 πŸ”— bamboo they have transparency
17:49 πŸ”— bamboo you can't generate the swf url from the gif alas
17:50 πŸ”— bamboo the swf stickers actually have full alpha transparency
17:51 πŸ”— garyrh ah
17:52 πŸ”— bamboo i think it would be feasible to scrape search, decode these strings, and grab the swfs
17:52 πŸ”— bamboo i'll see if there's something else we can scrape
17:53 πŸ”— JesseW has quit IRC (Leaving.)
17:53 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
17:53 πŸ”— bamboo it seems like the archive bot has a lot of blingee captured already
17:54 πŸ”— rogal has joined #archiveteam
17:54 πŸ”— bamboo but these stamps are valuable, other gif-stamp sites exist but don't have the range
17:54 πŸ”— bamboo alarming: the top stamp names are in korean
17:54 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
17:55 πŸ”— rogal has joined #archiveteam
17:55 πŸ”— nmnn has joined #archiveteam
17:56 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
17:56 πŸ”— Start has quit IRC (Quit: Disconnected.)
17:56 πŸ”— rogal has joined #archiveteam
17:57 πŸ”— garyrh bamboo, do you know if there are swf urls for the actual blingees? or is it just gifs?
17:57 πŸ”— bamboo i just pasted a swf url
17:58 πŸ”— garyrh I mean non-stamps, like http://blingee.com/blingee/view/1
17:58 πŸ”— bamboo ah no, i think the final output is a gif
17:59 πŸ”— bamboo lol 3000 cat swfs
18:04 πŸ”— nmnn has quit IRC (Ping timeout: 483 seconds)
18:06 πŸ”— bamboo wonder how you could get a list of stamp tags
18:06 πŸ”— bamboo ah the stamp pages themselves have them
18:12 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
18:14 πŸ”— rogal has joined #archiveteam
18:19 πŸ”— rogal has quit IRC (Read error: Connection reset by peer)
18:20 πŸ”— lexicon meta-meta2-B /buffer +1
18:20 πŸ”— lexicon sorry
18:26 πŸ”— bamboo the funny thing is a lot of these stamps have that gradient watermark pattern on them, idgi
18:30 πŸ”— brayden_ has quit IRC (Read error: Connection reset by peer)
18:31 πŸ”— primus104 has quit IRC (Leaving.)
18:33 πŸ”— chfoo0 has quit IRC (Ping timeout: 483 seconds)
18:43 πŸ”— nmnn has joined #archiveteam
18:46 πŸ”— JesseW has joined #archiveteam
19:08 πŸ”— godane has quit IRC (Quit: Leaving.)
19:15 πŸ”— primus104 has joined #archiveteam
19:27 πŸ”— aliz has quit IRC (Ping timeout: 252 seconds)
19:27 πŸ”— nmnn has quit IRC (Ping timeout: 483 seconds)
19:34 πŸ”— habi has joined #archiveteam
19:40 πŸ”— habi has left
19:43 πŸ”— yan has joined #archiveteam
19:46 πŸ”— yan has quit IRC (Client Quit)
20:07 πŸ”— nertzy has quit IRC (Quit: This computer has gone to sleep)
20:15 πŸ”— nertzy has joined #archiveteam
20:30 πŸ”— philpem has joined #archiveteam
20:32 πŸ”— dashcloud has quit IRC (Remote host closed the connection)
20:34 πŸ”— dashcloud has joined #archiveteam
20:39 πŸ”— bentpins has quit IRC (Quit: Leaving)
20:50 πŸ”— JesseW has quit IRC (Leaving.)
21:10 πŸ”— PurpleSym has quit IRC (Remote host closed the connection)
21:17 πŸ”— JesseW has joined #archiveteam
21:19 πŸ”— SimpBrain has quit IRC (Leaving)
21:19 πŸ”— bamboo welp i'm fetching 188 pages of "cat" swfs
21:20 πŸ”— bamboo my friend wrote something to dump pngs out of the swfs
21:21 πŸ”— garyrh Great!
21:21 πŸ”— bamboo would it be funny to merge the pngs into an apng
21:27 πŸ”— aliz has joined #archiveteam
21:29 πŸ”— aaaaaaaaa That would be one massive collage
21:29 πŸ”— aaaaaaaaa oh, oops read that wrong
21:32 πŸ”— aliz has quit IRC (Remote host closed the connection)
21:37 πŸ”— JesseW has quit IRC (Ping timeout: 600 seconds)
21:43 πŸ”— godane has joined #archiveteam
22:05 πŸ”— chfoo has joined #archiveteam
22:17 πŸ”— nertzy has quit IRC (Quit: This computer has gone to sleep)
22:21 πŸ”— nertzy has joined #archiveteam
22:42 πŸ”— arkiver bamboo: scripts for blingee for a warrior project are (hopefully) ready tomorrow
22:44 πŸ”— arkiver if you have anything you think we should know about, please write something about it here http://archiveteam.org/index.php?title=Blingee
22:47 πŸ”— bamboo oh nice
22:47 πŸ”— bamboo WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
22:47 πŸ”— garyrh yahoosucks
22:47 πŸ”— arkiver yahoosucks
22:47 πŸ”— arkiver heh
22:47 πŸ”— bamboo lol
22:48 πŸ”— bamboo i should probably incorporate my thing into yours, i made something to focus on the swfs
22:49 πŸ”— garyrh I'm almost done with the scripts: https://github.com/garyrh/blingee-grab
22:49 πŸ”— bamboo would need to move it over to lua, presumably
22:49 πŸ”— garyrh You could do it in Lua, or in Python and just call it from the Lua script.
22:49 πŸ”— bamboo cool i'll have a look later
22:51 πŸ”— dcmorton has quit IRC (Quit: ZNC - http://znc.in)
22:54 πŸ”— dcmorton has joined #archiveteam
23:00 πŸ”— wvdp___ has joined #archiveteam
23:01 πŸ”— bamboo my swf scraper is here https://github.com/julescarbon/blingee-stamp
23:01 πŸ”— bamboo written in javascript because it was ready-to-hand, hope that's cool
23:06 πŸ”— wvdp_ has quit IRC (Read error: Operation timed out)
23:26 πŸ”— aaaaaaaa_ has joined #archiveteam
23:26 πŸ”— aaaaaaaaa has quit IRC (Read error: Connection reset by peer)
23:27 πŸ”— aaaaaaaa_ is now known as aaaaaaaaa
23:36 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
23:39 πŸ”— dashcloud has joined #archiveteam
23:40 πŸ”— Start has joined #archiveteam
23:46 πŸ”— dcmorton_ has joined #archiveteam
23:50 πŸ”— BlueMaxim has joined #archiveteam

irclogger-viewer