Time |
Nickname |
Message |
00:10
🔗
|
godane |
so looks like i added 8 videos just to ephemera collection |
00:32
🔗
|
SketchCow |
They're cuties |
00:32
🔗
|
SketchCow |
I'm adding 100+ video game commercials. |
00:38
🔗
|
godane |
cool |
02:03
🔗
|
dashcloud |
that reminds me- I should check the ones I have against those- what collection are they in SketchCow ? |
02:06
🔗
|
SketchCow |
http://archive.org/details/videogamecommercials |
02:06
🔗
|
SketchCow |
It's past 330 commercials. |
02:07
🔗
|
SketchCow |
http://archive.org/movies/thumbnails.php?identifier=Legend_of_Zelda_The_The_Wind_Waker_2003_Nintendo_Jp |
03:19
🔗
|
S[h]O[r]T |
http://forums.somethingawful.com/showthread.php?threadid=3551611 |
03:37
🔗
|
omf_ |
don't have an SA account. |
03:41
🔗
|
S[h]O[r]T |
o well its a fun thread, Find old computers and power them on |
03:49
🔗
|
xmc |
I have an SA account |
03:49
🔗
|
xmc |
somewhere |
03:54
🔗
|
ivan` |
hah, they want $10 + $10 for archives + $10 for search |
04:39
🔗
|
DFJustin |
it's worth it |
04:39
🔗
|
DFJustin |
the coupons forum alone pays for itself |
04:41
🔗
|
DFJustin |
<joepie91> ... in hindsight, would it have been better to upload it as a .zip instead of the original .7z, so that you can browse it? |
04:41
🔗
|
DFJustin |
it would but there's no reason you can't edit the item now and have both |
06:14
🔗
|
yipdw |
jeez |
06:14
🔗
|
yipdw |
16018 ec2-user 20 0 180m 167m 6060 R 99.4 27.9 1:20.02 cc1 |
06:14
🔗
|
yipdw |
I like how building lxml's C extension requires 150+ megabytes of memory |
06:39
🔗
|
godane |
i'm starting to upload macbreak weekly |
06:39
🔗
|
godane |
or just macbreak its called |
06:42
🔗
|
ivan` |
yipdw: yum install python-lxml? |
07:03
🔗
|
yipdw |
ivan`: yeah, that'd probably make more sense |
07:48
🔗
|
godane |
uploaded: http://archive.org/details/macbreak20060125-001-mp4 |
07:48
🔗
|
godane |
i'm putting this as created by pixel corps cause most of the series was done by them |
07:55
🔗
|
godane |
twit only hosted the first 64 in audio format |
12:10
🔗
|
ersi |
Some of the videos/talks/recordings from Observe Hack Make are available: http://wipkip.nikhef.nl/events/OHM/video/ |
13:27
🔗
|
ersi |
joepie91: some videos/talks are released from OHM2013 btw |
14:41
🔗
|
omf_ |
ersi, you still doing uploads right? |
14:41
🔗
|
omf_ |
or you have more uploads to do |
14:44
🔗
|
ersi |
omf_: I'm uploading OHM2013 videos as we speak |
14:44
🔗
|
ersi |
They're being derived |
14:44
🔗
|
omf_ |
you using the web interface or the ias3upload script? |
14:44
🔗
|
ersi |
web interface |
14:44
🔗
|
ersi |
I'm also uploading DebConf12 videos |
14:44
🔗
|
ersi |
I'm starting to feel a little like godane o_o |
14:45
🔗
|
omf_ |
Well I am looking for someone else to try the next version of ias3upload I plan on releasing today. It is a bugfix release |
14:51
🔗
|
balrog |
omf_: are you still screwing with the yahoo group downloader thing? |
14:51
🔗
|
omf_ |
yes |
14:57
🔗
|
balrog |
I'm guessing you'd learn a lot by using tamperdata or even just web inspector to watch a yahoo login request |
16:25
🔗
|
joepie91 |
ersi: whee |
16:25
🔗
|
joepie91 |
you'll probably want SketchCow to add them to the OHM2013 collection |
16:25
🔗
|
joepie91 |
if you don't have access to that yet |
16:26
🔗
|
joepie91 |
https://archive.org/details/ohm2013 |
16:29
🔗
|
SketchCow |
ersi, just let me know your ia account name, and I'll go through it and 'do the right thing' |
16:31
🔗
|
Tephra |
SketchCow: do you have/know of a good twitter feed archiver? |
16:33
🔗
|
SketchCow |
We should really have a wiki entry for this |
16:36
🔗
|
Tephra |
script on http://www.archiveteam.org/index.php?title=Twitter hasn't worked for me |
16:38
🔗
|
Tephra |
Was thinking of writing one but thought i should ask if there was a good one first |
16:38
🔗
|
ersi |
SketchCow: I PM'd the e-mail |
16:39
🔗
|
ersi |
SketchCow: Feel free to create some collection for DebConf or DebConf12 as well |
16:43
🔗
|
ersi |
Yay, they derived \o/ http://archive.org/details/D2T113201308012300HardDisksMoreThanJustBlockDevicesSpriteTm http://archive.org/details/D4T204201301031400SimCardExploitationKarstenNohl |
17:54
🔗
|
godane |
so 2009 episodes of labrats.tv is uploaded |
18:06
🔗
|
joepie91 |
anyone has a backup of this? https://secure.flickr.com/photos/mosaaberising/ |
18:06
🔗
|
joepie91 |
<Sallam94x>they were removed by authorities ystrday |
18:06
🔗
|
joepie91 |
<Sallam94x>i wanted to leech them in case they get removed :( |
18:06
🔗
|
joepie91 |
<Sallam94x>that photographer was killed by cops during shooting those pics |
18:06
🔗
|
joepie91 |
<Sallam94x>those pics were considered as proof against cops regarding massacres in egypt :( |
18:10
🔗
|
DFJustin |
the pictures are still there, you just have to log in to a flickr account because they are flagged for content |
18:12
🔗
|
Tephra |
yep i'm able to view them |
18:12
🔗
|
SketchCow |
Save them anyway, please. |
18:13
🔗
|
Tephra |
will get on it |
19:12
🔗
|
omf_ |
I just read someone is releasing a 6" windows phone. 6"!?! I could not fit that in my pocket |
19:20
🔗
|
ersi |
Hah, weirdos |
19:22
🔗
|
omf_ |
cell phones themselves are getting bigger and bluetooth headsets get smaller |
19:28
🔗
|
joepie91 |
iiiiiiif anyone needs it for flickr |
19:28
🔗
|
joepie91 |
https://github.com/joepie91/main/blob/master/tools/scrapers/flickrgrab.py |
19:28
🔗
|
joepie91 |
:) |
19:29
🔗
|
joepie91 |
Tephra: any luck with the flickr account yet? |
19:31
🔗
|
joepie91 |
(re: script, provide userpage URL as first argument, and it will download all full-size images from that user) |
19:31
🔗
|
joepie91 |
(might also work with collections, idk) |
19:36
🔗
|
Tephra |
joepie91: sweeet! didn't find a good tool, was just about to purcahse one |
19:36
🔗
|
Tephra |
will grab it now |
19:37
🔗
|
joepie91 |
:P |
19:37
🔗
|
joepie91 |
oh |
19:37
🔗
|
joepie91 |
it will need your session probably |
19:37
🔗
|
joepie91 |
hm |
19:37
🔗
|
joepie91 |
I'll have to modify it to do sessions |
19:37
🔗
|
joepie91 |
Tephra: can any flickr account view that user? |
19:38
🔗
|
Tephra |
joepie91: yes I think so |
19:41
🔗
|
Tephra |
"No page counr found" |
19:41
🔗
|
Tephra |
*count |
19:42
🔗
|
joepie91 |
yeah, means it can't see it |
19:42
🔗
|
joepie91 |
give me a moment |
19:43
🔗
|
joepie91 |
will make some dinner first then I'll modify the script |
19:44
🔗
|
Tephra |
right, I'll continue grabbing lower res photos since i can do that quite easy in bulk |
19:49
🔗
|
joepie91 |
ok |
19:51
🔗
|
omf_ |
joepie91, what about this I heard about flickr changing some js around to use more <span> tags |
19:51
🔗
|
omf_ |
Did you run into that? |
19:53
🔗
|
omf_ |
yeah here is the tweet: congratulations @flickr on replacing links with fucking javascript masquerading as <span> elements |
19:54
🔗
|
xmc |
haha |
19:55
🔗
|
omf_ |
vanityfair has that bullshit for pagination now |
19:55
🔗
|
yipdw |
they must be hiring more full-stack code artists |
19:57
🔗
|
omf_ |
lol that is a great way to explain it |
19:58
🔗
|
yipdw |
I think at some point we will need to integrate WARC functionality with PhantomJS |
19:58
🔗
|
yipdw |
if this fuckery continues |
19:59
🔗
|
yipdw |
actually, it's happening already -- you're not going to be able to archive Discourse-based boards, for example |
19:59
🔗
|
yipdw |
since all of that is an ember.js app |
19:59
🔗
|
yipdw |
and AFAIK Discourse has no export function yet |
20:00
🔗
|
Tephra |
our job just keeps getting easier... |
20:01
🔗
|
yipdw |
oh wait |
20:01
🔗
|
yipdw |
https://github.com/discourse/discourse/blob/master/lib/jobs/exporter.rb |
20:01
🔗
|
yipdw |
https://github.com/discourse/discourse/blob/master/lib/jobs/importer.rb |
20:01
🔗
|
yipdw |
never mind, that looks promising |
20:01
🔗
|
yipdw |
so wgetting it won't be easy, but you can get the data into another Discoursei nstance |
20:01
🔗
|
yipdw |
that's good enough for me |
20:01
🔗
|
yipdw |
(assuming that import/export works) |
20:02
🔗
|
yipdw |
that said, the export/import is controlled by an admin account |
20:02
🔗
|
yipdw |
so you still need the site admin to have the foresight to go "I should save this", which kinda sucks but is better than nothing, I guess |
20:02
🔗
|
joepie91 |
<omf_>joepie91, what about this I heard about flickr changing some js around to use more <span> tags |
20:02
🔗
|
joepie91 |
no idea |
20:03
🔗
|
joepie91 |
I just grabbed the JSON from their AJAX calls |
20:03
🔗
|
Tephra |
joepie91: https://twitter.com/mosaaberizing the guys twitter feed |
20:06
🔗
|
xmc |
yipdw | I think at some point we will need to integrate WARC functionality with PhantomJS |
20:06
🔗
|
xmc |
yes |
20:06
🔗
|
xmc |
also see https://github.com/iramari/WarcProxy |
20:06
🔗
|
yipdw |
xmc: ah, neat |
20:07
🔗
|
yipdw |
xmc: actually, I think that's even better |
20:07
🔗
|
xmc |
I've been browsing through it for the past month actually |
20:07
🔗
|
* |
yipdw forks that to ArchiveTeam |
20:09
🔗
|
Tephra |
joepie91: there's still activity (latest 1 min ago) from his twitter account |
20:09
🔗
|
yipdw |
xmc: I'll keep that in mind for future grabs, thanks |
20:09
🔗
|
xmc |
:) |
20:10
🔗
|
yipdw |
xmc: seems like we should make a seesaw kit based on it |
20:10
🔗
|
xmc |
hm, yeah |
20:10
🔗
|
xmc |
spidermonkey + warcproxy is a winning combination |
20:10
🔗
|
yipdw |
yes |
20:10
🔗
|
yipdw |
ALTHOUGH |
20:10
🔗
|
xmc |
slow, but thorough |
20:10
🔗
|
yipdw |
it seems that what you archive depends somewhat on what interactions you supply |
20:10
🔗
|
yipdw |
like that pagination shit |
20:10
🔗
|
xmc |
yeah |
20:10
🔗
|
yipdw |
maybe we need scrollbar control, haha |
20:10
🔗
|
xmc |
fucking. |
20:10
🔗
|
yipdw |
phantomjs + warcproxy maybe |
20:11
🔗
|
yipdw |
well, that and now you have to wait for async processes to finish |
20:11
🔗
|
xmc |
hm, yeah |
20:12
🔗
|
yipdw |
none of this is hard to get around, but it sure does suck :P |
20:13
🔗
|
xmc |
yeah ... |
20:13
🔗
|
ersi |
Sweet, uploaded my first thing - straight to a custom collection! |
20:14
🔗
|
yipdw |
xmc: does https://github.com/iramari/WarcMiddleware already handle this for us? |
20:14
🔗
|
yipdw |
or is that more akin to our current wget-warc setup |
20:14
🔗
|
ersi |
I think that's more like wget-warc |
20:14
🔗
|
xmc |
^ |
20:14
🔗
|
yipdw |
hm o |
20:14
🔗
|
yipdw |
k |
20:14
🔗
|
xmc |
it claims to be more thorough than wget -p |
20:16
🔗
|
ersi |
Something for all retro lovers. Like SketchCow: https://archive.org/details/D4T206201308031505AwesomeRetroStitch |
20:16
🔗
|
ersi |
Awesome Retro \o/ |
20:21
🔗
|
SketchCow |
Someone just subscribed to the Internet Archive for $245/month |
20:22
🔗
|
omf_ |
awesome |
20:22
🔗
|
ersi |
Awesome! |
20:22
🔗
|
ersi |
argh, fuck. Forgot to change collection |
20:26
🔗
|
Tephra |
SketchCow: got the low res photos saved. joepie91 is looking into getting the high res photos with his script |
20:27
🔗
|
Tephra |
SketchCow: The guy seems to be active on twitter (latest update ~20 mins ago) so he may be alive it would seem |
20:31
🔗
|
balrog |
Tephra: which guy? |
20:37
🔗
|
Tephra |
< joepie91> anyone has a backup of this? https://secure.flickr.com/photos/mosaaberising/ |
20:37
🔗
|
Tephra |
< joepie91> <Sallam94x>they were removed by authorities ystrday |
20:37
🔗
|
Tephra |
< joepie91> <Sallam94x>those pics were considered as proof against |
20:37
🔗
|
Tephra |
cops regarding massacres in egypt :( |
20:37
🔗
|
Tephra |
earlier discussion |
20:37
🔗
|
Tephra |
< joepie91> <Sallam94x>that photographer was killed by cops during |
20:37
🔗
|
Tephra |
< joepie91> <Sallam94x>i wanted to leech them in case they get removed :( |
22:07
🔗
|
godane |
so i'm getting something non-tech related |
22:07
🔗
|
godane |
something called almost live |
22:08
🔗
|
godane |
they say it was i couldn't make profit from it |
22:09
🔗
|
godane |
*the new owners say it couldn't make profit from it |
22:09
🔗
|
godane |
thats why it was canned in 1999 |
22:25
🔗
|
xmc |
almost live was great |
22:30
🔗
|
joepie91 |
wait wtf |
22:30
🔗
|
joepie91 |
was that flickr just un-hidden? |
22:30
🔗
|
joepie91 |
now I can't test my login thing damnit :| |
22:39
🔗
|
joepie91 |
anyway |
22:39
🔗
|
joepie91 |
downloading it |
22:39
🔗
|
joepie91 |
also, happy archiveday godane! :P |
22:41
🔗
|
godane |
trying to archive labrats.tv shows |
22:41
🔗
|
godane |
looks like butterscotch.com is down |
22:42
🔗
|
godane |
but the rss feeds are still up |
23:55
🔗
|
godane |
saved the torrentbytes twitter account: http://archive.is/Jz8lN |