Time |
Nickname |
Message |
02:37
🔗
|
SketchCow |
Excellent |
14:52
🔗
|
midas |
seems that twitpic is gone |
14:53
🔗
|
midas |
frontpage is empty |
14:53
🔗
|
midas |
but, data still exists: http://twitpic.com/135xa |
14:54
🔗
|
godane |
very odd |
14:54
🔗
|
godane |
send warrior after areas that need grabbing |
15:11
🔗
|
joepie91 |
503 Service Temporarily Unavailable |
15:11
🔗
|
joepie91 |
cc midas godane |
15:11
🔗
|
joepie91 |
(for the image URL) |
15:12
🔗
|
midas |
hm? works here? |
15:12
🔗
|
midas |
refresh it a couple of times |
15:15
🔗
|
godane |
https://dn3pm25xmtlyu.cloudfront.net/photos/large/1827262.jpg?1232052212&Expires=1414336322&Signature=M3Dx66kXYdcgrTVlxqBCPqZ2osLKCwd4OZj84B3L0h~33bi~TCd8395uq5ImqqVCZfCqUUwdenExmfPACy1rUXX8lIxK2v8fyZMUt6r-kZMO3b039g1hBRBWy9BM-EzmwENCD7qf34fFLjrTylRqCpIHvyKEQ3453-jz7F38wNA_&Key-Pair-Id=APKAIYVGSUJFNRFZBBTA |
15:21
🔗
|
ersi |
Outlook cloudy. |
15:25
🔗
|
godane |
i was hoping we could just attack the number.jpg ulrs |
15:25
🔗
|
godane |
but no |
15:25
🔗
|
godane |
you need special keys |
15:25
🔗
|
antomatic |
that's the change Noah made once the 500m were grabbed |
15:25
🔗
|
godane |
oh |
15:25
🔗
|
Kenshin |
sadly we didn't have the space for the last 300m |
15:25
🔗
|
Kenshin |
at that point of time when it was still unblocked |
15:26
🔗
|
Kenshin |
and because they announced no-close as well |
15:26
🔗
|
Kenshin |
there was no incentive to push for the last 300m |
15:26
🔗
|
Kenshin |
when they did announce re-close, too late, already locked the photos with key |
15:26
🔗
|
antomatic |
and archiving had temporarily stopped because he said "Yay, Twitpic is acquired, you want to go skateboards?" |
15:26
🔗
|
antomatic |
[This is how website owners speak, I imagine] |
15:26
🔗
|
Kenshin |
if we grabed the last 300m it would ahve probably been easier at this point really |
15:27
🔗
|
Kenshin |
in worse case we could have just redirected the base-36 URLs to base-10 image links |
15:28
🔗
|
Kenshin |
either way nothing more we can do |
15:28
🔗
|
antomatic |
500m is still a very good result |
15:28
🔗
|
antomatic |
especially given the circumstances |
15:28
🔗
|
Kenshin |
let's just say if they didn't announce the acquire |
15:28
🔗
|
Kenshin |
we would have continued |
15:29
🔗
|
Kenshin |
and probably got close to current |
15:29
🔗
|
antomatic |
Hindsight is always 20/20. |
15:29
🔗
|
Kenshin |
i blame it on the lack of storage |
15:29
🔗
|
antomatic |
As you say, it was hard to make a case for the space once they said it wasn't being wiped |
15:29
🔗
|
Kenshin |
i was pushing 3-4gbps nonstop for the cloudfront project |
15:30
🔗
|
Kenshin |
i got the 500m in 3 days |
15:30
🔗
|
antomatic |
even in the light of "won't be fooled again" there are always practical limits |
15:30
🔗
|
Kenshin |
I'd quote House, "Everybody lies" |
15:30
🔗
|
antomatic |
Hehe. :) |
15:31
🔗
|
antomatic |
All those weeny commenters on HN, all "they can't afford the transfer costs" and stuff. |
15:31
🔗
|
antomatic |
You'd have thought we were taking a dying man's last dime. |
15:31
🔗
|
ersi |
This is starting to be something for #archiveteam-bs |
15:31
🔗
|
Kenshin |
the 500m would have cost them a few thousands i guess |
15:32
🔗
|
antomatic |
It's a useful discussion for considering future AT policy and approaches to cases like these, though, maybe, ersi? |
15:32
🔗
|
Kenshin |
ersi: i think this is something everyone needs to learn about when archiving all future sites |
15:32
🔗
|
antomatic |
but -bs is fine too |
15:33
🔗
|
Kenshin |
AT needs a firm solution for the storage issue |
15:33
🔗
|
* |
antomatic nods |
15:33
🔗
|
yipdw |
someone give me a bunch of twitpic URLs |
15:33
🔗
|
yipdw |
ArchiveBot isn't banned and we have image access |
15:33
🔗
|
antomatic |
weh? |
15:33
🔗
|
yipdw |
just do it |
15:34
🔗
|
Kenshin |
yipdw: you can always do base36 conversion for 500m+ |
15:34
🔗
|
ersi |
Well, as long as it's not fretting about what we've lost for hundreds of lines and useless policy talk |
15:34
🔗
|
Kenshin |
yipdw: 49.213.23.196 |
15:34
🔗
|
Kenshin |
gah bad paste |
15:34
🔗
|
ersi |
Policy is, do (archiving) > talking |
15:34
🔗
|
Kenshin |
http://twitpic.com/89oqgx |
15:35
🔗
|
Kenshin |
ersi: yes but when storage is an issue, not possible to "do" |
15:35
🔗
|
antomatic |
archiving without a plan is just masturbation. :) |
15:35
🔗
|
yipdw |
watch http://archivebot.at.ninjawedding.org:4567/ |
15:35
🔗
|
* |
antomatic watches |
15:35
🔗
|
ersi |
Sometimes, making a plan is when archiving doesn't happen and the building burn down |
15:35
🔗
|
ersi |
So it's not just masturbation. Whatever, I'm out :) |
15:36
🔗
|
yipdw |
maybe I need to phantomjs that |
15:36
🔗
|
yipdw |
huh yeah |
15:37
🔗
|
yipdw |
so yeah, either keep talking about archiving or URLs |
15:37
🔗
|
Kenshin |
yipdw: it worked |
15:37
🔗
|
Kenshin |
you grabed the image fine, 200 |
15:37
🔗
|
yipdw |
it's full of signed request shit, but yeah |
15:37
🔗
|
Kenshin |
well because it's a complete grab, the signing is fine |
15:37
🔗
|
Kenshin |
the html will link to the image |
15:37
🔗
|
yipdw |
yeah, it'll just screw wayback up without some processing |
15:37
🔗
|
yipdw |
oh yeah or that |
15:37
🔗
|
yipdw |
ANYWAY |
15:38
🔗
|
Kenshin |
lets move this to #quitpic |
15:38
🔗
|
yipdw |
ok |
15:39
🔗
|
godane |
antomatic: thats why i create plans with my archives |
15:39
🔗
|
godane |
looks at my funnyordie collection and you see the plan |
15:39
🔗
|
antomatic |
you do great work, godane. that needs to be said. |
15:39
🔗
|
antomatic |
always. |
15:39
🔗
|
godane |
grab older stuff first |
15:40
🔗
|
antomatic |
makes sense |
15:40
🔗
|
godane |
my advices is more for the stuff thats not shutting down |
15:41
🔗
|
godane |
cause otherwise you lose the stuff that more popular |
15:43
🔗
|
godane |
also this at least give you some thing if they shutdown with no warning |
15:44
🔗
|
godane |
with twitpic the best way of archiving that IA can handle is maybe grabbing the url images on the front pages that IA has |
15:45
🔗
|
godane |
this way you give IA something and you have less dead links in wayback machiine |
15:46
🔗
|
godane |
anyways i'm grab more global news videos |
15:46
🔗
|
godane |
like News Hour and News Hour Final |
16:33
🔗
|
SketchCow |
Morning |
16:33
🔗
|
SketchCow |
This went to -bs quick |
18:31
🔗
|
signius |
I am already running scripts on a couple of VPS boxes but when i try & run the scripts on a VM on my local LAN i am just getting Server returned 0 .........Sleeping |
18:32
🔗
|
signius |
What am i doing wrong ? or not doing that i should be ? |
18:34
🔗
|
Kazzy |
does the vm have internet access? |
18:34
🔗
|
signius |
yeah |
19:07
🔗
|
bzc6p |
TWITPIC "ACQUIRED" BY TWITTER. http://blog.twitpic.com/2014/10/twitpics-future/ |
19:09
🔗
|
bzc6p |
(the quotation marks mean "kind of") |
19:10
🔗
|
godane |
its at least saved in a read-only state |
19:10
🔗
|
godane |
but this one bothers me: You will still be able to login to your profile to delete content or delete your account on Twitpic.com |
19:11
🔗
|
godane |
also twitter gives all there stuff older then 6 months to the library of congress i think |
19:13
🔗
|
joepie91 |
"Twitter shares our goal of protecting our users and this data." |
19:13
🔗
|
joepie91 |
hahahahahahahaha. |
19:13
🔗
|
joepie91 |
I don't think there's any sharing involved there... |
19:21
🔗
|
pikhq |
godane: At the very least, if you send communication to Twitter saying "hey, we'd like to archive stuff!" they're likely to send a response. :P |
19:23
🔗
|
joepie91 |
pikhq: try to send actual communication to Twitter and then say that again :P |
19:23
🔗
|
pikhq |
I didn't say it would be more than a formletter. |
19:23
🔗
|
joepie91 |
pikhq: nono, you misunderstand |
19:23
🔗
|
joepie91 |
the problem isn't in the response received |
19:23
🔗
|
joepie91 |
the problem is in actually sending them a communication |
19:23
🔗
|
joepie91 |
I've tried this before |
19:24
🔗
|
pikhq |
They *do* have a mailing address, no? |
19:24
🔗
|
joepie91 |
after an hour I gave up on finding anything that wasn't an incredibly narrowly defined topic-specific contact form |
19:24
🔗
|
joepie91 |
pikhq: snail mail? possibly, I have no idea |
19:24
🔗
|
joepie91 |
certainly not any kind of online human support |
19:26
🔗
|
pikhq |
Huh, impressive. Their website actually makes it hard to find contact info. |
19:27
🔗
|
joepie91 |
yep. |
19:27
🔗
|
joepie91 |
contract to Yahoo, the much-maligned destroyer of data, who have an actual online support department |
19:27
🔗
|
joepie91 |
contrast * |
19:27
🔗
|
joepie91 |
with a human responding |
19:27
🔗
|
joepie91 |
:P |
19:27
🔗
|
joepie91 |
anyway, this is going into -bs territory |
19:33
🔗
|
SketchCow |
"going" |
19:33
🔗
|
joepie91 |
shush :P |
19:33
🔗
|
SketchCow |
Dude, we passed WELCOME TO -BS, FIRST IN TANGENTS on the highway 3 hours ago |
19:33
🔗
|
joepie91 |
the siren didn't even go off yet! |
19:57
🔗
|
godane |
SketchCow: i started grabbing News Hour Toronto |
19:58
🔗
|
godane |
looks like they may have better archives then Global National archives |
20:00
🔗
|
chronomex |
someone wants an off topic siren? |
20:00
🔗
|
chronomex |
woop woop woop off-topic siren |
20:00
🔗
|
chronomex |
oh, that was half an hour ago |
20:09
🔗
|
SketchCow |
Excellent |
20:10
🔗
|
godane |
looks like i can get episodes going back to 2013 |
20:17
🔗
|
godane |
SketchCow: there is also a english south korea news program called KBS News Today |
20:17
🔗
|
godane |
there episodes of that on youtube |
22:58
🔗
|
arkiver |
https://web.archive.org/web/20141007064755/http://www.genealogy.com/users/m/e/i/Paula-Arleen-Meinert/WEBSITE-0001/UHP-1583.html |
22:58
🔗
|
arkiver |
https://web.archive.org/web/20141007063953/http://www.genealogy.com/users/m/e/i/Paula-Arleen-Meinert/WEBSITE-0001/UHP-Index.html |
22:58
🔗
|
arkiver |
https://web.archive.org/web/20140929010450/http://www.mundia.com/be/Person/27883978/5094683264 |
22:58
🔗
|
arkiver |
it's there |
22:58
🔗
|
arkiver |
and it's awesome |
22:59
🔗
|
arkiver |
:) |
23:01
🔗
|
arkiver |
https://web.archive.org/web/20140924190350/http://www.mundia.com/af/Search/Results?surname=BAHA&birthPlace=Afghanistan |