Time |
Nickname |
Message |
04:57
🔗
|
omf_ |
I just tried the tracker site and got a blank page. Is it down? |
05:27
🔗
|
S[h]O[r]T |
godane, if you give me the g4tv url list I'll download it all |
08:17
🔗
|
godane1 |
S[h]O[r]T: https://archive.org/details/g4tv.com-video-url-list-1 |
08:17
🔗
|
godane1 |
i uploaded the list |
10:55
🔗
|
IR5611 |
www.jizzday.com |
11:04
🔗
|
GLaDOS |
I wouldn't set jizzday up on an autoban list.. |
12:39
🔗
|
SketchCow |
hi. |
12:39
🔗
|
SketchCow |
we are bringing back the JSTOR downloader |
12:39
🔗
|
SketchCow |
for aaron. |
12:40
🔗
|
SketchCow |
I believe alard and underscor have the code? |
13:09
🔗
|
alard |
SketchCow: I must have it somewhere, yes. |
13:19
🔗
|
SketchCow |
let's do it. |
13:23
🔗
|
Cameron_D |
yes, when I first heard about it I wondered if we'd ever completed the JSTOR stuff |
13:23
🔗
|
Cameron_D |
so let's do that |
13:44
🔗
|
SketchCow |
get the code, prep the bookmarklet. please, someone check if jstor changed tos to address what we are doing. |
13:45
🔗
|
SketchCow |
I'll provide access to a non-archive.org box for this. |
13:45
🔗
|
SketchCow |
And after I nap a little, I will write verbiage for the page. |
13:47
🔗
|
GLaDOS |
(d) undertake any activity such as computer programs that automatically download or export Content, commonly known as web robots, spiders, crawlers, wanderers or accelerators that may interfere with, disrupt or otherwise burden the JSTOR server(s) or any third-party server(s) being used or accessed in connection with JSTOR |
13:48
🔗
|
GLaDOS |
The only part in the prohibited activities clause which could conflict. |
13:48
🔗
|
kennethre |
odd timing http://www.webpronews.com/jstor-opens-up-its-archive-kinda-sorta-2013-01 |
13:48
🔗
|
Cameron_D |
Won't be automated, IIRC it was something that had to be manually triggered for each file |
13:49
🔗
|
GLaDOS |
AFAIK, we're ok |
13:49
🔗
|
Cameron_D |
At least, bookmarklet implies that |
13:53
🔗
|
SketchCow |
ok. |
13:53
🔗
|
SketchCow |
did it change? is that old or new info? |
13:53
🔗
|
GLaDOS |
Just fetched it |
13:54
🔗
|
GLaDOS |
Wait |
13:54
🔗
|
GLaDOS |
"he Content can be read online (but not printed or downloaded) as further described in Section 2.1 below." |
13:55
🔗
|
godane |
i'm capturing thefeed from g4tv.com and its taking a very long time |
13:56
🔗
|
godane |
this capture is without the images in it |
13:56
🔗
|
godane |
its readly 733mb |
13:56
🔗
|
godane |
*alreadly |
13:57
🔗
|
GLaDOS |
(f) download or print, or attempt to download or print an entire issue of a journal (unless such entire issue has been purchased through the Publisher Sales Service) or substantial portions of the entire run of a journal, except for the specific case in which the complete contents of a journal issue or a substantial portion of Textual Content (e.g. a series of scholarly essays) is relevant to the particular research |
13:57
🔗
|
GLaDOS |
(c) incorporate Content into an unrestricted database or website, except that authors or other Content creators may incorporate their Content into such sites with prior permission from the publisher and other applicable rights holders |
13:57
🔗
|
GLaDOS |
Any of these new? |
14:00
🔗
|
SketchCow |
check wayback |
14:02
🔗
|
alard |
https://twitter.com/JSTOR/status/174155323668574208 |
14:03
🔗
|
GLaDOS |
Newest version in wayback is may 31 |
14:03
🔗
|
GLaDOS |
http://web.archive.org/web/20120531065004/http://about.jstor.org/participate-jstor/individuals/early-journal-content |
14:03
🔗
|
GLaDOS |
Wait, my mind mixed the order of messages up |
14:04
🔗
|
GLaDOS |
Blocked by robots.txt |
14:04
🔗
|
Cameron_D |
http://www.jstor.org/robots.txt it won't be in wayback? |
14:10
🔗
|
SketchCow |
I don't want to move too rashly on this. I've done that in the past, not always for good. |
14:11
🔗
|
SketchCow |
a part of me wants to make it so it violates the agreement, so thousands of people commit the felony. |
14:11
🔗
|
SketchCow |
ok, rest |
14:11
🔗
|
Cameron_D |
Yeah, and looking at point (c) we may not be able to, although there are no past versions of the ToC to compare to |
14:59
🔗
|
balrog_ |
I'm wondering if something exists that just stores any PDFs you're viewing in browser together with a little bit of metadata |
15:40
🔗
|
riordan |
Is this where OpAaronSW is going down? |
15:47
🔗
|
balrog_ |
to some extent |
17:27
🔗
|
SketchCow |
I've put a slight waiting period on it to understand the best thing to do. |
17:27
🔗
|
SketchCow |
But I want his stuff in away from keyboard on archive.org, so we are definitely doing that. |
17:45
🔗
|
riordan |
SketchCow: totally - thank you man |
18:18
🔗
|
godane |
uploaded: http://archive.org/details/www.aaronsw.com-20130112-mirror |
21:46
🔗
|
SketchCow |
Hi. |
21:46
🔗
|
SketchCow |
OK, so. |
21:49
🔗
|
SketchCow |
#1. He deleted some sites, before hanging himself. |
21:49
🔗
|
SketchCow |
#2. Making a collection now. |
21:50
🔗
|
SketchCow |
#3. Soooooo angry still, but running out of people to blame |
21:52
🔗
|
SketchCow |
I've cooked up a plan, working it out with alard. |
21:53
🔗
|
SketchCow |
Here's the plan. |
21:53
🔗
|
SketchCow |
Bookmarklet, like the JSTOR downloader. You run it, and it downloads one document. |
21:53
🔗
|
SketchCow |
You write something about aaron when you do it. |
21:53
🔗
|
SketchCow |
And so it gets uploaded, with your memorial. |
21:53
🔗
|
SketchCow |
Then everyone commits a felony |
21:53
🔗
|
SketchCow |
And says their peace. |
21:58
🔗
|
chronomex |
nice |
22:02
🔗
|
dashcloud |
SketchCow: if the goal is to download everything, can't we just have something that would take a group of people months to complete (i.e., low profile enough to avoid detection until the end)? |
22:06
🔗
|
alard |
One document seems like a nice idea. So people can also leave their name and a message? |
22:06
🔗
|
alard |
(They'll still have to install the bookmarklet, even if there's only one document.) |
22:28
🔗
|
balrog_ |
alard: I'd like to see something I mentioned above to be done |
22:28
🔗
|
balrog_ |
basically like RECAP but for more than just PACER |
22:41
🔗
|
SketchCow |
yes |
22:42
🔗
|
balrog_ |
it bothers me greatly when PDFs (and content in general) that I browsed when doing research even recently goes dark |
22:42
🔗
|
SketchCow |
dashcloud: goal is not to rape jstor to death |
22:42
🔗
|
SketchCow |
sorry, ipad |
22:43
🔗
|
balrog_ |
often a lot of the older stuff is on very sketchy sites to begin with :/ |
22:43
🔗
|
balrog_ |
look at chip datasheets for example... |
22:48
🔗
|
philpem |
yeah, the EAB archive is one such site |
22:48
🔗
|
philpem |
bloody huge collection of databooks and so on, sitting behind someone's cable modem. |
22:48
🔗
|
philpem |
if I had the details of the guy who ran it, I'd offer to send hima a |
22:49
🔗
|
philpem |
*him a Peli hardcase and a bunch of hard drives in exchange for a copy. |
22:49
🔗
|
SketchCow |
n 20 |
22:51
🔗
|
SketchCow |
OK, so, this is what I would like. |
22:51
🔗
|
SketchCow |
1. JSTOR bookmarklet. You add it, click it, and it downloads the article, asking you for a message about aaron. |
22:52
🔗
|
SketchCow |
2. If someone has a virtual instance alard can use, I'd like you to coordinate with him. He has a lot done. |
22:52
🔗
|
SketchCow |
3. When the bookmarklet is used again, banner thanking people, and then a link to the Wikipedia article on Aaron. |
22:52
🔗
|
SketchCow |
Make sense? |
22:58
🔗
|
fault |
I've got some server capacity that can be used |
23:01
🔗
|
fault |
Send me a message if you need somewhere to dump it, I can set up nginx/cgi, whatever stack you need |
23:05
🔗
|
alard |
Actually, it's almost bedtime for me. I have little time tomorrow. So if there's anyone who wants to take over, please do. |
23:05
🔗
|
alard |
I've done the following so far: |
23:06
🔗
|
alard |
There's a bookmarklet that does a form POST with the PDF and the message to a script somewhere. What's needed is a server-side thing that receives the POST data, stores it and adds it to the memorial page. |
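A minimal sketch of the "server-side thing" alard describes, assuming the web stack has already decoded the form POST into the PDF bytes and the message text. The function names (`store_submission`, `render_memorial`), the `index.jsonl` file, and the hash-based file naming are all hypothetical choices for illustration; nothing in the log says how the real script was written.

```python
import hashlib
import html
import json
from pathlib import Path


def store_submission(root: Path, pdf_bytes: bytes, message: str) -> str:
    """Save one uploaded PDF plus its message; return the stored file name."""
    root.mkdir(parents=True, exist_ok=True)
    # Name the file after its content hash so identical uploads share one copy.
    name = hashlib.sha256(pdf_bytes).hexdigest() + ".pdf"
    (root / name).write_bytes(pdf_bytes)
    # Append one JSON record per submission to a simple index file (assumed layout).
    with open(root / "index.jsonl", "a", encoding="utf-8") as index:
        index.write(json.dumps({"file": name, "message": message}) + "\n")
    return name


def render_memorial(root: Path) -> str:
    """Build the memorial list from every stored message."""
    index_path = root / "index.jsonl"
    if not index_path.exists():
        return "<ul></ul>"
    items = []
    for line in index_path.read_text(encoding="utf-8").splitlines():
        record = json.loads(line)
        # Escape user text so a message cannot inject markup into the page.
        items.append(f"<li>{html.escape(record['message'])}</li>")
    return "<ul>" + "".join(items) + "</ul>"
```

A request handler in whatever stack is used would call `store_submission` once per POST and serve the output of `render_memorial` as the body of the memorial page.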
23:10
🔗
|
chronomex |
this seems like a fit for tracker.archiveteam.org |
23:13
🔗
|
SketchCow |
Who can take over? |
23:53
🔗
|
Nemo_bis |
Some warrior instances getting killed for not enough memory. |
23:54
🔗
|
Nemo_bis |
Ah, looks like I lost a user of which I downloaded some 10-15 GiB. |