#archiveteam 2015-12-12,Sat

↑back Search

Time Nickname Message
00:13 🔗 HCross JetBalsa, this channel is logged FYI
00:13 🔗 JetBalsa Its Archive Team, I figured
00:14 🔗 JetBalsa that pass is used for temp accounts we hand out, its changed pretty much asap, but Thanks for the note
00:14 🔗 aaaaaaaaa A surprisingly large number of people don't.
00:14 🔗 JetBalsa hahah
00:19 🔗 JW_work most of the archiveteam channels are *not* logged, at least in public
00:19 🔗 JW_work just the always-on ones
00:20 🔗 JetBalsa that should be apart of the archive
00:20 🔗 JetBalsa the channel logs, documenting the saving of everything
00:20 🔗 VADemon has quit IRC (left4dead)
00:34 🔗 HCross Anything happened to FOS? My upload has tanked
00:35 🔗 bwn_ has joined #archiveteam
00:36 🔗 arkiver Please update your scripts for Google Code!
00:42 🔗 bwn has quit IRC (Read error: Operation timed out)
01:19 🔗 philpem has quit IRC (Ping timeout: 252 seconds)
01:20 🔗 SN4T14 has quit IRC (Read error: Operation timed out)
01:28 🔗 SN4T14 has joined #archiveteam
01:38 🔗 jleclanch has quit IRC (Read error: Operation timed out)
01:46 🔗 jleclanch has joined #archiveteam
01:47 🔗 JesseW has joined #archiveteam
02:01 🔗 jleclanch has quit IRC (Ping timeout: 255 seconds)
02:14 🔗 bwn_ has quit IRC (Read error: Connection reset by peer)
02:14 🔗 bwn has joined #archiveteam
02:22 🔗 schbirid2 has joined #archiveteam
02:24 🔗 schbirid has quit IRC (Read error: Operation timed out)
02:35 🔗 vitzli has joined #archiveteam
02:44 🔗 HCross has quit IRC (Max SendQ exceeded)
02:44 🔗 Fusl has quit IRC (Max SendQ exceeded)
02:44 🔗 HCross has joined #archiveteam
02:45 🔗 Fusl has joined #archiveteam
02:47 🔗 _desu___ has quit IRC (Ping timeout: 252 seconds)
02:50 🔗 _desu___ has joined #archiveteam
02:52 🔗 xmc has quit IRC (Quit: brb)
03:12 🔗 bwn has quit IRC (Read error: Operation timed out)
03:24 🔗 xmc has joined #archiveteam
03:24 🔗 swebb sets mode: +o xmc
03:36 🔗 JetBalsa has quit IRC (Quit: - nbs-irc 2.39 - www.nbs-irc.net -)
03:36 🔗 kyan tree3: I've started re-grabbing the ones that failed on the first time, but it looks like he's started setting them to private
03:37 🔗 kyan even though it's not saturday yet
03:37 🔗 kyan :(
03:37 🔗 kyan So I won't be able to get those unless you can convince him to set them to "unlisted"
03:38 🔗 kyan I got most of the ones that were available though (289 I'm retrying).
03:38 🔗 kyan So that's 4161 successfully grabbed (or were already taken down due to Content ID).
03:39 🔗 kyan They're still uploading, though
03:39 🔗 kyan Also he's put up some new ones since then, which I haven't gotten yet.
03:41 🔗 andrew_m has joined #archiveteam
03:42 🔗 andrew_m has quit IRC (Client Quit)
04:03 🔗 tree33 has joined #archiveteam
04:09 🔗 tree3 has quit IRC (Read error: Operation timed out)
04:27 🔗 aaaaaaaaa Can someone stop the google code grab?
04:30 🔗 phuzion yipdw, chfoo, arkiver ping ^^
04:31 🔗 vitzli it is 2015-12-12 04:31 UTC, for logging purposes
04:33 🔗 chfoo i stopped it
04:33 🔗 chfoo what was wrong with it?
04:34 🔗 vitzli augie_at in #googlecodeblue logged in and asked to reduce the load, he is currently in #googlecodeblue
04:35 🔗 vitzli load spikes trip ddos alerts to the google staff
04:37 🔗 Ghost_of_ has quit IRC (Remote host closed the connection)
05:01 🔗 nertzy has joined #archiveteam
05:08 🔗 aaaaaaaaa has quit IRC (Leaving)
05:20 🔗 nertzy has quit IRC (Quit: This computer has gone to sleep)
05:48 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
05:50 🔗 kyan tree33: I'm pretty sure I've gotten all of the ones that I didn't get the first time around and haven't already been set to private.
05:50 🔗 kyan I don't plan on getting the more recent videos (posted since the shutdown alert) unless you let me know they're at risk.
05:51 🔗 kyan It'll be a while before everything shows up on archive.org, though.
05:57 🔗 Sk1d has joined #archiveteam
06:13 🔗 redlob has quit IRC (Read error: Operation timed out)
06:14 🔗 jleclanch has joined #archiveteam
06:17 🔗 redlob has joined #archiveteam
06:53 🔗 kniffy has quit IRC (Excess Flood)
07:01 🔗 kniffy has joined #archiveteam
07:01 🔗 kniffy has quit IRC (Excess Flood)
07:02 🔗 kniffy has joined #archiveteam
07:04 🔗 Froggypwn has quit IRC (Read error: Operation timed out)
07:20 🔗 dtm has quit IRC (hub.efnet.us irc.Prison.NET)
07:20 🔗 JW_work has quit IRC (hub.efnet.us irc.Prison.NET)
07:20 🔗 kyan has quit IRC (hub.efnet.us irc.Prison.NET)
07:20 🔗 logan has quit IRC (hub.efnet.us irc.Prison.NET)
07:20 🔗 patrickod has quit IRC (hub.efnet.us irc.Prison.NET)
07:27 🔗 chfoo has quit IRC (Ping timeout: 310 seconds)
07:29 🔗 dtm has joined #archiveteam
07:29 🔗 JW_work has joined #archiveteam
07:29 🔗 kyan has joined #archiveteam
07:29 🔗 logan has joined #archiveteam
07:29 🔗 patrickod has joined #archiveteam
07:36 🔗 BlueMaxim has quit IRC (Quit: Leaving)
07:59 🔗 vitzli has quit IRC (Quit: Leaving)
08:34 🔗 bwn has joined #archiveteam
09:23 🔗 asdf has joined #archiveteam
09:23 🔗 JesseW has quit IRC (Leaving.)
09:28 🔗 acridAxid has quit IRC (Quit: marauder)
09:29 🔗 acridAxid has joined #archiveteam
10:17 🔗 remsen has quit IRC (Leaving)
10:17 🔗 remsen has joined #archiveteam
10:50 🔗 vitzli has joined #archiveteam
10:59 🔗 vOYtEC has quit IRC (Quit: rm -r *)
11:31 🔗 Jogie has quit IRC (Ping timeout: 506 seconds)
11:31 🔗 Jogie has joined #archiveteam
11:48 🔗 VADemon has joined #archiveteam
12:03 🔗 vOYtEC has joined #archiveteam
12:36 🔗 vOYtEC has quit IRC (Quit: rm -r *)
12:42 🔗 Ghost_of_ has joined #archiveteam
12:52 🔗 vOYtEC has joined #archiveteam
13:46 🔗 WinterFox has quit IRC (Remote host closed the connection)
14:03 🔗 remsen2 has joined #archiveteam
14:03 🔗 rizzzz has quit IRC (Remote host closed the connection)
14:03 🔗 rizzzz has joined #archiveteam
14:05 🔗 remsen has quit IRC (Read error: Operation timed out)
14:21 🔗 altlabel has quit IRC (Ping timeout: 506 seconds)
14:24 🔗 fie has joined #archiveteam
14:26 🔗 xmc has quit IRC (Read error: Operation timed out)
14:27 🔗 K4k has joined #archiveteam
14:54 🔗 K4k has quit IRC (WeeChat 1.0.1)
14:54 🔗 K4k has joined #archiveteam
14:58 🔗 foobar_ has joined #archiveteam
15:00 🔗 nertzy has joined #archiveteam
15:01 🔗 foobar_ has quit IRC (Client Quit)
15:06 🔗 foobar_ has joined #archiveteam
15:07 🔗 foobar_ Hi, does someone know the current status of the Gitorious archiving project?
15:07 🔗 foobar_ Are you looking for any volunteers?
15:12 🔗 PurpleSym foobar_: http://archive.fart.website/bin/irclogger_log/archiveteam?date=2015-12-11,Fri&sel=195#l191
15:18 🔗 SketchCow Where's my hug!!
15:19 🔗 foobar_ @PurpleSysm: Thanks!
15:20 🔗 foobar_ has left
15:20 🔗 SketchCow Got my telethon haircut
15:21 🔗 SketchCow IA OCR is dealing with piles of Macintosh books
15:25 🔗 arkiver HUG
15:25 🔗 arkiver Good luck with the telethon!
15:27 🔗 arkiver I guess we'll later hear more about the livestreams
15:29 🔗 SketchCow telethon.archive.org is the site that will have all info.
15:30 🔗 HCross Yes. Good luck SketchCow - will be there in spirit (while actually being over 5000 miles away :D)
15:38 🔗 SketchCow So, FOS is now slightly filling with FTP.
15:38 🔗 SketchCow 64%
15:38 🔗 SketchCow And I have things wiping away items ater they're uploaded.
15:39 🔗 HCross Want us to slow a lot lot down?
15:42 🔗 kniffy has quit IRC (Ping timeout: 252 seconds)
15:46 🔗 SketchCow Well, of the 4.7tb being used, 3.3 is FTP
15:50 🔗 arkiver Maybe we should create a server which is only for the FTP grab
15:50 🔗 DopefishJ has joined #archiveteam
15:50 🔗 swebb sets mode: +o DopefishJ
15:50 🔗 arkiver So it won't affect the other grabs we do
15:50 🔗 Start_ has joined #archiveteam
15:50 🔗 Start has quit IRC (Read error: Connection reset by peer)
15:51 🔗 HCross arkiver, if we did that, could it be EU based please?
15:51 🔗 godane has quit IRC (Quit: Leaving.)
15:51 🔗 DFJustin has quit IRC (Read error: Operation timed out)
15:55 🔗 nertzy has quit IRC (Quit: This computer has gone to sleep)
16:12 🔗 n00b390 has joined #archiveteam
16:13 🔗 n00b390 WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
16:14 🔗 ersi "yahoosucks"
16:14 🔗 n00b390 Hail!
16:14 🔗 n00b390 And many thanks!
16:15 🔗 ersi ^_^
16:15 🔗 n00b390 has quit IRC (Client Quit)
16:15 🔗 vitzli *sigh*
16:15 🔗 ersi With great powers comes great responsibility
16:20 🔗 kniffy_ has joined #archiveteam
16:22 🔗 kniffy_ is now known as kniffy
16:31 🔗 kniffy has quit IRC (Quit: :^))
16:32 🔗 kniffy has joined #archiveteam
16:33 🔗 chfoo has joined #archiveteam
16:45 🔗 SN4T14_ has joined #archiveteam
16:46 🔗 SN4T14 has quit IRC (Read error: Operation timed out)
16:47 🔗 sep332 has quit IRC (Read error: Operation timed out)
16:49 🔗 altlabel has joined #archiveteam
16:50 🔗 vitzli has quit IRC (Leaving)
16:54 🔗 kyan has quit IRC (Ping timeout: 258 seconds)
16:56 🔗 godane has joined #archiveteam
16:59 🔗 JesseW has joined #archiveteam
17:09 🔗 JetBalsa has joined #archiveteam
17:12 🔗 tree33 is now known as tree3
17:18 🔗 antomati_ has joined #archiveteam
17:18 🔗 swebb sets mode: +o antomati_
17:18 🔗 wxtr has quit IRC (Read error: Operation timed out)
17:18 🔗 RichardG_ has joined #archiveteam
17:18 🔗 Fletcher has quit IRC (Read error: Operation timed out)
17:18 🔗 Famicoman has quit IRC (Read error: Operation timed out)
17:18 🔗 afics has quit IRC (Read error: Operation timed out)
17:18 🔗 antomatic has quit IRC (Read error: Operation timed out)
17:18 🔗 no2pencil has quit IRC (Read error: Operation timed out)
17:19 🔗 no2pencil has joined #archiveteam
17:19 🔗 Stiletto has quit IRC (Read error: Operation timed out)
17:19 🔗 Stiletto has joined #archiveteam
17:19 🔗 nox has quit IRC (Read error: Operation timed out)
17:20 🔗 wxtr has joined #archiveteam
17:20 🔗 cadbury has quit IRC (Read error: Operation timed out)
17:20 🔗 Apathy has quit IRC (Read error: Operation timed out)
17:21 🔗 [phire] has quit IRC (Read error: Operation timed out)
17:22 🔗 brayden_ has quit IRC (Read error: Operation timed out)
17:22 🔗 vtyl has joined #archiveteam
17:23 🔗 wp494 has quit IRC (Read error: Operation timed out)
17:23 🔗 mistym has quit IRC (Ping timeout: 606 seconds)
17:23 🔗 wp494 has joined #archiveteam
17:23 🔗 nox has joined #archiveteam
17:23 🔗 mistym has joined #archiveteam
17:24 🔗 Start_ is now known as Start
17:24 🔗 RichardG has quit IRC (Ping timeout: 606 seconds)
17:26 🔗 lytv has quit IRC (Ping timeout: 606 seconds)
17:28 🔗 Fletcher has joined #archiveteam
17:31 🔗 Emcy_ has joined #archiveteam
17:32 🔗 RichardG has joined #archiveteam
17:38 🔗 wp494 has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 RichardG_ has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 Sk1d has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 dashcloud has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 ParkerR has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 Elegance has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 Gfy has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 thefinn93 has quit IRC (hub.se efnet.portlane.se)
17:38 🔗 Emcy has quit IRC (hub.se efnet.portlane.se)
17:39 🔗 Elegance_ has joined #archiveteam
17:42 🔗 Gfy_ has joined #archiveteam
17:47 🔗 Apathy has joined #archiveteam
17:47 🔗 parker_ has joined #archiveteam
17:47 🔗 afics has joined #archiveteam
17:49 🔗 JesseW has quit IRC (Leaving.)
17:50 🔗 cadbury has joined #archiveteam
17:50 🔗 thefinn91 has joined #archiveteam
17:54 🔗 Gfy_ is now known as Gfy
17:54 🔗 wp494 has joined #archiveteam
17:55 🔗 dashcloud has joined #archiveteam
18:03 🔗 [phire] has joined #archiveteam
18:15 🔗 brayden_ has joined #archiveteam
18:15 🔗 swebb sets mode: +o brayden_
18:18 🔗 remsen has joined #archiveteam
18:22 🔗 R5M has joined #archiveteam
18:22 🔗 R5M has quit IRC (Client Quit)
18:24 🔗 Froggypwn has joined #archiveteam
18:27 🔗 Ghost_of_ has quit IRC (Quit: Leaving)
18:28 🔗 remsen2 has quit IRC (Read error: Operation timed out)
18:29 🔗 Famicoman has joined #archiveteam
18:29 🔗 chfoo- has quit IRC (ZNC - 1.6.0 - http://znc.in)
18:30 🔗 chfoo- has joined #archiveteam
18:31 🔗 remsen has quit IRC (Read error: Operation timed out)
18:32 🔗 SimpBrain has quit IRC (Read error: Operation timed out)
18:41 🔗 sep332 has joined #archiveteam
18:42 🔗 SimpBrain has joined #archiveteam
18:54 🔗 SimpBrain has quit IRC (Leaving)
18:54 🔗 SimpBrain has joined #archiveteam
18:57 🔗 JesseW has joined #archiveteam
19:00 🔗 dashcloud has quit IRC (Read error: Operation timed out)
19:04 🔗 dashcloud has joined #archiveteam
19:13 🔗 Atom-- has quit IRC (Read error: Connection reset by peer)
19:19 🔗 thefinn91 is now known as thefinn93
19:21 🔗 xXx_ndidd has joined #archiveteam
19:24 🔗 aaaaaaaaa has joined #archiveteam
19:24 🔗 swebb sets mode: +o aaaaaaaaa
19:28 🔗 ndiddy has quit IRC (Read error: Operation timed out)
19:35 🔗 toad2 has joined #archiveteam
19:36 🔗 toad1 has quit IRC (Read error: Operation timed out)
19:54 🔗 bwn has quit IRC (Read error: Operation timed out)
20:01 🔗 Start_ has joined #archiveteam
20:01 🔗 Start has quit IRC (Read error: Connection reset by peer)
20:31 🔗 xXx_ndidd has quit IRC (Read error: Connection reset by peer)
20:34 🔗 ndiddy has joined #archiveteam
20:34 🔗 ndiddy has quit IRC (Read error: Connection reset by peer)
20:37 🔗 ndiddy has joined #archiveteam
20:43 🔗 bwn has joined #archiveteam
20:55 🔗 HCross http://www.cryengine.com/community/downloads.php is going away because of http://www.cryengine.com/news/the-new-cryenginecom-is-coming-next-week Best way of getting it all
20:56 🔗 HCross We have until Monday
20:56 🔗 Ghost_of_ has joined #archiveteam
21:21 🔗 scyther has joined #archiveteam
21:23 🔗 Dennisjr1 has joined #archiveteam
21:23 🔗 philpem has joined #archiveteam
21:26 🔗 HCross I should of added, put it in ArchiveBot but was wondering if a warrior task is needed
21:26 🔗 godane i don't know if archivebot would take it
21:26 🔗 godane it uses javascript for the download buttons
21:26 🔗 ndiddy has quit IRC (Read error: Connection reset by peer)
21:26 🔗 HCross hmm. What is the best way of saving it then
21:26 🔗 HCross ive put --phantom-js on
21:27 🔗 ndiddy has joined #archiveteam
21:28 🔗 HCross IMHO its important
21:28 🔗 godane i know
21:29 🔗 godane but looks like its not grabbing the files
21:32 🔗 HCross ah. Do we need a warrior project?
21:33 🔗 godane i'm trying to get files in wget
21:33 🔗 godane wget -e robots=off --user-agent=Firefox --post-data="submit=Download&hotlink_id=&df_id=4329&modcp=0&cat_id=109&hotlink_id=&view=load" http://www.cryengine.com/community/downloads.php
21:33 🔗 godane its not working
21:34 🔗 yipdw HCross: least overhead is for you or someone to just resolve the URLs and download them
21:34 🔗 HCross hmm ok
21:34 🔗 yipdw warrior is overkill, Crytek isn't that large
21:35 🔗 HCross tru, but there are quite a few
21:35 🔗 yipdw computers are pretty good at doing repetitive tasks quickly
21:37 🔗 HCross tempted to fire up a web browser and start clicking download a lot
21:38 🔗 PurpleSym Does the wayback machine handle POST requests correctly?
21:38 🔗 yipdw how would it possibly do so
21:39 🔗 PurpleSym It could match the input parameters with requests it has in WARCs and hope for the best.
21:39 🔗 Start_ is now known as Start
21:40 🔗 godane how do i put the post data in so wget get the file
21:40 🔗 PurpleSym GET works fine, btw: http://www.cryengine.com/community/downloads.php?submit=Download&hotlink_id=&df_id=5310&modcp=0&cat_id=125&hotlink_id=&view=load
21:40 🔗 godane Thank you
21:41 🔗 godane i thought there was a way to do it using GET
21:41 🔗 yipdw I don't know if IA's wayback does, but emulating POST like that is asking for a lot of trouble
21:41 🔗 godane i will start archiving
21:41 🔗 HCross Thanks godane
21:41 🔗 HCross let me know if you need a hand with bandwith or something
21:42 🔗 PurpleSym Sure, yipdw, for the requests that actually *modify* something things will go terribly wrong.
21:42 🔗 HCross godane, are you aware of the deadline?
21:42 🔗 godane yes
21:42 🔗 yipdw yes, which is most POST requests
21:42 🔗 godane http://www.cryengine.com/community/downloads.php?submit=Download&hotlink_id=&df_id=5310
21:42 🔗 godane that works too
21:42 🔗 yipdw anyway this is offtopic
21:43 🔗 PurpleSym True, I’m sorry.
21:46 🔗 Ghost_of_ has quit IRC (Quit: Leaving)
21:46 🔗 godane i can't get wget to work with it
21:48 🔗 godane HELP
21:51 🔗 godane Access denied!
21:52 🔗 HCross Sorry I cant really help with this
21:54 🔗 JesseW curl -A user-agent -I the-get-url you had above
21:54 🔗 JesseW seems to get me a 301 redirect to http://crytekfiles.com/files/CRYENGINE_Build_PC_v3_5_8_2310_freesdk.zip
21:54 🔗 JesseW which seems to download OK
21:58 🔗 godane full code
21:58 🔗 godane its not working for me
21:59 🔗 JesseW working on it
22:02 🔗 JesseW can you get a list of df_id's (from the view=detail links on the download pages)? We probably want the view=detail pages in any case, and they should be grabbale with grab-site (or wget, etc)
22:02 🔗 godane doing this still give me access denied pages: curl -A firefox -i 'http://www.cryengine.com/community/downloads.php?view=load&hotlink_id=&code=&df_id=5310'
22:05 🔗 JesseW captal -I ,not lowercase
22:05 🔗 JesseW doing a HEAD request
22:07 🔗 JesseW still working on it
22:12 🔗 melody has quit IRC (Ping timeout: 252 seconds)
22:12 🔗 melody has joined #archiveteam
22:13 🔗 godane i'm grabbing 6000 download pages
22:13 🔗 godane just the pages
22:15 🔗 godane looks like i grab the pages
22:16 🔗 godane *i can grab the pages
22:19 🔗 JesseW awesome, we'll need those
22:19 🔗 JesseW I'm close to getting code to translate them into downloads
22:19 🔗 JesseW bash shell is OK?
22:21 🔗 godane yes
22:22 🔗 JesseW function cryengine_download {foo=$(curl -I -A 'Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Firefox/38.0 Iceweasel/38.3.0' 'http://www.cryengine.com/community/downloads.php?submit=Download&df_id='$1'&view=load' | awk '/Location:/{print $2}'); wget "${foo:0:-1}"; }
22:22 🔗 JesseW pass it a df_id
22:23 🔗 JesseW it will download the file to the current directory
22:23 🔗 HCross do we have a list of df_id's?
22:23 🔗 JesseW godane: is working on that -- that's what the download pages are
22:23 🔗 HCross or can we just have a list of numbers and go from there
22:23 🔗 JesseW e.g. http://www.cryengine.com/community/downloads.php?view=detail&category=45&df_id=5289
22:23 🔗 JesseW the df_id is 5289
22:24 🔗 godane its working
22:25 🔗 godane your script
22:25 🔗 godane i'm going to brute force it
22:27 🔗 JesseW nice
22:27 🔗 HCross On behalf of everyone who plays Crysis, thanks everyone
22:27 🔗 JesseW curl 'http://www.cryengine.com/community/downloads.php?&sort_by=0&order=ASC&start=80' > ~/blahxxx | awk -F \" '/df_id\"/{print $6}'
22:27 🔗 JesseW will get you a list of df_ids on the page
22:28 🔗 HCross Do we need a channel for this?
22:28 🔗 JesseW it looks like are about 2,640 downloads
22:28 🔗 * JesseW shrug
22:28 🔗 JesseW I don't think there's that much more to talk about. as long as godane keeps us updated if he needs someone else to take part of the range, I think we're good.
22:29 🔗 Dennisjr1 What does the archive size for this look like/how much data are we looking at here?
22:30 🔗 scyther has quit IRC (Read error: Connection reset by peer)
22:37 🔗 JesseW Dennisjr1: most of the downloads appear to be in the few megabytes range, which would give us a few gigabytes total. There may be some big ones buried among them, though.
22:38 🔗 Dennisjr1 JesseW: ah that's not too bad then :)
22:50 🔗 Rickster has quit IRC (Ping timeout: 252 seconds)
22:50 🔗 wutno has quit IRC (Ping timeout: 252 seconds)
22:51 🔗 diacope has quit IRC (Ping timeout: 252 seconds)
22:51 🔗 sigkell has quit IRC (Ping timeout: 252 seconds)
22:51 🔗 sigkell has joined #archiveteam
22:52 🔗 Zebranky has quit IRC (Ping timeout: 252 seconds)
22:52 🔗 Zebranky has joined #archiveteam
22:52 🔗 Fletcher has quit IRC (Ping timeout: 252 seconds)
22:52 🔗 _desu___ has quit IRC (Ping timeout: 252 seconds)
22:52 🔗 zyphlar has quit IRC (Ping timeout: 252 seconds)
22:52 🔗 Atluxity has quit IRC (Ping timeout: 252 seconds)
22:52 🔗 _desu___ has joined #archiveteam
22:52 🔗 Atluxity has joined #archiveteam
22:53 🔗 Rickster has joined #archiveteam
22:53 🔗 Fletcher has joined #archiveteam
22:53 🔗 zyphlar has joined #archiveteam
22:54 🔗 diacope has joined #archiveteam
22:56 🔗 bauruine has quit IRC (Ping timeout: 252 seconds)
22:57 🔗 bauruine has joined #archiveteam
23:00 🔗 WinterFox has joined #archiveteam
23:06 🔗 HCross godane, JesseW, are we underway and grabbing?
23:09 🔗 JesseW I just helped with coding -- godane is the one doing the grab
23:10 🔗 HCross ah ok.
23:10 🔗 HCross Thanks BTW
23:10 🔗 JesseW sure
23:10 🔗 HCross They have totally done a Yahoo
23:10 🔗 JesseW it's a dammed shame they couldn't, you know, make a torrent of all of it and seed it for a month
23:11 🔗 JesseW that would be a *responsible* way to stop hosting it...
23:11 🔗 HCross most of these companies dont have any sense
23:15 🔗 asdf has quit IRC (Quit: Leaving)
23:26 🔗 JesseW has quit IRC (Leaving.)
23:29 🔗 arkiver So how's cryengine? any help needed?
23:29 🔗 arkiver godane: can you give me an example list of links saved by your script?
23:33 🔗 Ghost_of_ has joined #archiveteam
23:37 🔗 arkiver I want to save this with POST requests too
23:42 🔗 ats has quit IRC (Quit: let's see if installing a new graphics card has got any less painful in the last ten years)
23:43 🔗 arkiver I'm only finding cat_id 87
23:44 🔗 arkiver nevermind
23:47 🔗 godane curl -I -A 'Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Firefox/38.0 Iceweasel/38.3.0' 'http://www.cryengine.com/community/downloads.php?submit=Download&df_id='1'&view=load
23:48 🔗 godane ok
23:48 🔗 arkiver so no WARCs?
23:51 🔗 arkiver anyway, please continue the grab godane
23:51 🔗 dashcloud has quit IRC (Read error: Operation timed out)
23:51 🔗 arkiver will do my best to also grab these in WARCs
23:52 🔗 godane so i got the WARC of the pages
23:53 🔗 arkiver ok, I'll do them for the POST requests and actual files too then
23:53 🔗 godane 6000 pages but only 2636 exist
23:53 🔗 godane the 2636 that have files anyways
23:54 🔗 godane http://pastebin.com/DfXyp6nu
23:56 🔗 dashcloud has joined #archiveteam
23:57 🔗 arkiver thanks!

irclogger-viewer