#archiveteam 2017-04-14,Fri

Time Nickname Message
00:11 hive-mind has quit IRC (Ping timeout: 260 seconds)
00:13 hive-mind has joined #archiveteam
00:28 odemg has quit IRC (Remote host closed the connection)
00:29 odemg has joined #archiveteam
00:53 RichardG_ is now known as RichardG
01:18 Stilett0 has quit IRC (Read error: Operation timed out)
01:50 Stilett0 has joined #archiveteam
01:59 Stilett0 has quit IRC (Ping timeout: 370 seconds)
02:56 icedice has quit IRC (Quit: Leaving)
02:57 pizzaiolo has left
03:31 Guest7383 has joined #archiveteam
04:22 dashcloud has quit IRC (Ping timeout: 260 seconds)
04:23 Sk1d has joined #archiveteam
04:27 dashcloud has joined #archiveteam
04:42 arbin has quit IRC (Read error: Operation timed out)
04:53 Stilett0 has joined #archiveteam
04:54 Stilett0 is now known as Stiletto
05:04 ndiddy has quit IRC ()
05:44 dashcloud has quit IRC (Read error: Operation timed out)
05:47 dashcloud has joined #archiveteam
06:27 tuxy has joined #archiveteam
06:29 tuxy Hello guys!
06:30 tuxy How can I download the Yoyo sandbox completely?
06:34 tuxy How is the sandbox even archived?
06:34 db48x everything the Archive Team saved for the Gamemaker Sandbox is available here: https://archive.org/details/archiveteam_gamemaker
06:35 db48x these are all WARC files, with CDX files which are indexes
06:36 tuxy There seem to be multiple files.
06:36 tuxy Ok
06:36 tuxy I'm new to archive formats
06:37 tuxy How do I download all of them and view them? Is there a guide? Thanks
06:38 db48x well, the collection has 62 items
06:38 db48x each item has some files associated with it
06:38 db48x actually, on closer inspection only 15 of those items are WARCs; the other 47 appear to be single executable files extracted from those WARCs
06:39 db48x a WARC file is a record of every HTTP request that the archiving team made, and the response that was received
06:40 db48x there are several tools you can use to view them
06:41 db48x probably the simplest would be a proxy that serves up content from them to your web browser
06:42 db48x https://github.com/alard/warc-proxy, for example, or https://github.com/internetarchive/warcprox
06:43 tuxy The second one says it's a writing MITM proxy
06:46 tuxy So can I still help other projects by running Warrior?
06:47 tuxy Does that still work?
06:47 tuxy http://archiveteam.org/index.php?title=ArchiveTeam_Warrior
06:48 db48x yep
06:48 tuxy Does it work automatically?
06:49 db48x yea, the warrior is automatic
06:49 db48x you can see all the projects that might have work to do at http://tracker.archiveteam.org/
06:51 tuxy BTW how does viewing multiple WARCs work with the Sandbox? What if I wanted to search for a game? Is that what CDX files are for?
06:53 db48x I believe that alard's warc-proxy simply shows you everything in every WARC in the directory that you ran it from
06:55 db48x the CDX is a straightforward text file that tells you where each HTTP request/response is within a WARC:
06:55 db48x com,yoyogames,sandbox)/extras/image/name/san1/0/116000/large/loader.jpg?1215841807 20141015224914 http://sandbox.yoyogames.com/extras/image/name/san1/0/116000/large/loader.jpg?1215841807 image/jpeg 200 JVQG5RRC3QJXS2RPAJ3D6EY3YUK5UB24 - - 3704 4388284889 archiveteam_gamemaker_20141118085013/gamemaker_20141118085013.megawarc.warc.gz
06:55 db48x com,yoyogames,sandbox)/extras/image/name/san1/0/116000/thumb/loader.jpg?1215841807 20141015224919 http://sandbox.yoyogames.com/extras/image/name/san1/0/116000/thumb/loader.jpg?1215841807 image/jpeg 200 WK44LBO4USBHOAHXA4J4YXNGLLQNAXCT - - 1043 4388388280 archiveteam_gamemaker_20141118085013/gamemaker_20141118085013.megawarc.warc.gz
06:55 db48x com,yoyogames,sandbox)/extras/image/name/san1/0/119000/large/opo.jpg?1216567269 20141015233541 http://sandbox.yoyogames.com/extras/image/name/san1/0/119000/large/opo.jpg?1216567269 image/jpeg 200 S5HDXK7SDLPHVVSZHZCKV7SL4B37TZ4A - - 3280 41313654721 archiveteam_gamemaker_20141118085013/gamemaker_20141118085013.megawarc.warc.gz
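[Editor's sketch, not part of the conversation] The three CDX lines pasted above each have 11 space-separated fields. Assuming they follow the common "N b a m s k r M S V g" CDX layout (an assumption; the log does not state the field order), the last three fields are the compressed record length, the byte offset into the WARC, and the WARC file name. The Python below is a minimal sketch under that assumption; `CdxEntry`, `parse_cdx_line`, and `read_record` are hypothetical names chosen for illustration. It also assumes each record in the `.warc.gz` is its own gzip member, which is the usual convention for compressed WARCs.

```python
import gzip
from typing import NamedTuple


class CdxEntry(NamedTuple):
    """One line of an 11-field CDX index (field order is an assumption)."""
    urlkey: str
    timestamp: str
    url: str
    mimetype: str
    status: str
    digest: str
    length: int    # compressed size of the record, in bytes
    offset: int    # byte offset of the record inside the .warc.gz
    filename: str  # which WARC file holds the record


def parse_cdx_line(line: str) -> CdxEntry:
    """Split one CDX line into named fields."""
    fields = line.split()
    if len(fields) != 11:
        raise ValueError(f"expected 11 fields, got {len(fields)}")
    (urlkey, timestamp, url, mimetype, status, digest,
     _redirect, _meta, length, offset, filename) = fields
    return CdxEntry(urlkey, timestamp, url, mimetype, status, digest,
                    int(length), int(offset), filename)


def read_record(warc_path: str, offset: int, length: int) -> bytes:
    """Read and decompress a single record from a per-record-gzipped WARC."""
    with open(warc_path, "rb") as warc:
        warc.seek(offset)
        member = warc.read(length)
    return gzip.decompress(member)
```

With an entry parsed from the index, `read_record(local_path, entry.offset, entry.length)` would return the raw WARC record (WARC headers, HTTP headers, and body) without scanning the rest of the file.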
06:56 db48x you could save yourself some time by downloading all of the CDX files first, then grepping them for the URLs you want
06:56 db48x then downloading the correct WARC
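[Editor's sketch, not part of the conversation] The "grep the CDX files first, then download the right WARC" workflow can be sketched in Python as below. It assumes the same 11-field CDX layout as the sample lines in the log, with the original URL in the third field and the WARC file name in the last; `warcs_for_url` is a hypothetical helper name, and the search is a plain substring match rather than anything smarter.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def warcs_for_url(cdx_lines: Iterable[str], needle: str) -> Dict[str, List[str]]:
    """Map each WARC file name to the matching URLs it contains.

    cdx_lines: lines from one or more CDX index files.
    needle:    substring to look for in the original URL.
    """
    hits: Dict[str, List[str]] = defaultdict(list)
    for line in cdx_lines:
        fields = line.split()
        # Skip blank lines, header lines, and anything not in the 11-field layout.
        if len(fields) != 11 or fields[0] == "CDX":
            continue
        original_url, warc_name = fields[2], fields[-1]
        if needle in original_url:
            hits[warc_name].append(original_url)
    return dict(hits)
```

Feeding it the lines of every downloaded `.cdx` file (for example by opening each file and chaining the line iterators) gives the set of WARC files worth downloading for a given game or image URL.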
06:56 schbirid has joined #archiveteam
07:08 odemg has quit IRC (Remote host closed the connection)
07:10 odemg has joined #archiveteam
07:23 odemg has quit IRC (Quit: fucked right off!!)
07:49 tuxy Guys I'm getting ERR_EMPTY_RESPONSE
07:50 tuxy When I try to access warrior from my browser
07:51 tuxy I'll try using VirtualBox instead of VMware
07:55 topdownji has quit IRC (Remote host closed the connection)
07:56 topdownji has joined #archiveteam
07:56 tuxy Guys I live in India. Is it okay for you if I run warrior?
07:57 hive-mind has quit IRC (Ping timeout: 260 seconds)
07:57 hive-mind has joined #archiveteam
07:57 tuxy A few sites are banned like adfly and piratebay
08:07 anhedonis has quit IRC (Read error: Operation timed out)
08:07 anhedonis has joined #archiveteam
08:29 Simpbrain has joined #archiveteam
08:29 zino has joined #archiveteam
08:31 tuxy has quit IRC (Ping timeout: 268 seconds)
09:55 _Zialus_ has joined #archiveteam
09:55 Zialus has quit IRC (Read error: Operation timed out)
09:59 numba has joined #archiveteam
10:00 numba Do you guys check from multiple warriors for blocks and scraping errors to make sure data is clean?
10:01 numba My warrior is not working
10:01 numba This page isn't working. localhost didn't send any data. ERR_EMPTY_RESPONSE
10:01 numba http://localhost:8001/
10:02 numba I really want to run a warrior, please help me with the error
10:26 pizzaiolo has joined #archiveteam
10:30 zenguy has quit IRC (Read error: Operation timed out)
10:31 pizzaiolo Fossamail is shutting down in May: https://www.reddit.com/r/linux/comments/65awd3/fossamail_to_shut_down_in_may_2017/
10:34 zenguy has joined #archiveteam
10:36 pizzaiolo I did an archivebot job for it, not sure if everything came through though
10:52 BlueMaxim has quit IRC (Read error: Operation timed out)
11:28 khaoohs has quit IRC (Ping timeout: 1208 seconds)
11:33 numba has quit IRC (Ping timeout: 268 seconds)
11:35 RichardG_ has joined #archiveteam
11:35 RichardG has quit IRC (Read error: Connection reset by peer)
11:48 anhedonis has quit IRC (Remote host closed the connection)
11:59 dashcloud has quit IRC (Read error: Operation timed out)
12:02 dashcloud has joined #archiveteam
12:20 odemg has joined #archiveteam
13:11 kristian_ has joined #archiveteam
13:30 anhedonis has joined #archiveteam
13:44 Guest7383 has quit IRC (Quit: Bye)
14:21 ndiddy has joined #archiveteam
14:36 icedice has joined #archiveteam
15:11 kristian_ has quit IRC (Quit: Leaving)
15:42 LastNinja has quit IRC (byeeee)
15:58 RichardG_ is now known as RichardG
16:02 Aranje has joined #archiveteam
16:37 _Zialus_ has quit IRC (i'm out!)
17:28 nsfmc has quit IRC (Quit: Connection closed for inactivity)
17:42 MMovie has quit IRC (Read error: Operation timed out)
17:46 Simpbrain has quit IRC (Read error: Connection reset by peer)
18:04 kris33 has quit IRC (Remote host closed the connection)
18:09 MMovie has joined #archiveteam
18:20 zino has quit IRC (Remote host closed the connection)
18:21 nsfmc has joined #archiveteam
18:45 kevinr has quit IRC ()
18:47 kevinr has joined #archiveteam
19:43 DFJustin has quit IRC (Remote host closed the connection)
19:52 DFJustin has joined #archiveteam
20:25 anhedonis has quit IRC (Read error: Operation timed out)
21:08 LastNinja has joined #archiveteam
21:16 icedice has quit IRC (Quit: Leaving)
21:29 Madchen has joined #archiveteam
21:55 dashcloud has quit IRC (Read error: Operation timed out)
21:59 dashcloud has joined #archiveteam
22:36 schbirid has quit IRC (Quit: Leaving)
22:46 kristian_ has joined #archiveteam
22:55 Odd0002_ has joined #archiveteam
22:55 Odd0002 has left
22:56 dashcloud has quit IRC (Read error: Operation timed out)
23:24 BlueMaxim has joined #archiveteam
23:35 bsmith093 has joined #archiveteam
23:48 dashcloud has joined #archiveteam
23:58 dashcloud has quit IRC (Read error: Operation timed out)
