#archiveteam 2014-12-20,Sat

↑back Search

Time Nickname Message
00:00 πŸ”— dashcloud has quit IRC (Ping timeout: 265 seconds)
00:06 πŸ”— Nertsy has quit IRC (Read error: Connection reset by peer)
00:06 πŸ”— dashcloud has joined #archiveteam
00:11 πŸ”— Nertsy has joined #archiveteam
00:28 πŸ”— schbirid has quit IRC (Leaving)
01:11 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
01:12 πŸ”— Start has joined #archiveteam
01:15 πŸ”— dashcloud has joined #archiveteam
01:39 πŸ”— primus104 has quit IRC (Leaving.)
01:45 πŸ”— BiggieJo1 has joined #archiveteam
01:48 πŸ”— BiggieJon has quit IRC (Read error: Operation timed out)
01:49 πŸ”— Start has quit IRC (Ping timeout: 492 seconds)
02:11 πŸ”— ruukasu has quit IRC (Quit: WeeChat 1.0.1)
02:11 πŸ”— ruukasu has joined #archiveteam
02:42 πŸ”— signius has quit IRC (Ping timeout: 480 seconds)
02:51 πŸ”— signius has joined #archiveteam
02:56 πŸ”— brayden has quit IRC (Ping timeout: 606 seconds)
02:59 πŸ”— brayden has joined #archiveteam
03:03 πŸ”— mistym has quit IRC (Remote host closed the connection)
04:17 πŸ”— Nertsy has quit IRC (Read error: Connection reset by peer)
04:20 πŸ”— Nertsy has joined #archiveteam
05:01 πŸ”— aaaaaaaaa has quit IRC (Leaving)
05:06 πŸ”— Swizzle has quit IRC (Quit: HydraIRC -> http://www.hydrairc.com <- Would you like to know more?)
05:52 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
05:56 πŸ”— dashcloud has joined #archiveteam
06:00 πŸ”— aschmitz has joined #archiveteam
06:31 πŸ”— Nertsy has quit IRC (Remote host closed the connection)
06:31 πŸ”— Nertsy has joined #archiveteam
06:52 πŸ”— mistym has joined #archiveteam
07:10 πŸ”— rejon has quit IRC (Read error: Operation timed out)
07:28 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
07:32 πŸ”— dashcloud has joined #archiveteam
07:37 πŸ”— primus104 has joined #archiveteam
08:17 πŸ”— primus104 has quit IRC (Leaving.)
08:32 πŸ”— Ymgve has joined #archiveteam
08:51 πŸ”— indigo_ has quit IRC (Remote host closed the connection)
08:55 πŸ”— brayden has quit IRC (Quit: Leaving)
09:14 πŸ”— brayden has joined #archiveteam
09:46 πŸ”— schbirid has joined #archiveteam
10:24 πŸ”— mistym has quit IRC (Remote host closed the connection)
10:24 πŸ”— Ymgve__ has joined #archiveteam
10:32 πŸ”— Ymgve has quit IRC (Ping timeout: 512 seconds)
10:35 πŸ”— primus104 has joined #archiveteam
10:47 πŸ”— arkiver So I have my two week holiday!
10:47 πŸ”— arkiver Will be working on the upcoming projects for the warrior
10:48 πŸ”— arkiver SketchCow: can the halo project start again?
10:49 πŸ”— primus104 has quit IRC (Leaving.)
10:57 πŸ”— APerti has quit IRC (Ping timeout: 370 seconds)
11:44 πŸ”— primus104 has joined #archiveteam
12:10 πŸ”— BlueMaxim has quit IRC (Quit: Leaving)
13:54 πŸ”— Nertsy` has joined #archiveteam
13:54 πŸ”— Nertsy has quit IRC (Read error: Connection reset by peer)
14:02 πŸ”— Nemo_bis has quit IRC (Remote host closed the connection)
14:08 πŸ”— hive-mind has quit IRC (Ping timeout: 272 seconds)
14:15 πŸ”— hive-mind has joined #archiveteam
14:19 πŸ”— Daloader_ has joined #archiveteam
14:21 πŸ”— T31M has joined #archiveteam
14:24 πŸ”— Nertsy` has quit IRC (Remote host closed the connection)
14:25 πŸ”— Nertsy has joined #archiveteam
14:25 πŸ”— chazchaz has quit IRC (Read error: Connection reset by peer)
14:25 πŸ”— chazchaz has joined #archiveteam
14:25 πŸ”— Laverne has quit IRC (Ping timeout: 369 seconds)
14:26 πŸ”— Laverne has joined #archiveteam
14:26 πŸ”— T31m_ has quit IRC (Read error: Operation timed out)
14:31 πŸ”— chazchaz has quit IRC (Remote host closed the connection)
14:32 πŸ”— T31M has quit IRC (Read error: Operation timed out)
14:32 πŸ”— Daloader_ has quit IRC (Read error: Operation timed out)
14:36 πŸ”— chazchaz has joined #archiveteam
14:40 πŸ”— smither has joined #archiveteam
14:40 πŸ”— smither hi there
14:40 πŸ”— smither I’ve been trying to do a grab of cbc.ca/Q
14:40 πŸ”— smither but something prevents wget from retriving more than one page
14:41 πŸ”— smither I’m using user-agent=β€œnot Google” but that ain’t tricking the machine
14:41 πŸ”— Daloader_ has joined #archiveteam
14:42 πŸ”— smither (for background, CBC is deleting some of its archives because one of their anchor turned out to be a rapist. But it’s problematic because they’re erasing a lot of info for journalists)
14:42 πŸ”— smither http://www.huffingtonpost.ca/2014/12/17/ghomeshi-q-archives_n_6340882.html
14:46 πŸ”— Jonimus has quit IRC (Excess Flood)
14:47 πŸ”— Jonimus has joined #archiveteam
14:48 πŸ”— BiggieJo1 is now known as BiggieJ
14:49 πŸ”— Jonimus has quit IRC (Excess Flood)
14:50 πŸ”— Jonimus has joined #archiveteam
14:51 πŸ”— eprillios has quit IRC (Ping timeout: 369 seconds)
14:51 πŸ”— eprillios has joined #archiveteam
14:52 πŸ”— godane smither: looks like the podcast rss archived very often at least
14:52 πŸ”— godane https://web.archive.org/web/*/http://www.cbc.ca/podcasting/includes/qpodcast.xml
14:52 πŸ”— smither so it should be fine ?
14:53 πŸ”— godane no
14:53 πŸ”— godane i'm grabbing the mp3s right now
14:54 πŸ”— godane also from what i can tell 2011 mp3 urls don't work anymore
14:58 πŸ”— smither any idea why my wget didn’t work ?
14:58 πŸ”— smither I used wget -mc --no-parent --no-clobber --adjust-extension --user-agent="not Google" --convert-links --page-requisites cbc.ca/q
15:02 πŸ”— T31m_ has joined #archiveteam
15:04 πŸ”— ohhdemgir has quit IRC (Read error: Operation timed out)
15:04 πŸ”— brayden has quit IRC (Read error: Operation timed out)
15:05 πŸ”— Nertsy has quit IRC (Read error: Connection reset by peer)
15:05 πŸ”— godane i tryed my own way and it will not mirror either
15:05 πŸ”— godane wget --mirror cbc.ca/q -U "firefox" -e robots=off --warc-file=cbc-q --warc-cdx -E -o wget.log
15:06 πŸ”— ohhdemgir has joined #archiveteam
15:07 πŸ”— Nertsy has joined #archiveteam
15:11 πŸ”— zenguy_pc has quit IRC (Excess Flood)
15:12 πŸ”— smither so it’s not the robot?
15:12 πŸ”— zenguy_pc has joined #archiveteam
15:14 πŸ”— Daloader_ has quit IRC (Read error: Operation timed out)
15:15 πŸ”— brayden has joined #archiveteam
15:23 πŸ”— Nemo_bis has joined #archiveteam
15:27 πŸ”— T31M has joined #archiveteam
15:29 πŸ”— primus104 has quit IRC (Leaving.)
15:31 πŸ”— Nertsy has quit IRC (Remote host closed the connection)
15:31 πŸ”— Nertsy has joined #archiveteam
15:34 πŸ”— T31m_ has quit IRC (Read error: Operation timed out)
15:39 πŸ”— Fusl am i currently the only one mirroring wallbase?
15:39 πŸ”— smither has quit IRC (smither)
15:40 πŸ”— Nertsy has quit IRC (Remote host closed the connection)
15:41 πŸ”— Fusl i expect the server to go down beginning 2015, can someone please be so kind and mirror it and put the mirror in the wallbase mirror list?
15:41 πŸ”— Nertsy has joined #archiveteam
15:48 πŸ”— Nertsy` has joined #archiveteam
15:48 πŸ”— Nertsy has quit IRC (Read error: Connection reset by peer)
15:56 πŸ”— goekesmi has quit IRC (Ping timeout: 369 seconds)
16:04 πŸ”— goekesmi has joined #archiveteam
16:05 πŸ”— Kenshin Fusl: has the site been taken down? officially propose to push to IA?
16:05 πŸ”— aaaaaaaaa has joined #archiveteam
16:07 πŸ”— Daloader_ has joined #archiveteam
16:07 πŸ”— arkiver Fusl: yeah, sure
16:08 πŸ”— Fusl the site has not been taken down
16:08 πŸ”— Fusl but the host node where i'm hosting that entire thing on (it's about 1.3TB huge) will be cancelled
16:08 πŸ”— Fusl because of the lack of money
16:08 πŸ”— arkiver Fusl: you're talking about this right? http://archive_wallbase.cc.mirror.fuslvz.ws/
16:08 πŸ”— Fusl yes, but there is rsync on this mirror
16:09 πŸ”— Fusl rsync://mirror.fuslvz.ws/archive_wallbase.cc/
16:10 πŸ”— goekesmi has quit IRC (Read error: Connection reset by peer)
16:10 πŸ”— indigo_ has joined #archiveteam
16:13 πŸ”— T31M has quit IRC (Read error: Operation timed out)
16:14 πŸ”— goekesmi has joined #archiveteam
16:27 πŸ”— Kenshin iomart?
16:28 πŸ”— Kenshin arkiver: do you have anywhere i can dump the halo stuff i was holding while FOS was down?
16:28 πŸ”— T31M has joined #archiveteam
16:28 πŸ”— Kenshin i have qwiki with me too
16:28 πŸ”— arkiver halo: https://archive.org/details/archiveteam_halo
16:28 πŸ”— Kenshin if u can get those off me i'll have space for Fusl
16:28 πŸ”— arkiver But SketchCow needs to give you acces to upload to there
16:28 πŸ”— Kenshin arkiver: rsync target
16:28 πŸ”— Kenshin i'm holding rsync data only
16:29 πŸ”— arkiver ah ok
16:29 πŸ”— arkiver we might be able to use FOS's rsync, but I'm not sure if we can already start uploading to that one
16:29 πŸ”— Kenshin will need to ask SketchCow i guess
16:29 πŸ”— Kenshin what about qwiki?
16:29 πŸ”— arkiver If SketchCow thinks FOS is fine again, we can move your stuff to FOS
16:29 πŸ”— Kenshin it's not a lot, about 700MB
16:30 πŸ”— arkiver qwiki the same
16:30 πŸ”— arkiver We'll have to wait for SketchCow, what he says
16:30 πŸ”— Kenshin k. if he gives the go ahead then i'll have 1.5T for Fusl's stuff
16:30 πŸ”— arkiver yes
16:30 πŸ”— Kenshin alternatively, fusl push straight to IA if it's get approved
16:31 πŸ”— arkiver how much of halo do you have?
16:31 πŸ”— Kenshin since the website is pretty much dead
16:31 πŸ”— Fusl Kenshin: if you explain how, i can do that :)
16:31 πŸ”— Kenshin arkiver: 1.7T
16:31 πŸ”— arkiver Kenshin: ok, that'd be fine
16:31 πŸ”— Kenshin Fusl: no idea. i've always left uploading to yip
16:32 πŸ”— Fusl hm
16:32 πŸ”— Kenshin arkiver: what would be the best way to push 1.3t of website data to IA?
16:34 πŸ”— arkiver Kenshin: megawarc it and upload it to the collection, with https://pypi.python.org/pypi/internetarchive or https://github.com/kngenie/ias3upload
16:34 πŸ”— arkiver Fusl: I'd suggest using one of the above tools ^ to upload all the stuff to IA
16:38 πŸ”— primus104 has joined #archiveteam
16:38 πŸ”— T31m_ has joined #archiveteam
16:39 πŸ”— Fusl the lastter one, what .csv files do i need exactly?
16:39 πŸ”— Fusl or don't i need them?
16:41 πŸ”— Daloader_ has quit IRC (Read error: Operation timed out)
16:42 πŸ”— Daloader_ has joined #archiveteam
16:44 πŸ”— w0rp has quit IRC (Ping timeout: 1221 seconds)
16:45 πŸ”— w0rp has joined #archiveteam
16:45 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
16:45 πŸ”— ruukasu has quit IRC (Quit: WeeChat 1.0.1)
16:45 πŸ”— ruukasu has joined #archiveteam
16:46 πŸ”— T31M has quit IRC (Read error: Operation timed out)
16:48 πŸ”— arkiver Fusl: in the csv file you'll write the information for your item, like the identifier, title, description, tags, etc. https://github.com/kngenie/ias3upload/blob/master/metadata.csv
16:48 πŸ”— dashcloud has joined #archiveteam
16:50 πŸ”— T31m_ has quit IRC (Read error: Operation timed out)
16:50 πŸ”— Fusl unfortunately too much work at the moment
16:50 πŸ”— Fusl i have to move other stuff :/
16:54 πŸ”— primus104 has quit IRC (Leaving.)
16:57 πŸ”— arkiver Fusl: I'm mirroring parts of it now
16:57 πŸ”— arkiver will get folders with images up in IA
16:57 πŸ”— Fusl arkiver: if you want, i can throw your ssh key in the server so you can put it up on IA directly from there...?
16:58 πŸ”— arkiver nah, it's going fine this way
16:58 πŸ”— Fusl k
16:59 πŸ”— arkiver I'll put them in seperate items for each folder. So the images from http://archive_wallbase.cc.mirror.fuslvz.ws/siterip/images/0000000/ will have the collection wallbase.cc-rip-0000000
17:11 πŸ”— arkiver Fusl: test item: https://archive.org/details/test_wallbase.cc-rip-0000000
17:11 πŸ”— arkiver looks good? (still uploading, not derived yet)
17:12 πŸ”— Fusl neat
17:14 πŸ”— arkiver Fusl: what's the full size of everything in this directory? http://archive_wallbase.cc.mirror.fuslvz.ws/siterip/images/
17:14 πŸ”— arkiver if not too big I'll put everything in one item
17:15 πŸ”— Fusl calculating ...
17:15 πŸ”— Fusl 1.2T
17:50 πŸ”— rejon has joined #archiveteam
18:06 πŸ”— mistym has joined #archiveteam
18:13 πŸ”— bsmith093 has quit IRC (Read error: Operation timed out)
18:15 πŸ”— db48x has quit IRC (Ping timeout: 258 seconds)
18:27 πŸ”— bsmith093 has joined #archiveteam
18:43 πŸ”— APerti has joined #archiveteam
19:49 πŸ”— okeuday has joined #archiveteam
20:03 πŸ”— SketchCow What
20:03 πŸ”— SketchCow Kenshin: FOS, go ahead.
20:03 πŸ”— SketchCow Sorry for lack of response, maniacs
20:04 πŸ”— SketchCow /join #aside
20:05 πŸ”— rejon has quit IRC (Read error: Operation timed out)
20:17 πŸ”— SketchCow How did I miss #roon
20:19 πŸ”— thechip_ has quit IRC (Read error: Operation timed out)
20:22 πŸ”— SketchCow Anyway, I guess we're doing roon. I'm doing the groupings now.
20:24 πŸ”— primus104 has joined #archiveteam
20:31 πŸ”— fluff is now known as fluff_
20:34 πŸ”— godane I'm in #roon on irc.efnet.net and is no one there
20:35 πŸ”— chfoo #rooined
20:41 πŸ”— Kenshin SketchCow: do you have the rsync urls for qwiki and halo?
20:47 πŸ”— SketchCow chfoo does
20:48 πŸ”— ohhdemgir has quit IRC (Read error: Operation timed out)
20:49 πŸ”— ohhdemgir has joined #archiveteam
20:54 πŸ”— chfoo Kenshin: for fos i assume: rsync://fos.textfiles.com/chfoo/warrior/qwiki/:downloader/ & rsync://fos.textfiles.com/chfoo/warrior/halo/:downloader/
20:56 πŸ”— Kenshin cool thanks, much appreciated
20:56 πŸ”— chfoo replace :downloader with a nickname
20:56 πŸ”— mistym has quit IRC (Remote host closed the connection)
21:11 πŸ”— BlueMaxim has joined #archiveteam
21:29 πŸ”— wp494 has quit IRC ()
21:31 πŸ”— wp494 has joined #archiveteam
21:37 πŸ”— bzc6p has joined #archiveteam
21:44 πŸ”— bzc6p chfoo, arkiver: In a template I made on the wiki, I included "some additional information" I miss from the script documentations on GitHub. Could you please include those pieces of information when you next time create projects on GitHub?
21:45 πŸ”— bzc6p (I mean the missing ones, about the concurrency, stopping the script, and what to do when outdated – even rephrased if necessary.) I think these are important for newcomers, but if they were on github, I could remove them from the wiki. Thank you.
21:46 πŸ”— chfoo the template we're using is located at https://github.com/ArchiveTeam/standalone-readme-template
21:48 πŸ”— bzc6p Could you please expand that then? I don't want to create a github account just for that.
21:49 πŸ”— bzc6p (Of course if you too find it a good idea.)
21:50 πŸ”— ersi bzc6p: Where's your template on the wiki?
21:51 πŸ”— bzc6p ersi: http://archiveteam.org/index.php?title=Template:Howcanihelp
21:51 πŸ”— bzc6p Sorry, I indeed forgot to name it...
21:51 πŸ”— dashcloud has quit IRC (Read error: Operation timed out)
21:52 πŸ”— bzc6p the additional info is put into a collapsible box
21:54 πŸ”— ersi I'll take a look at merging it. I do, have a GitHub account. :)
21:56 πŸ”— dashcloud has joined #archiveteam
22:11 πŸ”— schbirid has quit IRC (Leaving)
22:50 πŸ”— wp494 has quit IRC ()
22:52 πŸ”— wp494 has joined #archiveteam
23:04 πŸ”— Start has joined #archiveteam
23:16 πŸ”— bzc6p has left
23:18 πŸ”— Start has quit IRC (Ping timeout: 606 seconds)
23:23 πŸ”— primus has joined #archiveteam

irclogger-viewer