#archiveteam-bs 2015-07-08,Wed

Time Nickname Message
00:01 🔗 RedType has joined #archiveteam-bs
00:06 🔗 Stiletto has joined #archiveteam-bs
00:08 🔗 fx_ has quit IRC (Remote host closed the connection)
00:13 🔗 mistym has quit IRC (Remote host closed the connection)
00:29 🔗 mistym has joined #archiveteam-bs
00:47 🔗 Ctrl-S um i might?
00:47 🔗 Ctrl-S if you dont have another source poke me and i'll dig around for it
00:49 🔗 BlueMaxim has joined #archiveteam-bs
01:15 🔗 godane can anyone download this: https://www.youtube.com/watch?v=i3rZmnJ66Po
01:16 🔗 godane its blocked in the USA
01:17 🔗 godane i have viewed it before it got blocked
01:24 🔗 primus104 has quit IRC (Leaving.)
01:42 🔗 szalwia godane: http://tv.236chan.net/i3rZmnJ66Po.mp4
01:42 🔗 szalwia blocked in my country as well
01:42 🔗 szalwia had to use tor + proxychains + youtube-dl :)
02:02 🔗 asparagir has joined #archiveteam-bs
02:36 🔗 asparagir has quit IRC (asparagir)
02:48 🔗 asparagir has joined #archiveteam-bs
03:05 🔗 godane thanks
03:44 🔗 vitzli has joined #archiveteam-bs
04:04 🔗 bsmith093 has quit IRC (Read error: Operation timed out)
04:08 🔗 bsmith093 has joined #archiveteam-bs
04:10 🔗 mistym has quit IRC (Remote host closed the connection)
04:16 🔗 vitzli has quit IRC (Quit: Leaving)
04:18 🔗 kyan webarchiveplayer is so cool :D (https://github.com/ikreymer/webarchiveplayer) just discovered it
04:28 🔗 godane i got to really get that working in a mesh network
04:29 🔗 godane cause if you can meshnet web archives then you could ship websites on dvds/usb drives
04:30 🔗 kyan I'm using it in a "portable library" i am trying to build
04:31 🔗 kyan my goal is to collect various great and popular cultural works, reference works, software, and other resources into a library that could be carried around in one's pocket
04:31 🔗 godane RACHEL project
04:31 🔗 kyan (that isn't reliant on the internet)
04:31 🔗 xmc libc is a good library
04:31 🔗 xmc ... i'll show myself out
04:32 🔗 mistym has joined #archiveteam-bs
04:37 🔗 aaaaaaaaa has quit IRC (Leaving)
04:57 🔗 godane szalwia: looks like this one got blocked also: https://www.youtube.com/watch?v=aHmzQCAoRDY
04:57 🔗 godane but its completely blocked i think
04:58 🔗 ersi has quit IRC (Ping timeout: 258 seconds)
05:00 🔗 Stiletto has quit IRC (hub.efnet.us irc.Prison.NET)
05:00 🔗 kyan has quit IRC (hub.efnet.us irc.Prison.NET)
05:00 🔗 Kenshin has quit IRC (hub.efnet.us irc.Prison.NET)
05:00 🔗 sunnymilk has quit IRC (hub.efnet.us irc.Prison.NET)
05:00 🔗 wyatt8740 has quit IRC (hub.efnet.us irc.Prison.NET)
05:04 🔗 McGEE has joined #archiveteam-bs
05:04 🔗 ersi has joined #archiveteam-bs
05:10 🔗 godane szalwia: https://www.youtube.com/watch?v=MQyn4AkkGHg
05:10 🔗 godane that one is just blocked by country
05:11 🔗 kyan_ has joined #archiveteam-bs
05:11 🔗 Stiletto has joined #archiveteam-bs
05:11 🔗 Kenshin has joined #archiveteam-bs
05:11 🔗 sunnymilk has joined #archiveteam-bs
05:12 🔗 sunnymilk has quit IRC (Ping timeout: 258 seconds)
05:12 🔗 Stilett0 has joined #archiveteam-bs
05:12 🔗 sunnymilk has joined #archiveteam-bs
05:13 🔗 Stiletto has quit IRC (Ping timeout: 258 seconds)
05:13 🔗 wyatt8740 has joined #archiveteam-bs
05:26 🔗 kyan_ is now known as kyan
05:28 🔗 wyatt8740 http://i.imgur.com/Z6AIgAv.jpg
05:28 🔗 wyatt8740 I'm astonished
05:28 🔗 wyatt8740 (this does work, by the way, if you share a common ground between the jacks)
05:41 🔗 asparagir has quit IRC (asparagir)
06:03 🔗 godane szalwia: https://www.youtube.com/watch?v=9cgDVFBCiuw
06:04 🔗 godane same copyright problem as last one
06:05 🔗 godane funny thing is fellowship of the ring i can still get
06:07 🔗 godane this one is blocked on copyright grounds: https://www.youtube.com/watch?v=D5CDhs6zmlo
06:23 🔗 kyan has quit IRC (Read error: Connection reset by peer)
06:31 🔗 kyan has joined #archiveteam-bs
06:44 🔗 xmc wyatt8740: makes sense, just don't bump the table
06:45 🔗 wyatt8740 xmc: yeah :) Just surprised the lego hands were a perfect fit
06:45 🔗 wyatt8740 never would have thought of it
06:46 🔗 xmc hm, indeed
06:52 🔗 mistym has quit IRC (Remote host closed the connection)
07:27 🔗 joepie91 zhongfu: TPB torrents are dead
07:27 🔗 joepie91 godane: I could see the first, not the second
07:28 🔗 joepie91 not the third either
07:33 🔗 schbirid has joined #archiveteam-bs
07:52 🔗 bzc6p_ is now known as bzc6p
07:52 🔗 mistym has joined #archiveteam-bs
07:53 🔗 primus104 has joined #archiveteam-bs
08:06 🔗 mistym has quit IRC (Ping timeout: 606 seconds)
08:17 🔗 primus104 has quit IRC (Leaving.)
08:45 🔗 primus104 has joined #archiveteam-bs
08:47 🔗 godane good news everyone
08:47 🔗 godane i found a way to do dailymail.co.uk audit
08:47 🔗 godane daily sitemap: http://www.dailymail.co.uk/sitemap-articles-day~2011-01-01.xml
09:15 🔗 godane uploaded: https://archive.org/details/www.dailymail.co.uk-articles-1996-20150708
09:18 🔗 godane uploaded: https://archive.org/details/www.dailymail.co.uk-articles-1997-20150708
09:22 🔗 bzc6p_ has joined #archiveteam-bs
09:25 🔗 bzc6p__ has joined #archiveteam-bs
09:26 🔗 bzc6p has quit IRC (Read error: Operation timed out)
09:31 🔗 godane uploaded: https://archive.org/details/www.dailymail.co.uk-articles-1998-20150708
09:31 🔗 godane uploaded: https://archive.org/details/www.dailymail.co.uk-articles-1999-20150708
09:32 🔗 bzc6p_ has quit IRC (Read error: Operation timed out)
09:36 🔗 Ctrl-S godane is that full articles?
09:36 🔗 godane thats everything in the xml
09:37 🔗 godane there was not a lot there
09:38 🔗 Ctrl-S have you tried sequential article numbers?
09:38 🔗 godane yes but it takes way too long
09:38 🔗 Ctrl-S like http://www.dailymail.co.uk/news/article-1343359/ is valid even though it's cropped from http://www.dailymail.co.uk/news/article-1343359/Joanna-Yeates-murder-Heart-broken-boyfriend-Greg-Reardon-speaks-anguish.html
09:38 🔗 godane you can use a --content-disposition to fix that
09:39 🔗 godane anyways here is a full article from 1999-12-30: http://www.dailymail.co.uk/columnists/article-301923/Thompson-trained-gold-bathroom.html
09:44 🔗 godane anyways the article ids are not in order
09:45 🔗 godane like this article id is before one above: http://www.dailymail.co.uk/columnists/article-301900
09:45 🔗 godane but the date is march 15 2000
10:13 🔗 godane so for the grab starting in 2001 i'm doing it as daily archive grabs
10:13 🔗 godane there are over 20k urls in 2001
10:13 🔗 godane and its going to have to be done at some point
10:15 🔗 godane good news is this allows me to put the xml i used for the script into the index.txt
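
[Editor's note] A minimal sketch of how the per-day sitemap URLs for one year could be listed before a daily grab. The URL pattern is inferred only from the single sitemap link godane posted above; whether a sitemap exists for every date is an assumption, and the function and variable names are hypothetical. The script only builds URLs and does not fetch anything.

```python
from datetime import date, timedelta

# Pattern inferred from the one sitemap URL quoted in the log (assumption).
SITEMAP_PATTERN = "http://www.dailymail.co.uk/sitemap-articles-day~{day}.xml"


def daily_sitemap_urls(year):
    """Yield one sitemap URL per calendar day of the given year."""
    day = date(year, 1, 1)
    while day.year == year:
        # isoformat() gives YYYY-MM-DD, matching the date part of the quoted URL.
        yield SITEMAP_PATTERN.format(day=day.isoformat())
        day += timedelta(days=1)


urls_2001 = list(daily_sitemap_urls(2001))
print(urls_2001[0])
print(len(urls_2001))
```

The resulting list could be written to a file and passed to whatever fetch tool is used for the grab, one URL per line.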
10:24 🔗 bzc6p__ is now known as bzc6p
10:49 🔗 zhongfu rsync for hackingteam leak: zeta.znx.cc::ht
10:50 🔗 zhongfu Kazzy: I heard you wanted it
10:58 🔗 mistym has joined #archiveteam-bs
11:06 🔗 mistym has quit IRC (Ping timeout: 483 seconds)
11:08 🔗 godane has quit IRC (Ping timeout: 306 seconds)
11:43 🔗 primus104 has quit IRC (Leaving.)
11:58 🔗 BlueMaxim has quit IRC (Quit: Leaving)
12:19 🔗 Laverne thanks zhongfu
12:24 🔗 vitzli has joined #archiveteam-bs
12:34 🔗 zhongfu yw
12:35 🔗 vitzli zhongfu, what is your progress on downloading HT torrent? it's veeery slow for me
12:38 🔗 zhongfu vitzli: took me around two or three days, i'm seeding it at 100mbps right now
12:38 🔗 zhongfu max speed was around 6MB/s iirc? then there were bits where it dropped to 70KB/s
12:39 🔗 vitzli nope, it goes about to 600 kB/s for me, though I usually can do 3MB/s
12:40 🔗 zhongfu I could do 50MB/s usually too on very well seeded torrents or with multiple torrents
12:40 🔗 vitzli sometimes it does 1..1.3 MB/s
12:59 🔗 mistym has joined #archiveteam-bs
13:07 🔗 McGEE has quit IRC (Quit: Connection closed for inactivity)
13:10 🔗 mistym has quit IRC (Ping timeout: 606 seconds)
13:34 🔗 pikhq has quit IRC (Remote host closed the connection)
14:20 🔗 simpwork has joined #archiveteam-bs
14:25 🔗 simpwork has quit IRC (Remote host closed the connection)
14:25 🔗 simpwork has joined #archiveteam-bs
14:30 🔗 simpw0rk has joined #archiveteam-bs
14:32 🔗 mistym has joined #archiveteam-bs
14:33 🔗 simpwork has quit IRC (Ping timeout: 240 seconds)
14:39 🔗 primus104 has joined #archiveteam-bs
14:41 🔗 mistym has quit IRC (Remote host closed the connection)
14:51 🔗 mistym has joined #archiveteam-bs
14:52 🔗 mistym has quit IRC (Remote host closed the connection)
15:04 🔗 mistym has joined #archiveteam-bs
15:48 🔗 mistym has quit IRC (Remote host closed the connection)
16:01 🔗 mistym has joined #archiveteam-bs
16:14 🔗 garyrh has quit IRC (Quit: http://bnc4free.com/)
16:26 🔗 simpw0rk has quit IRC (Quit: AndroIRC - Android IRC Client ( http://www.androirc.com ))
16:29 🔗 garyrh has joined #archiveteam-bs
16:33 🔗 vitzli has quit IRC (Quit: Leaving)
16:44 🔗 SadDM has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 dashcloud has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 Jonimus has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 balrog has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 DFJustin has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 wp494 has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 xtr-201 has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 phillipsj has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 phiren has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 chfoo- has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 kniffy has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 useretail has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:44 🔗 Sanqui has quit IRC (ircd.choopa.net ircd.shaw.ca)
16:45 🔗 Stilett0 has quit IRC (Read error: Operation timed out)
16:45 🔗 Stiletto has joined #archiveteam-bs
16:49 🔗 Sanqui has joined #archiveteam-bs
16:49 🔗 useretail has joined #archiveteam-bs
16:49 🔗 chfoo- has joined #archiveteam-bs
16:49 🔗 phillipsj has joined #archiveteam-bs
16:49 🔗 xtr-201 has joined #archiveteam-bs
16:49 🔗 wp494 has joined #archiveteam-bs
16:49 🔗 phiren has joined #archiveteam-bs
16:49 🔗 DFJustin has joined #archiveteam-bs
16:49 🔗 balrog has joined #archiveteam-bs
16:49 🔗 kniffy has joined #archiveteam-bs
16:49 🔗 Jonimus has joined #archiveteam-bs
16:49 🔗 dashcloud has joined #archiveteam-bs
16:49 🔗 SadDM has joined #archiveteam-bs
17:01 🔗 SimpBrain has joined #archiveteam-bs
17:02 🔗 Stiletto has quit IRC (Ping timeout: 240 seconds)
17:07 🔗 zenguy_pc has quit IRC (Read error: Connection reset by peer)
17:08 🔗 zenguy_pc has joined #archiveteam-bs
17:11 🔗 schbirid http://firstmonday.org/ojs/index.php/fm/article/view/5619/4653
17:11 🔗 schbirid The Twitter Archive at the Library of Congress: Challenges for information practice and information policy
17:11 🔗 schbirid Michael Zimmer
17:45 🔗 vitzli has joined #archiveteam-bs
17:47 🔗 SketchCow The Twitter Archive at the Library of Congress: We will never ever ever ever ever ever make it public
17:48 🔗 xmc ^
17:48 🔗 xmc neverrrr
18:32 🔗 godane has joined #archiveteam-bs
18:37 🔗 phillipsj godane: looks like i3rZmnJ66Po is blocked in Canada too.
18:40 🔗 phillipsj "More than just the 140-character plain text that a user types into the Twitter interface, each tweet contains 150 pieces of metadata, such as a unique numerical ID, a timestamp, a location stamp, IDs for any replies, favorites and retweets that the tweet gets, the language, the date the account was created, the URL of the author if a Web site is referenced, the number of followers, and numerous other technical specifications
18:40 🔗 phillipsj (Dwoskin, 2014)."
18:41 🔗 phillipsj I am kinda shocked that the metadata exceeds the actual text by that much.
18:45 🔗 phillipsj Wait, why is indexing the information hard if all the meta-data fields are indexes? No DB cluster?
18:46 🔗 godane my wifi was acting up
18:54 🔗 vitzli has quit IRC (Quit: Leaving)
18:56 🔗 aaaaaaaaa has joined #archiveteam-bs
19:10 🔗 mistym has quit IRC (Remote host closed the connection)
19:24 🔗 mistym has joined #archiveteam-bs
19:28 🔗 Jonimus has quit IRC (Ping timeout: 370 seconds)
19:36 🔗 schbirid has quit IRC (Leaving)
19:39 🔗 schbirid has joined #archiveteam-bs
19:57 🔗 Stiletto has joined #archiveteam-bs
19:59 🔗 godane has quit IRC (Quit: Leaving.)
20:01 🔗 mistym has quit IRC (Remote host closed the connection)
20:15 🔗 mistym has joined #archiveteam-bs
20:33 🔗 SimpBrain has quit IRC (Quit: Leaving)
21:04 🔗 xmc has quit IRC (Ping timeout: 483 seconds)
21:08 🔗 xmc has joined #archiveteam-bs
21:30 🔗 ohhdemgir has quit IRC (Quit: Leaving)
21:48 🔗 bzc6p_ has joined #archiveteam-bs
21:51 🔗 bzc6p has quit IRC (Read error: Operation timed out)
21:53 🔗 Nertsy has quit IRC (Quit: Nertsy)
21:56 🔗 Nertsy has joined #archiveteam-bs
22:13 🔗 bzc6p__ has joined #archiveteam-bs
22:15 🔗 bzc6p__ has quit IRC (Client Quit)
22:18 🔗 bzc6p_ has quit IRC (Ping timeout: 600 seconds)
23:17 🔗 godane has joined #archiveteam-bs
23:28 🔗 RedType has quit IRC (Quit: patchin')
23:31 🔗 RedType has joined #archiveteam-bs
23:34 🔗 BlueMaxim has joined #archiveteam-bs
23:40 🔗 atrocity has joined #archiveteam-bs
23:46 🔗 phiren has quit IRC (Read error: Connection reset by peer)
23:47 🔗 phiren_ has joined #archiveteam-bs
23:47 🔗 phiren_ is now known as phiren
23:47 🔗 atrocity has quit IRC ()
23:59 🔗 Stiletto has quit IRC (Ping timeout: 265 seconds)
