#archiveteam 2014-08-14,Thu

↑back Search

Time Nickname Message
00:48 🔗 SketchCow What.
03:35 🔗 VonGuard heya SketchCow i wanna chat about habitat when you get time
03:35 🔗 VonGuard we're probably gonna want to bring you in on it
05:58 🔗 SketchCow VonGuard: You have the most adorable view of how IRC works
06:10 🔗 yipdw SketchCow: when you have time, I'd love to hear your SAA14 talk
06:10 🔗 yipdw the twitters have made me interested
06:13 🔗 VonGuard lol
06:13 🔗 VonGuard SketchCow, i just figure irc is low priority
06:13 🔗 VonGuard which is where i wanna be with this request
06:14 🔗 VonGuard so, do you know about habitat and our work on it? ]
07:04 🔗 SketchCow No.
07:43 🔗 midas 43 minutes later, still no idea what habitat is or the work that has been done on it tho :)
07:46 🔗 Nemo_bis more like 4 hours
07:54 🔗 midas I once had a habitat
07:54 🔗 midas but then the girbils died
11:08 🔗 Arkiver2 Is someone here able to scrape google, yahoo and/or bing results?
11:08 🔗 Arkiver2 I updates these wiki pages: http://archiveteam.org/index.php?title=Swipnet
11:09 🔗 Arkiver2 http://archiveteam.org/index.php?title=Verizon_Personal_Web_Space
11:09 🔗 Arkiver2 Created scripts and got list of sites from wayback machine, scripts will be checked by chfoo when there is more time
11:09 🔗 Arkiver2 if someone is able to scrape results from google, yahoo and/or bing, please let me know
11:15 🔗 Nemo_bis Arkiver2: the only experience i had is with https://gist.github.com/nemobis/7718061
11:17 🔗 Arkiver2 Nemo_bis: would you be able to do that same thing on google with the keywords with the keyword: site:home.swipnet.se ?
11:18 🔗 Arkiver2 is it possible then to get all the 100.000's of results?
11:21 🔗 Nemo_bis no
11:21 🔗 Nemo_bis dunno
11:36 🔗 tephra Arkiver2: https://archive.org/details/home.swipnet.se-w-20001-to-26000-20140726, https://archive.org/editxml/home.swipnet.se-w-26000-to-30000-20140726-00000.warc, https://archive.org/details/home.swipnet.se-w-30001-to-39999-20140726
11:36 🔗 Arkiver2 tephra: thank you!
11:37 🔗 Arkiver2 I'll exclude those items from the list later
11:37 🔗 Arkiver2 but
11:37 🔗 tephra Arkiver2: first uploads of the ~w-[0-9] users, more to come soon. Will upload more when i get back later and start with google also
11:37 🔗 Arkiver2 turns out it is not 5 digits
11:37 🔗 Arkiver2 there are 6 digits
11:37 🔗 Arkiver2 so it's likely going to be a warrior project
11:37 🔗 Arkiver2 since there are 6 digits
11:38 🔗 Arkiver2 there is ~w- as well as -w-
11:40 🔗 tephra hmm crap! :P Well I have 26000-99999
11:40 🔗 Arkiver2 you got all 26000-99999 already? awesome!
11:40 🔗 Arkiver2 please keep me informed and when they are uploaded I'll delete them from the list of items
11:42 🔗 tephra yes
11:44 🔗 jjonas good day:) silly questions maybe but does archive.org actually save any POST answers / does any CDN cache any?
11:46 🔗 godane tephira: the name should have been https://archive.org/details/home.swipnet.se-w-26000-to-30000-20140726
11:46 🔗 godane not
11:46 🔗 godane https://archive.org/details/home.swipnet.se-w-26000-to-30000-20140726-00000.warc
11:47 🔗 godane *tephra
11:47 🔗 godane ^
12:01 🔗 Cameron_D http://hulkfile.eu/
12:01 🔗 Cameron_D http://torrentfreak.com/hulkfile-shuts-down-following-expendables-3-lawsuit-140813/
12:01 🔗 Cameron_D "The company informs TorrentFreak that it has disabled access to all visitors from the United States and that it intends to shut down globally during the coming days. "
12:03 🔗 midas https://www.google.nl/search?q=site%3Ahulkfile.eu&oq=site%3Ahulkfile.eu&aqs=chrome..69i57j69i58.4535j0j7&sourceid=chrome&es_sm=93&ie=UTF-8
12:03 🔗 midas well, that sucked
12:03 🔗 Cameron_D I can still access it here, how do I make a warc?
12:04 🔗 midas you mean the site itself?
12:04 🔗 Cameron_D actually, half the assets don't load
12:04 🔗 Cameron_D and there isn't any kind of message
12:04 🔗 Cameron_D so there is probably already an old version of the frontpage
12:04 🔗 midas archivebot can do it, just need a europe pipeline
12:10 🔗 Arkiver2 midas: http://www.chillingeffects.org/notice.cgi?sID=1301208 is still available
12:10 🔗 Arkiver2 contains a hulkfile link
12:10 🔗 Arkiver2 what about trying to discover as any hulkfile liks as possible and warc those?
12:14 🔗 Arkiver2 tried some filesearchers, none search hulkfile also
12:17 🔗 Cameron_D anything listed in chillingeffects documents will already by gone
13:09 🔗 Smiley https://i.imgur.com/TuKkqnV.jpg
13:15 🔗 midas +1 factual
13:28 🔗 dserodio how do I get voice on #archivebot ?
13:35 🔗 dserodio I want to help with the archiving of Ferguson news
13:58 🔗 rekenner WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD
13:58 🔗 rekenner ... I feel like a bot should have responded by now. Or I missed the joke in that test. <_<
13:58 🔗 turnip yahoosucks
13:59 🔗 rekenner thanks.
13:59 🔗 turnip am bot do not understand wordy words
15:52 🔗 tephra godane: yes i know but could not figure out how to change after, sorry
15:52 🔗 godane ok
17:26 🔗 VonGuard ok sketchcow, well, we should chat on the phone or in some chat window somewhere about Habitat
17:26 🔗 VonGuard We're bringing it back
17:26 🔗 VonGuard and it's a freaking nutty big project
17:26 🔗 VonGuard that requires us to also bring back Qlink, so any help finding Qlink people would be very much appreciated
17:37 🔗 xmc habitat the video game?
17:37 🔗 xmc neat
18:14 🔗 VonGuard yeah the c64 mmo
18:28 🔗 ohhdemgir what is verizon taking down? personal web space, what kinda files?
19:49 🔗 balrog ohhdemgir: personal web space
19:50 🔗 balrog members.bellatlantic.net
19:51 🔗 balrog aka mysite.verizon.net
19:51 🔗 balrog they're both the same
19:52 🔗 balrog but the name "bell atlantic" should tell you just how old it is ;)
19:52 🔗 Arkiver2 small sites of users
19:52 🔗 Arkiver2 I'm working on the making it a warrior
19:52 🔗 balrog nice
19:52 🔗 balrog 10mb max per site
19:52 🔗 balrog so not big at all
19:52 🔗 balrog should be a blip compared to twitch
19:52 🔗 Arkiver2 if you would like to help
19:52 🔗 Arkiver2 http://archiveteam.org/index.php?title=Swipnet
19:52 🔗 Arkiver2 take a look at progress
19:53 🔗 Arkiver2 we still need lists from yahoo
19:53 🔗 balrog oh?
19:53 🔗 Arkiver2 for the keyword site:home.swipnet.se
19:53 🔗 Arkiver2 The fifth list will be taken from Bing Search results with the keyword: site:home.swipnet.se. (User:Mithrandir is downloading a list of links retrieved from Bing.)
19:53 🔗 Arkiver2 The fourth list will be taken from Yahoo Search results with the keyword: site:home.swipnet.se.
19:53 🔗 Arkiver2 The third list will be taken from Google Search results with the keyword: site:home.swipnet.se. (Someone was planning this already?)
19:54 🔗 Arkiver2 google is being worked on as far as I know, bing is being taken care of
19:54 🔗 Arkiver2 just yahoo
19:54 🔗 Arkiver2 I thought it was good to make a progress list so people could actually see what they can do for it
19:55 🔗 Arkiver2 other then running a warrior and see where the project stands right now
21:57 🔗 tephra google script is running, had to resort to using selinium 'cause google kept blocking me. Hopefully this will work better
22:20 🔗 Arkiver2 tephra: thank you! I'm going now, please keep me informed

irclogger-viewer