#archiveteam-bs 2016-08-18,Thu

Time Nickname Message
00:22 🔗 howdoicom has quit IRC (Quit: Page closed)
00:50 🔗 Petri152 has joined #archiveteam-bs
00:59 🔗 JesseW has joined #archiveteam-bs
01:21 🔗 tomwsmf has quit IRC (Read error: Operation timed out)
01:26 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
01:50 🔗 JesseW has joined #archiveteam-bs
02:01 🔗 Honno has joined #archiveteam-bs
02:40 🔗 aschmitz HCross: ping me in a couple weeks about newsgrab, I may have a host if you can run in a container, but am waaaay too busy right now. :(
02:51 🔗 tuankiet6 has joined #archiveteam-bs
02:52 🔗 tuankiet6 is now known as tuankiet
03:04 🔗 RichardG has quit IRC (Read error: Connection reset by peer)
03:04 🔗 RichardG has joined #archiveteam-bs
03:12 🔗 RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue)
03:15 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
03:39 🔗 RichardG has joined #archiveteam-bs
03:50 🔗 mutoso has quit IRC (Ping timeout: 260 seconds)
03:58 🔗 mutoso has joined #archiveteam-bs
04:16 🔗 JesseW has joined #archiveteam-bs
04:22 🔗 Sk1d has quit IRC (Ping timeout: 194 seconds)
04:30 🔗 Sk1d has joined #archiveteam-bs
04:42 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
05:02 🔗 JesseW has joined #archiveteam-bs
05:31 🔗 RichardG has quit IRC (Read error: Operation timed out)
05:31 🔗 RichardG has joined #archiveteam-bs
05:54 🔗 dan- has quit IRC (Ping timeout: 633 seconds)
05:57 🔗 dan- has joined #archiveteam-bs
06:25 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
06:32 🔗 aschmitz has quit IRC (Read error: Operation timed out)
06:39 🔗 aschmitz has joined #archiveteam-bs
07:11 🔗 RichardG has quit IRC (Ping timeout: 255 seconds)
07:25 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
07:26 🔗 BlueMaxim has joined #archiveteam-bs
08:02 🔗 kristian_ has joined #archiveteam-bs
08:33 🔗 fie has joined #archiveteam-bs
08:38 🔗 RichardG has joined #archiveteam-bs
08:47 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
08:48 🔗 BlueMaxim has joined #archiveteam-bs
09:32 🔗 whydomain has joined #archiveteam-bs
09:37 🔗 whydomain Not really on topic, but this popped up in the vintage mac forums earlier.
09:37 🔗 whydomain From the ad: "The lease is up at the end of the month, and all of my Macintosh stuff has to go! ... I must liquidate this collection (which is probably one of the biggest in the country), or it goes to the recycler!"
09:37 🔗 whydomain Ad: http://denver.craigslist.org/sys/5732303316.html
09:38 🔗 whydomain Sadly I'm 4000+ miles away. Anyone want to save some hardware for cheap?
09:41 🔗 SketchCow Ignore it
09:55 🔗 whydomain has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client)
10:23 🔗 kristian_ has quit IRC (Leaving)
10:59 🔗 wp494 has quit IRC (Read error: Connection reset by peer)
11:41 🔗 fie_ has joined #archiveteam-bs
11:43 🔗 fie has quit IRC (Read error: Operation timed out)
12:07 🔗 Atluxity wow, he had a lot of crap
12:39 🔗 BlueMaxim has quit IRC (Quit: Leaving)
12:43 🔗 vitzli has joined #archiveteam-bs
13:05 🔗 BartoCH has quit IRC (Ping timeout: 260 seconds)
13:11 🔗 BartoCH has joined #archiveteam-bs
14:18 🔗 wp494 has joined #archiveteam-bs
14:30 🔗 tuankiet has quit IRC (Ping timeout: 246 seconds)
14:38 🔗 tuankiet6 has joined #archiveteam-bs
15:51 🔗 vitzli has quit IRC (Quit: Leaving)
16:17 🔗 tomwsmf has joined #archiveteam-bs
17:28 🔗 kristian_ has joined #archiveteam-bs
17:51 🔗 schbirid has joined #archiveteam-bs
18:02 🔗 schbirid lol
18:02 🔗 schbirid [ 989.029486] sd 8:0:0:0: [sdg] Very big device. Trying to use READ CAPACITY(16).
18:04 🔗 gfscott has joined #archiveteam-bs
18:06 🔗 godane i'm going after gawker.com sitemap for 2016-06 and 2016-07
18:07 🔗 schbirid i will byte because i am actually interested, how is cc0 not a license?
18:07 🔗 schbirid byte :D
18:17 🔗 godane uploaded: https://archive.org/details/gawker.com-sitemap-2016-01-20160627
18:31 🔗 GE has joined #archiveteam-bs
18:31 🔗 phuzion This is the offtopic channel so other channels like #archivebot and #archiveteam can stay clutter-free for important discussion.
18:32 🔗 kristian_ has quit IRC (Leaving)
18:50 🔗 riordan has joined #archiveteam-bs
19:09 🔗 riordan has quit IRC (riordan)
19:10 🔗 riordan_ has joined #archiveteam-bs
19:12 🔗 godane so gawker.com sitemaps urls are uploaded
19:17 🔗 riordan_ has quit IRC (Read error: Operation timed out)
19:21 🔗 arkiver godane: is gawker.com 100% in the wayback machine now?
19:21 🔗 arkiver or well, less than 100% of course
19:21 🔗 arkiver but you get what I mean
19:22 🔗 godane https://archive.org/details/@chris85?and[]=gawker.com&sort=-publicdate
19:23 🔗 godane that should be very close to 100%
19:28 🔗 riordan has joined #archiveteam-bs
19:36 🔗 joepie91 schbirid: it even says "license" in the legal doc: https://creativecommons.org/publicdomain/zero/1.0/legalcode
19:36 🔗 schbirid jepp
19:36 🔗 joepie91 it just prefers a public domain dedication
19:36 🔗 yipdw wow nice https://gitlab.peach-bun.com/snippets/28
19:42 🔗 yipdw actually that's probably not a fair question, because that gcc package includes all the frontends too
19:52 🔗 godane what i have of kotaku.com so far: https://archive.org/details/@chris85?and[]=kotaku.com&sort=-publicdate&and[]=subject%3A%22archiveteam%22
19:56 🔗 riordan has quit IRC (riordan)
19:57 🔗 riordan has joined #archiveteam-bs
19:58 🔗 riordan_ has joined #archiveteam-bs
19:58 🔗 riordan has quit IRC (Read error: Operation timed out)
20:11 🔗 riordan_ has quit IRC (Ping timeout: 633 seconds)
20:25 🔗 schbirid has quit IRC (Quit: Leaving)
20:30 🔗 Start godane: you might want to go after this one if you haven't already: http://thecuck.gawker.com/sitemap.xml
20:31 🔗 Start it's...controversial to say the least
20:32 🔗 schbirid has joined #archiveteam-bs
20:37 🔗 arrith has joined #archiveteam-bs
20:53 🔗 Selavi hey folks, new to this scene. I'm thinking about getting a mirror of http://exoteric.roach.org/ that has a bunch of fractal wallpapers, and the site owner has http basic auth on the images and download quota. so I'm thinking about emailing the guy and requesting a rip. is this reasonable?
20:57 🔗 yipdw yes
20:58 🔗 ErkDog well yipdw
20:58 🔗 yipdw site owners can usually be reasonable about that
20:58 🔗 ErkDog http://exoteric.roach.org/terms.php
20:58 🔗 yipdw yes
20:58 🔗 yipdw even more reason to ask
20:58 🔗 Selavi right
20:59 🔗 ErkDog I don't see him mentioning quotas, and the site is probably incredibly old and never updated if he thinks he needed to do that for a few image files, lol
20:59 🔗 Selavi yeah, his last news was from 2 years ago, and he hasn't added images in ages
21:01 🔗 Selavi here at the bottom it says "This site employs software to limit the maximum amount of data any one person can download from the site in a given period of time" http://exoteric.roach.org/faq.html
21:04 🔗 yipdw even more reason to ask
21:04 🔗 yipdw archiveteam has access to a lot of IP space and a lot of bandwidth but you've heard that thing about with great power comes great responsibility
21:05 🔗 Selavi yeah, I would rather it be a win-win
21:06 🔗 yipdw indeed
21:16 🔗 GE Somebody should try to rip World Of Spectrum if that's possible
21:16 🔗 GE http://www.worldofspectrum.org/archivenote.html Just something to ntoe
21:16 🔗 GE note*
21:23 🔗 ErkDog hey Selavi
21:23 🔗 ErkDog look what I found
21:23 🔗 ErkDog http://exoteric.roach.org/bg/dl/torrents.html
21:23 🔗 Selavi click them :p
21:23 🔗 ErkDog boner
21:25 🔗 Selavi there's a couple torrents on TPB, but naturally they're deader than dead
21:36 🔗 Start godane: here's a complete list of every gawker subdomain, scraped from google: https://gist.githubusercontent.com/PressStartandSelect/ca04f19ddd856bd53c36b892265d81d7/raw/f7ea7f08dc165176291ceca9e7363ff83219ac1b/gawker%2520subdomains
22:17 🔗 gfscott has quit IRC (gfscott)
22:26 🔗 Stiletto has quit IRC (Ping timeout: 246 seconds)
22:29 🔗 godane Start: i'm going through that now
22:31 🔗 godane looks like antiviral.gawker.com has only 533 urls
22:38 🔗 BlueMaxim has joined #archiveteam-bs
22:40 🔗 godane btw june and july of kotaku.com and io9.gizmodo.com sitemap urls are being uploaded
22:49 🔗 godane https://archive.org/details/antiviral.gawker.com-sitemap-2014-20160818
23:05 🔗 godane there were 2 http://gawker.com urls in the defamer.gawker.com grabbing
23:05 🔗 godane i'm not uploading those pages cause they're both saved in wayback machine
23:06 🔗 godane mostly cause its one random url in 2005 sitemap and 2006 sitemap for defamer.gawker.com
23:07 🔗 godane the real meat of defamer.gawker.com starts in 2013 sitemap
23:07 🔗 godane here are the urls below:
23:07 🔗 joepie91 PSA: this search engine lets you get results in JSON: https://searx.me/
23:07 🔗 godane https://web.archive.org/web/*/http://gawker.com/157998/meg-ryans-baby-to-be-afflicted-with-identity-issues
23:07 🔗 joepie91 we might want this for discovery projects
23:07 🔗 joepie91 cc arkiver
23:07 🔗 godane https://web.archive.org/web/*/http://gawker.com/033069/lindsay-lohan-now-in-doll-form
23:09 🔗 Honno has quit IRC (Read error: Operation timed out)
23:26 🔗 godane blackbag.gawker.com is saved: https://archive.org/search.php?query=creator%3A%22blackbag.gawker.com%22
23:40 🔗 GE has quit IRC (Quit: zzz)
23:43 🔗 Stiletto has joined #archiveteam-bs