#archiveteam-bs 2016-02-27,Sat

Time Nickname Message
00:59 🔗 zerkalo has quit IRC (Quit: Lost terminal)
01:18 🔗 zerkalo has joined #archiveteam-bs
01:39 🔗 JesseW has joined #archiveteam-bs
02:09 🔗 Boppen has joined #archiveteam-bs
02:13 🔗 Boppen has quit IRC (Ping timeout: 200 seconds)
02:18 🔗 Boppen has joined #archiveteam-bs
02:43 🔗 Boppen has quit IRC (Ping timeout: 200 seconds)
03:03 🔗 vitzli has joined #archiveteam-bs
03:11 🔗 Boppen has joined #archiveteam-bs
03:17 🔗 Boppen has quit IRC (Ping timeout: 200 seconds)
03:28 🔗 bwn has quit IRC (Ping timeout: 252 seconds)
04:08 🔗 VADemon has quit IRC (Quit: left4dead)
04:18 🔗 vitzli has quit IRC (Leaving)
04:47 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
04:52 🔗 JetBalsa has quit IRC (Quit: - nbs-irc 2.39 - www.nbs-irc.net -)
05:24 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
05:31 🔗 Sk1d has joined #archiveteam-bs
05:39 🔗 JesseW http://archiveteam.org/index.php?title=Internet_Archive#Technical_notes -- some notes about the IA's /history/ pages. Corrections/additions/suggestions welcomed.
05:53 🔗 Kenshin has quit IRC (Ping timeout: 252 seconds)
05:54 🔗 Kenshin has joined #archiveteam-bs
06:52 🔗 JesseW !a http://ilovemyneurons.tumblr.com/ --ignore-sets=singletumblr
06:55 🔗 JesseW !a http://incompatibletype.tumblr.com --ignore-sets=singletumblr
06:57 🔗 bwn has joined #archiveteam-bs
06:58 🔗 yipdw JesseW: uh
06:59 🔗 JesseW ?
07:01 🔗 JesseW arghghghghg wrong channel.
07:01 🔗 * JesseW goes to fix this
07:11 🔗 Chorca has quit IRC (Ping timeout: 260 seconds)
07:11 🔗 Chorca has joined #archiveteam-bs
07:17 🔗 JesseW has quit IRC (Quit: Leaving.)
07:17 🔗 JesseW has joined #archiveteam-bs
07:31 🔗 JesseW has quit IRC (Quit: Leaving.)
07:41 🔗 wp494_ has joined #archiveteam-bs
07:42 🔗 logan2 has joined #archiveteam-bs
07:45 🔗 Stilett0 has joined #archiveteam-bs
07:45 🔗 Coderjoe_ has joined #archiveteam-bs
07:46 🔗 wp494 has quit IRC (Ping timeout: 499 seconds)
07:46 🔗 Stiletto has quit IRC (Read error: Operation timed out)
07:47 🔗 logan has quit IRC (Read error: Operation timed out)
07:48 🔗 Coderjoe has quit IRC (Ping timeout: 499 seconds)
07:51 🔗 vitzli has joined #archiveteam-bs
08:18 🔗 logan2 has quit IRC (Read error: Operation timed out)
08:19 🔗 logan has joined #archiveteam-bs
08:35 🔗 vitzli has quit IRC (Leaving)
08:47 🔗 bwn has quit IRC (Ping timeout: 252 seconds)
08:54 🔗 wp494_ is now known as wp494
08:56 🔗 bwn has joined #archiveteam-bs
09:02 🔗 metalcamp has joined #archiveteam-bs
09:03 🔗 MrRadar !w ebr1360k5v2qyy8ssjowcbm0r
09:22 🔗 jut has joined #archiveteam-bs
09:51 🔗 bwn has quit IRC (Ping timeout: 633 seconds)
10:34 🔗 Sk2d has joined #archiveteam-bs
10:39 🔗 Sk1d has quit IRC (hub.se irc.du.se)
10:54 🔗 Sk2d is now known as Sk1d
10:54 🔗 Boppen has joined #archiveteam-bs
10:56 🔗 schbirid has joined #archiveteam-bs
11:04 🔗 RichardG has quit IRC (Ping timeout: 250 seconds)
11:05 🔗 RichardG has joined #archiveteam-bs
11:06 🔗 jut has quit IRC (Read error: Connection reset by peer)
11:09 🔗 RichardG_ has joined #archiveteam-bs
11:11 🔗 RichardG has quit IRC (Ping timeout: 362 seconds)
11:17 🔗 RichardG has joined #archiveteam-bs
11:18 🔗 RichardG_ has quit IRC (Ping timeout: 250 seconds)
11:34 🔗 signius has quit IRC (Ping timeout: 1224 seconds)
11:41 🔗 signius has joined #archiveteam-bs
12:15 🔗 vitzli has joined #archiveteam-bs
12:20 🔗 Boppen has quit IRC (Ping timeout: 200 seconds)
12:41 🔗 zino has quit IRC (Quit: Leaving)
12:49 🔗 godane http://arstechnica.com/the-multiverse/2016/02/you-can-now-read-the-entirety-of-sci-fi-magazine-if-for-free/
14:10 🔗 logan has quit IRC (Read error: Operation timed out)
14:11 🔗 VADemon has joined #archiveteam-bs
14:11 🔗 logan has joined #archiveteam-bs
14:28 🔗 dashcloud has quit IRC (Read error: Operation timed out)
14:32 🔗 dashcloud has joined #archiveteam-bs
14:36 🔗 dashcloud has quit IRC (Read error: Operation timed out)
14:40 🔗 dashcloud has joined #archiveteam-bs
14:42 🔗 joepie91 http://www.kennethreitz.org/essays/mentalhealtherror-an-exception-occurred
14:45 🔗 schbirid i wonder what percentage of floss projects is driven by "mentality"
15:09 🔗 Stilett0 is now known as Stiletto
16:25 🔗 schbirid has quit IRC (Remote host closed the connection)
16:51 🔗 SketchCow Poor little Kenneth
17:00 🔗 Chorca has quit IRC (Read error: Operation timed out)
17:10 🔗 vitzli has quit IRC (Leaving)
17:16 🔗 Chorca has joined #archiveteam-bs
17:24 🔗 snape "Key Takeaway: ...don't date the crazy chick"
17:48 🔗 JesseW has joined #archiveteam-bs
17:56 🔗 midas he takes lithium now, but remember kids, dont lick your cellphone batteries.
18:09 🔗 JesseW has quit IRC (Quit: Leaving.)
19:00 🔗 RichardG_ has joined #archiveteam-bs
19:01 🔗 RichardG has quit IRC (Read error: Operation timed out)
19:15 🔗 tomwsmf-a has joined #archiveteam-bs
19:45 🔗 JesseW has joined #archiveteam-bs
19:48 🔗 bwn has joined #archiveteam-bs
19:57 🔗 bwn has quit IRC (Ping timeout: 246 seconds)
19:59 🔗 Stiletto is now known as Stilett0
20:03 🔗 Stilett0 has quit IRC (Ping timeout: 250 seconds)
20:09 🔗 dashcloud has quit IRC (Read error: Operation timed out)
20:12 🔗 dashcloud has joined #archiveteam-bs
20:55 🔗 rpn has joined #archiveteam-bs
20:55 🔗 rpn has left
21:00 🔗 Simpbrai_ has joined #archiveteam-bs
21:20 🔗 bwn has joined #archiveteam-bs
21:50 🔗 superkuh has joined #archiveteam-bs
22:36 🔗 zino has joined #archiveteam-bs
22:37 🔗 Famicoma1 has quit IRC (Quit: leaving)
22:37 🔗 Famicoma1 has joined #archiveteam-bs
22:42 🔗 metalcamp has quit IRC (Ping timeout: 252 seconds)
22:46 🔗 Famicoma1 has quit IRC (Quit: leaving)
22:47 🔗 Famicoma1 has joined #archiveteam-bs
22:54 🔗 zerkalo has quit IRC (Quit: leaving)
22:57 🔗 zerkalo has joined #archiveteam-bs
23:00 🔗 SimpBrain btw now that friendsreunited is done
23:01 🔗 SimpBrain would it be worth going after a scraping project for myspace
23:01 🔗 SimpBrain or going after the beast that is yahoo groups
23:04 🔗 SimpBrain or better yet a working myspace website, is it down? https://myspace.com
23:06 🔗 snape it's up here, just ridiculously slow to load.
23:08 🔗 SimpBrain hmm must be my connection
23:11 🔗 HCross Not loading from OVH GRA
23:12 🔗 snape Hey, could be on MySpace's end. All twenty of their active users could be browsing the site at once, or something overwhelming like that...
23:12 🔗 SimpBrain eewww profile pages scroll across to the right
23:13 🔗 SimpBrain ok myspace easy grab? https://myspace.com/profilenumber
23:17 🔗 snape Isn't the vast majority of user-generated content on MySpace long, long gone? Seems like if you want to scrape a dinosaur social network, well... I can't imagine Livejournal will be around too many more years, and AFAIK most everything is still there to grab.
23:26 🔗 * SimpBrain goes diving for info
23:28 🔗 SimpBrain ok found a way
23:29 🔗 SimpBrain basically you can pull up any profile via its ext-number, e.g. http://ext-354205.livejournal.com/feed/ which, if the profile exists, returns the proper profile url: http://mudzhyri.livejournal.com/
23:30 🔗 SimpBrain so yeah, livejournal can be archived, and by user id too, plus if there was an additional scrape down the line, it could probably check whether there was anything newer than, say, the older scrape
23:31 🔗 SimpBrain if a profile was purged you'll just get an error page e.g. http://ext-3.livejournal.com/feed/
23:33 🔗 MrRadar Good find
23:34 🔗 MrRadar You should document that on the wiki page: http://archiveteam.org/index.php?title=LiveJournal
23:43 🔗 SimpBrain will do
23:44 🔗 zino Did FreeNode just disappear for anyone else? All my ancient hardware channels are over there...
23:44 🔗 HCross zino, its been up and down for me
23:45 🔗 HCross SimpBrain, I wonder if the response codes are different
23:45 🔗 zino HCross: OK, thanks.
23:45 🔗 JesseW has quit IRC (Quit: Leaving.)
23:45 🔗 HCross SimpBrain, yes they are. non-present sites give a 410, and present ones give a 200
23:46 🔗 SimpBrain what on the ext pages?
23:46 🔗 HCross https://github.com/HarryC145/Python/blob/master/SubDomainHunt.py might help
23:46 🔗 HCross http://ext-3.livejournal.com/feed/ = 410 and http://ext-354205.livejournal.com/feed/ = 200
23:51 🔗 SimpBrain might throw a hacky job on that script to throw a list together
23:51 🔗 JesseW has joined #archiveteam-bs
23:52 🔗 SimpBrain i know the highest number since that will be the profile i just created
23:52 🔗 HCross SimpBrain, it spits a list of it into present.txt
23:53 🔗 HCross I'll accept a pull request though
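
[Editor's sketch, not part of the log.] The check discussed above (request the ext-number feed URL, treat a 200 as a live profile and a 410 as a purged one) could look roughly like the following. Only the URL pattern and the two status codes come from the conversation; the helper names `feed_url`, `classify`, `fetch_status`, and `probe` are hypothetical, and this is not the code in HCross's SubDomainHunt.py. Any status other than 200 or 410 is labeled "unknown" rather than guessed at.

```python
import urllib.error
import urllib.request

# URL pattern quoted in the log; every name below is a hypothetical helper.
FEED_URL = "http://ext-{n}.livejournal.com/feed/"


def feed_url(n: int) -> str:
    """Build the feed URL for a numeric ext id."""
    return FEED_URL.format(n=n)


def classify(status: int) -> str:
    """Map an HTTP status to a label, per the 200/410 observation in the log."""
    if status == 200:
        return "present"
    if status == 410:
        return "purged"
    return "unknown"  # assumption: anything else is left undecided


def fetch_status(url: str, timeout: float = 10.0) -> int:
    """Return the final HTTP status for url (urllib follows redirects)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as exceptions; the code is on the error.
        return err.code


def probe(ids, get_status=fetch_status):
    """Yield (id, label) pairs; get_status is injectable so this runs offline."""
    for n in ids:
        yield n, classify(get_status(feed_url(n)))


# Offline demo using only the two status codes reported in the log.
reported = {feed_url(3): 410, feed_url(354205): 200}
results = list(probe([3, 354205], get_status=reported.__getitem__))
```

Calling `probe(range(1, highest_id + 1))` with the default `fetch_status` would hit the live site, so a real run would presumably want rate limiting and retry handling, which this sketch leaves out.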
