Time |
Nickname |
Message |
00:01
🔗
|
|
Ravenloft has quit IRC (Ping timeout: 378 seconds) |
00:15
🔗
|
godane |
so i set up an old computer for my brother for his business/office room |
00:15
🔗
|
godane |
fun fact: i can only get internet from wireless |
00:16
🔗
|
godane |
the wired line to the wireless router was not working |
00:16
🔗
|
|
cbb2 has joined #archiveteam |
00:19
🔗
|
|
cbb has quit IRC (Ping timeout: 633 seconds) |
00:33
🔗
|
|
cbb2 has quit IRC (Quit: Nettalk6 - www.ntalk.de) |
00:36
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
00:40
🔗
|
|
ruukasu has quit IRC (Quit: WeeChat 1.0.1) |
00:40
🔗
|
|
ruukasu has joined #archiveteam |
00:49
🔗
|
|
wp494 has joined #archiveteam |
00:49
🔗
|
|
signius has quit IRC (Ping timeout: 480 seconds) |
00:59
🔗
|
|
signius has joined #archiveteam |
01:04
🔗
|
vertice32 |
the archivebot run of my site oasisjournals.com seems to be pulling in a lot of things not from my domain |
01:04
🔗
|
vertice32 |
is that expected? |
01:04
🔗
|
vertice32 |
job 9tchq93k6q83xjzok0rul5a8a |
01:32
🔗
|
aaaaaaaaa |
Yes, that is the expected behavior. |
01:44
🔗
|
|
vertice32 has quit IRC (Remote host closed the connection) |
01:44
🔗
|
|
primus104 has quit IRC (Leaving.) |
02:01
🔗
|
|
philpem has quit IRC (Ping timeout: 272 seconds) |
02:05
🔗
|
|
ATZ0 has joined #archiveteam |
02:27
🔗
|
|
schbirid2 has joined #archiveteam |
02:30
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
02:32
🔗
|
|
JonimusP is now known as Jonimus |
03:01
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
03:19
🔗
|
|
Jonimus is now known as JonimusP |
03:19
🔗
|
|
JonimusP is now known as Jonimus |
03:21
🔗
|
|
Jonimus is now known as JonimusP |
03:23
🔗
|
|
JonimusP is now known as Jonimus |
03:31
🔗
|
|
Aranje has quit IRC (Read error: Connection reset by peer) |
03:31
🔗
|
|
Aranje has joined #archiveteam |
03:48
🔗
|
|
kris33 has quit IRC (Textual IRC Client: www.textualapp.com) |
03:48
🔗
|
|
Ymgve has quit IRC () |
04:48
🔗
|
|
aaaaaaaaa has quit IRC (Leaving) |
05:11
🔗
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
05:13
🔗
|
|
Lord_Nigh has joined #archiveteam |
05:14
🔗
|
|
balrog sets mode: +o Lord_Nigh |
05:58
🔗
|
|
mistym has joined #archiveteam |
07:58
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
09:21
🔗
|
|
primus104 has joined #archiveteam |
09:59
🔗
|
|
kris33 has joined #archiveteam |
10:04
🔗
|
|
primus104 has quit IRC (Ping timeout: 1757 seconds) |
11:21
🔗
|
|
ruukasu has quit IRC (Quit: WeeChat 1.0.1) |
11:46
🔗
|
w0rp |
http://rt.com/usa/207443-kim-dotcom-megaupload-fugitives/ mega.co.nz is probably going to explode before long. |
11:54
🔗
|
|
kris33 has quit IRC (Read error: Connection reset by peer) |
11:56
🔗
|
|
kris33 has joined #archiveteam |
12:13
🔗
|
|
kris33 has quit IRC (Ping timeout: 512 seconds) |
12:13
🔗
|
|
kris33 has joined #archiveteam |
12:22
🔗
|
|
kris33 has quit IRC (Textual IRC Client: www.textualapp.com) |
13:04
🔗
|
|
Ymgve has joined #archiveteam |
13:19
🔗
|
|
GLaDOS has quit IRC (Ping timeout: 272 seconds) |
13:19
🔗
|
|
GLaDOS has joined #archiveteam |
13:24
🔗
|
|
ruukasu has joined #archiveteam |
13:31
🔗
|
|
w0rp has quit IRC (Remote host closed the connection) |
13:33
🔗
|
|
w0rp has joined #archiveteam |
14:05
🔗
|
|
dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) |
14:05
🔗
|
|
ruukasu has quit IRC (Quit: WeeChat 1.0.1) |
14:06
🔗
|
|
ruukasu has joined #archiveteam |
14:07
🔗
|
|
dashcloud has joined #archiveteam |
14:10
🔗
|
|
sankin has joined #archiveteam |
14:14
🔗
|
|
philpem has joined #archiveteam |
14:35
🔗
|
|
K4k has joined #archiveteam |
14:39
🔗
|
|
Boppen has quit IRC (Read error: Connection reset by peer) |
14:40
🔗
|
|
Boppen has joined #archiveteam |
14:52
🔗
|
|
Aranje has quit IRC (Quit: Three sheets to the wind) |
15:33
🔗
|
|
xk_id has quit IRC (Ping timeout: 852 seconds) |
15:42
🔗
|
|
mistym has joined #archiveteam |
15:48
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
16:05
🔗
|
|
mistym has joined #archiveteam |
16:26
🔗
|
DFJustin |
sounds like it's just about whether the US can keep assets they already seized? not sure how that would kill mega |
16:40
🔗
|
|
K4k has quit IRC (Read error: Operation timed out) |
16:41
🔗
|
|
MMovie has joined #archiveteam |
16:43
🔗
|
|
MMovie1 has quit IRC (Read error: Operation timed out) |
16:48
🔗
|
parsons_ |
howdy everyone -- I mentioned this a little while back, but we're sunsetting meetup.com/everywhere on Dec. 1st. Anything I can do to assist in archiving? |
16:49
🔗
|
DFJustin |
http://www.forbes.com/sites/parmyolson/2014/11/20/the-largest-cyber-attack-in-history-has-been-hitting-hong-kong-sites/ |
16:53
🔗
|
|
parsons_ is now known as parsons |
16:57
🔗
|
Lord_Nigh |
w0rp: if dotcom is smart the mega stuff is not under his name or purview, he's just an employee and it should be unseizable |
16:57
🔗
|
|
ruukasu has quit IRC (Quit: WeeChat 1.0.1) |
16:58
🔗
|
Lord_Nigh |
also i see http://dndtools.eu/ got cease-and-desisted to death |
17:03
🔗
|
Lord_Nigh |
https://dl.dropboxusercontent.com/s/8uwuvhbg8cc7y5m/dnd.zip?dl=1&token_hash=AAGEpJ6AE0ROuCSuoWThKPfpCHQ_Wuvfg_t8cNCtfKAOdg was (and is still up) a copy of their backend sqlite db |
17:03
🔗
|
|
bebzol has quit IRC (Ping timeout: 480 seconds) |
17:07
🔗
|
Lord_Nigh |
so i guess between that and the django application source (which is on github) a good portion of the site could be rebuilt? |
17:10
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
17:11
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
17:12
🔗
|
|
dashcloud has joined #archiveteam |
17:20
🔗
|
|
signius has quit IRC (Ping timeout: 480 seconds) |
17:23
🔗
|
|
GLaDOS has quit IRC (Ping timeout: 272 seconds) |
17:23
🔗
|
|
GLaDOS has joined #archiveteam |
17:26
🔗
|
|
ruukasu has joined #archiveteam |
17:28
🔗
|
|
godane has quit IRC (Read error: Operation timed out) |
17:29
🔗
|
|
signius has joined #archiveteam |
17:37
🔗
|
|
ruukasu has quit IRC (Quit: WeeChat 1.0.1) |
17:52
🔗
|
|
ruukasu has joined #archiveteam |
17:53
🔗
|
|
philpem has quit IRC (Ping timeout: 272 seconds) |
18:12
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
18:13
🔗
|
|
dashcloud has joined #archiveteam |
18:15
🔗
|
SadDM |
Lord_Nigh: Boo on WotC! |
18:16
🔗
|
|
mistym has joined #archiveteam |
18:25
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
18:25
🔗
|
|
dashcloud has joined #archiveteam |
18:34
🔗
|
|
dashcloud has quit IRC (Read error: Connection reset by peer) |
18:40
🔗
|
|
philpem has joined #archiveteam |
18:43
🔗
|
joepie91 |
huh |
18:43
🔗
|
joepie91 |
[17:48] <parsons_> howdy everyone -- I mentioned this a little while back, but we're sunsetting meetup.com/everywhere on Dec. 1st. Anything I can do to assist in archiving? |
18:43
🔗
|
joepie91 |
did anybody respond to that? |
18:43
🔗
|
parsons |
no |
18:46
🔗
|
joepie91 |
oh, you're still here without underscore :P |
18:46
🔗
|
joepie91 |
haven't been paying attention - what exactly is going on with meetup? |
18:46
🔗
|
joepie91 |
oh, there's a footer, sec |
18:48
🔗
|
joepie91 |
yipdw: ivan`: archivebot'able? |
18:48
🔗
|
joepie91 |
chfoo: perhaps |
18:48
🔗
|
yipdw |
meetup.com |
18:48
🔗
|
yipdw |
uh |
18:48
🔗
|
joepie91 |
chfoo perhaps * |
18:48
🔗
|
yipdw |
I don't think so |
18:48
🔗
|
joepie91 |
yipdw: specifically /everywhere |
18:48
🔗
|
yipdw |
oh |
18:48
🔗
|
joepie91 |
see above |
18:48
🔗
|
parsons |
yeah, not the whole site |
18:49
🔗
|
parsons |
/everywhere is relatively small |
18:49
🔗
|
yipdw |
oh that's much more doable |
18:49
🔗
|
yipdw |
yeah, you can grab /everywhere |
18:49
🔗
|
yipdw |
provided that the rest of the site is going to stay around |
18:49
🔗
|
|
dashcloud has joined #archiveteam |
18:49
🔗
|
parsons |
yeah, rest of the site will stay around |
18:50
🔗
|
parsons |
I don't even think we store any photos for everywhere groups/events |
18:50
🔗
|
yipdw |
one thing to be aware of is that archivebot won't grab the individual meetups because they're not under /everywhere/, but so long as those URLs stay stable it's ok |
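[editor's sketch] A rough illustration of the scoping point yipdw makes here, assuming recursion is limited to URLs on the same host whose path sits under the starting URL's directory. This is a simplified model for illustration only, not ArchiveBot's actual code; the function name `in_scope` is hypothetical.

```python
from urllib.parse import urlparse


def in_scope(seed: str, candidate: str) -> bool:
    """Return True if candidate sits under the seed URL's directory.

    Assumption: the crawl only recurses into same-host URLs whose path
    starts with the seed's directory path (a "no parent" style rule).
    """
    s, c = urlparse(seed), urlparse(candidate)
    # Treat a seed ending in "/" as a directory; otherwise use its parent.
    seed_dir = s.path if s.path.endswith("/") else s.path.rsplit("/", 1)[0] + "/"
    return c.netloc == s.netloc and c.path.startswith(seed_dir)


seed = "http://www.meetup.com/everywhere/"
print(in_scope(seed, "http://www.meetup.com/everywhere/about/"))     # under the seed
print(in_scope(seed, "http://www.meetup.com/Coursera/Toronto-CA/"))  # outside the seed
```

Under that assumption, a job started at /everywhere/ would skip the group pages themselves, which is why they need to be queued separately.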
18:50
🔗
|
yipdw |
also meetup.com/Evernote/ is consistently crashing chromium on my system, lol |
18:51
🔗
|
parsons |
oh dear |
18:51
🔗
|
parsons |
well, meetup.com/Evernote/ or meetup.com/Coursera will cease to exist |
18:52
🔗
|
joepie91 |
parsons: am I correct in that you can't tell from the URL alone whether something is Meetup Everywhere or regular Meetup? |
18:52
🔗
|
yipdw |
actually they're all crashing chromium for me, so this is probably more of a "my system is fucked" problem |
18:52
🔗
|
parsons |
that is correct |
18:52
🔗
|
joepie91 |
hrm, that's tricky |
18:52
🔗
|
parsons |
i wouldn't rule out some problem on our end |
18:52
🔗
|
Zebranky_ |
parsons: BTW, any plans to add export functionality for non-everywhere groups? I have one that's been idle for a while, but I've kept it for now because I haven't gotten around to scraping everything |
18:52
🔗
|
parsons |
they are all linked to from /everywhere |
18:52
🔗
|
|
Zebranky_ is now known as Zebranky |
18:53
🔗
|
yipdw |
so, here's one thing we can do |
18:53
🔗
|
joepie91 |
parsons: the /everywhere groups - are they just the ones listed on /everywhere or is there a "view more" link somewhere? |
18:53
🔗
|
yipdw |
1: !a http://www.meetup.com/everywhere/ --no-offsite-links |
18:53
🔗
|
yipdw |
2: parse out references to meetups |
18:53
🔗
|
yipdw |
3: fetch (2) |
18:54
🔗
|
yipdw |
if /everywhere/ is really that small (1) should not take much time |
18:54
🔗
|
joepie91 |
wouldnt a quick curl | grep > urls.txt and then !a < do the job? |
18:54
🔗
|
yipdw |
yes provided that meetups are a single page |
18:54
🔗
|
yipdw |
I don't know if that's true |
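[editor's sketch] The plan discussed above (fetch /everywhere/, parse out the group links, then queue each one) could look roughly like this. It is a minimal sketch, not what was actually run: the sample HTML, the class and function names, and the rule "a group root is a single path segment on www.meetup.com" are all assumptions, and that rule would also match non-group pages such as help or about links. Only the `!a <url> --no-offsite-links` command form comes from the log.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

BASE = "http://www.meetup.com/everywhere/"


class GroupLinkParser(HTMLParser):
    """Collect links that look like top-level group pages."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.groups = set()

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        url = urlparse(urljoin(self.base_url, href))
        if url.netloc != "www.meetup.com":
            return  # skip offsite links
        parts = [p for p in url.path.split("/") if p]
        # Assumption: a group root is exactly one path segment.
        if len(parts) == 1 and parts[0] != "everywhere":
            self.groups.add(f"http://www.meetup.com/{parts[0]}/")


def archivebot_commands(html, base_url=BASE):
    """Turn the links found in html into one !a line per group."""
    parser = GroupLinkParser(base_url)
    parser.feed(html)
    return [f"!a {url} --no-offsite-links" for url in sorted(parser.groups)]


# Hypothetical stand-in for the real /everywhere/ page markup.
SAMPLE = """
<a href="/ChessCom/">Chess.com</a>
<a href="http://www.meetup.com/occupytogether/">Occupy Together</a>
<a href="/everywhere/">Home</a>
<a href="http://example.com/elsewhere/">offsite</a>
<a href="/Coursera/Toronto-CA/">local group</a>
"""

for line in archivebot_commands(SAMPLE):
    print(line)
```

Queuing only the group roots leaves local groups such as /Coursera/Toronto-CA/ to be picked up by recursion under each root, which matches joepie91's guess that subpages live under a group's main URL.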
18:54
🔗
|
joepie91 |
oh right, you don't have !a <? only !ao <? |
18:54
🔗
|
yipdw |
it could be added but I really didn't want to because it'd end up making a single pipeline worker do them all |
18:54
🔗
|
joepie91 |
because I suspect subpages will live under the main URL of a group |
18:54
🔗
|
yipdw |
a smarter solution is needed for !a |
18:55
🔗
|
joepie91 |
heh |
18:55
🔗
|
joepie91 |
fair enough |
18:55
🔗
|
joepie91 |
yeah, that's a bit tricky then |
18:55
🔗
|
yipdw |
I'll start with grabbing /everywhere/ |
18:55
🔗
|
parsons |
yeah, local groups are like http://www.meetup.com/Coursera/Toronto-CA/ |
18:55
🔗
|
yipdw |
we'll know when it's there via chfoo's archivebot viewer thing and then we can proceed from there |
18:56
🔗
|
yipdw |
ok, /everywhere/ grab is underway |
18:56
🔗
|
parsons |
awesome |
18:56
🔗
|
yipdw |
also there is no way it actually did what I wanted it to do |
18:57
🔗
|
joepie91 |
lol |
18:57
🔗
|
joepie91 |
"software that actually /works/? impossible!" |
18:57
🔗
|
yipdw |
oh wait |
18:57
🔗
|
yipdw |
I see |
18:57
🔗
|
yipdw |
this is not so bad |
18:57
🔗
|
yipdw |
/everywhere/ links to e.g. http://www.meetup.com/occupytogether/ |
18:57
🔗
|
yipdw |
but under that there exists a good hierarchy |
18:58
🔗
|
yipdw |
let's try one of the smaller ones, /ChessCom. |
18:58
🔗
|
yipdw |
/ |
18:59
🔗
|
parsons |
Zebranky_: no plans for export functionality that I know of |
19:00
🔗
|
Zebranky |
Oh well, I'll figure something out. Thanks anyway! |
19:01
🔗
|
parsons |
ooh, we do have an api |
19:01
🔗
|
parsons |
so, it may not be that hard |
19:01
🔗
|
parsons |
http://www.meetup.com/meetup_api/ |
19:03
🔗
|
yipdw |
I think I got this meetup everywhere thing |
19:03
🔗
|
yipdw |
one moment |
19:03
🔗
|
Zebranky |
That probably covers most things, true. I'll note that it doesn't cover the organizer "money" view, which I did write something to scrape... 2.5 years ago, when this was a thing (wow, I should *really* close it) |
19:03
🔗
|
|
cbb has joined #archiveteam |
19:04
🔗
|
joepie91 |
hehe |
19:05
🔗
|
|
ruukasu has quit IRC (Ping timeout: 265 seconds) |
19:07
🔗
|
yipdw |
wtf, is there some code on meetup everywhere that disables web inspectors |
19:07
🔗
|
yipdw |
oh no it's just my system being fucky again |
19:07
🔗
|
parsons |
ah ok, was worried |
19:08
🔗
|
parsons |
we're putting more and more methods into the api to support our native apps, so I'm sure it's just a matter of time |
19:09
🔗
|
yipdw |
ok, so |
19:09
🔗
|
yipdw |
I can just !a all of these |
19:09
🔗
|
yipdw |
https://gist.github.com/anonymous/a73b819fb9a3593c52b6 |
19:09
🔗
|
yipdw |
does this look complete? |
19:09
🔗
|
parsons |
I will double check |
19:09
🔗
|
yipdw |
nblr has generously donated an archivebot node so we'll have capacity for that too |
19:10
🔗
|
yipdw |
I think I'll run these without offsite link fetch; this will still fetch page requisites (e.g. meetupstatic images) |
19:12
🔗
|
parsons |
hmm, here's what I dug up about a month ago: "There are 6,623 communities, 102,538 local groups, and 217,979 events" |
19:13
🔗
|
parsons |
so, I will find a more complete list |
19:13
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
19:13
🔗
|
|
cbb has quit IRC (Read error: Operation timed out) |
19:14
🔗
|
|
cbb has joined #archiveteam |
19:15
🔗
|
yipdw |
parsons: for meetup everywhere? |
19:15
🔗
|
yipdw |
oh |
19:15
🔗
|
yipdw |
I just pulled what I could get off the front page |
19:17
🔗
|
parsons |
yeah, I thought we'd have a complete index, but it doesn't look like it |
19:17
🔗
|
parsons |
I can get a master list though |
19:18
🔗
|
yipdw |
ok |
19:19
🔗
|
yipdw |
if it's in the 10^3 range or higher it might get trickier |
19:19
🔗
|
yipdw |
unless most of these are super-small |
19:19
🔗
|
yipdw |
(maybe the index only shows the big ones?) |
19:19
🔗
|
|
dashcloud has joined #archiveteam |
19:21
🔗
|
parsons |
yeah, it probably shows the biggest ones |
19:21
🔗
|
|
godane has joined #archiveteam |
19:22
🔗
|
parsons |
here's the full list: https://gist.github.com/adrianparsons/74318cae806fc3ada182 |
19:22
🔗
|
parsons |
most are probably really small |
19:22
🔗
|
yipdw |
yeah |
19:22
🔗
|
yipdw |
this is probably one of those weird cases where this is actually archivebottable and big |
19:22
🔗
|
yipdw |
ok |
19:23
🔗
|
|
Kenshin has quit IRC (Read error: Operation timed out) |
19:23
🔗
|
yipdw |
I can get on that in a bit, or feel free to join #archivebot and add the !a lines yourself |
19:23
🔗
|
|
Kenshin has joined #archiveteam |
19:24
🔗
|
parsons |
cool! I'll see what I can do (not familiar with the syntax) |
19:24
🔗
|
parsons |
also, heading out for a bit. I'll be back in this room, but if anyone needs to get in touch I'm adrian@adrianparsons.com |
19:24
🔗
|
|
cbb has quit IRC (Read error: Operation timed out) |
19:24
🔗
|
yipdw |
parsons: we have some docs at http://archivebot.readthedocs.org/en/latest/ |
19:25
🔗
|
parsons |
excellent, thanks |
19:25
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
19:25
🔗
|
|
cbb has joined #archiveteam |
19:27
🔗
|
|
dashcloud has joined #archiveteam |
19:36
🔗
|
|
cbb has quit IRC (Read error: Operation timed out) |
19:37
🔗
|
|
cbb has joined #archiveteam |
19:44
🔗
|
|
ATZ0 has quit IRC () |
19:47
🔗
|
|
cbb has quit IRC (Read error: Operation timed out) |
19:48
🔗
|
|
cbb has joined #archiveteam |
19:48
🔗
|
|
xk_id has joined #archiveteam |
19:58
🔗
|
|
cbb has quit IRC (Read error: Operation timed out) |
19:59
🔗
|
|
cbb has joined #archiveteam |
20:07
🔗
|
|
brayden has quit IRC (Read error: Connection reset by peer) |
20:09
🔗
|
|
cbb has quit IRC (Quit: Nettalk6 - www.ntalk.de) |
20:12
🔗
|
|
spara0 has joined #archiveteam |
20:13
🔗
|
|
brayden has joined #archiveteam |
20:26
🔗
|
|
mistym has quit IRC (Remote host closed the connection) |
20:36
🔗
|
joepie91 |
is bitcasa forums/blog/site/etc. taken care of yet? |
20:56
🔗
|
Lord_Nigh |
the dndtools sqlite thing came from the top post at http://web.archive.org/web/20140226202746/http://dndtools.eu/ |
21:16
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
21:22
🔗
|
|
dashcloud has joined #archiveteam |
21:30
🔗
|
joepie91 |
chfoo: yipdw: ivan`: xmc: is bitcasa forums/blog/site/etc. taken care of yet |
21:30
🔗
|
* |
joepie91 is just going to add it to archivebot again if no response... |
21:30
🔗
|
xmc |
no idea re: bitcasa |
21:31
🔗
|
chfoo |
i guess it has? http://archive.fart.website/archivebot/viewer/?q=bitcasa |
21:38
🔗
|
|
mistym has joined #archiveteam |
21:43
🔗
|
|
mistym_ has joined #archiveteam |
21:44
🔗
|
|
mistym has quit IRC (Read error: Operation timed out) |
21:48
🔗
|
joepie91 |
... viewer? wut |
21:48
🔗
|
joepie91 |
anyway |
21:48
🔗
|
joepie91 |
needs a more recent crawl of the forums |
21:52
🔗
|
|
[2]the_fo has joined #archiveteam |
21:53
🔗
|
|
sankin has quit IRC (Leaving.) |
21:57
🔗
|
|
the_fox_ has joined #archiveteam |
22:03
🔗
|
|
[2]the_fo has quit IRC (Read error: Operation timed out) |
22:03
🔗
|
|
ruukasu has joined #archiveteam |
22:04
🔗
|
|
[1]the_fo has joined #archiveteam |
22:06
🔗
|
|
[2]the_fo has joined #archiveteam |
22:11
🔗
|
|
the_fox_ has quit IRC (Read error: Connection reset by peer) |
22:11
🔗
|
|
[1]the_fo has quit IRC (Read error: Connection reset by peer) |
22:11
🔗
|
|
[2]the_fo has quit IRC (Read error: Connection reset by peer) |
22:26
🔗
|
|
the_fox_ has joined #archiveteam |
22:53
🔗
|
|
mistym__ has joined #archiveteam |
22:54
🔗
|
|
mistym_ has quit IRC (Read error: Operation timed out) |
23:14
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
23:26
🔗
|
|
dashcloud has joined #archiveteam |
23:36
🔗
|
|
the_fox_ has quit IRC (Ping timeout: 492 seconds) |
23:38
🔗
|
|
sivoais_ has quit IRC (Ping timeout: 252 seconds) |
23:39
🔗
|
|
aaaaaaaaa has joined #archiveteam |
23:41
🔗
|
|
the_fox_ has joined #archiveteam |
23:44
🔗
|
|
nertzy has quit IRC (Read error: Operation timed out) |
23:47
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
23:53
🔗
|
|
dashcloud has joined #archiveteam |
23:54
🔗
|
|
the_fox_ has quit IRC (Read error: Operation timed out) |
23:58
🔗
|
|
the_fox_ has joined #archiveteam |