Time |
Nickname |
Message |
00:25
π
|
|
mistym has quit IRC (Remote host closed the connection) |
00:28
π
|
|
oldcad has quit IRC (Quit: Leaving.) |
00:45
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
00:48
π
|
|
dashcloud has joined #archiveteam-bs |
00:55
π
|
|
mistym has joined #archiveteam-bs |
01:16
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
01:19
π
|
|
dashcloud has joined #archiveteam-bs |
01:43
π
|
SketchCow |
I just mounted a revolution at work to force IA to move to Slack |
01:43
π
|
SketchCow |
It... is going on interestingly. |
01:43
π
|
SketchCow |
Anyone need me for anything? |
01:43
π
|
SketchCow |
I know this whole thing with Sourceforge |
01:43
π
|
JesseW1 |
What is Slack? (Besides the goal of all right-thinking SubGenuis) |
01:44
π
|
SketchCow |
Check out slack.com, they walk you through it |
01:44
π
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
01:44
π
|
JesseW1 |
ah, an IM app. |
01:44
π
|
SketchCow |
ish |
01:45
π
|
JesseW1 |
hm, might be worth poking my work about it, too. We're currently (not particuarly happily) using Google Hangouts. |
01:49
π
|
JesseW1 |
How's the archiving with Slack? |
01:57
π
|
godane |
SketchCow: i'm uploading more tagesschau 20 clock evening news from 1989 |
01:57
π
|
|
schbirid2 has joined #archiveteam-bs |
01:58
π
|
godane |
also 1989 set will be complete as it can be |
01:58
π
|
godane |
they only started in September 1989 |
02:03
π
|
SketchCow |
Great |
02:06
π
|
|
primus104 has quit IRC (Leaving.) |
02:28
π
|
|
JesseW1 has quit IRC (Quit: Leaving.) |
02:50
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
02:51
π
|
xmc |
slack is ... hm. i used to be cranky-old-man about it and then i used it |
02:52
π
|
xmc |
they replace irc servers and webchat, but let you use your irc client against it |
02:52
π
|
xmc |
and their revenue model is "it is free, if you have a retention policy and so want to delete history, then insert money" |
02:52
π
|
xmc |
which is pretty well targeted, i think |
02:53
π
|
xmc |
not sure how much that will cover their expenses, but i wish them the best |
03:02
π
|
|
dashcloud has joined #archiveteam-bs |
03:09
π
|
|
bzc6p_ has joined #archiveteam-bs |
03:10
π
|
|
Start has quit IRC (Read error: Connection reset by peer) |
03:11
π
|
|
Start has joined #archiveteam-bs |
03:11
π
|
|
mistym has quit IRC (Remote host closed the connection) |
03:15
π
|
|
bzc6p has quit IRC (Ping timeout: 600 seconds) |
03:28
π
|
|
mistym has joined #archiveteam-bs |
03:40
π
|
|
JesseW has joined #archiveteam-bs |
03:42
π
|
Start |
i found an article that used the apple logo from the wiki and even credited us: http://www.hallels.com/articles/9738/20141016/apple-event-16-october-latest-news-updates-what-expect-os.htm |
03:44
π
|
|
mistym has quit IRC (Remote host closed the connection) |
03:45
π
|
|
mistym has joined #archiveteam-bs |
04:04
π
|
|
JesseW has quit IRC (Quit: Leaving.) |
04:13
π
|
|
vitzli has joined #archiveteam-bs |
04:26
π
|
|
aaaaaaaaa has quit IRC (Leaving) |
04:42
π
|
|
mistym has quit IRC (Remote host closed the connection) |
04:43
π
|
|
mistym has joined #archiveteam-bs |
04:51
π
|
xmc |
0/r |
04:52
π
|
xmc |
Start: that's kinda weird |
04:52
π
|
xmc |
i didn't know we were hosting apple's press kit :P |
04:52
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
05:01
π
|
|
dashcloud has joined #archiveteam-bs |
06:00
π
|
|
RichardG has quit IRC (Ping timeout: 370 seconds) |
06:13
π
|
Rotab |
haha |
06:36
π
|
|
RichardG has joined #archiveteam-bs |
06:38
π
|
|
mistym has quit IRC (Remote host closed the connection) |
07:05
π
|
|
bzc6p_ is now known as bzc6p |
07:18
π
|
|
JesseW has joined #archiveteam-bs |
07:32
π
|
Ctrl-S |
python guys: is it reasonable for a 10 threaded program that fetches from a JSON api and inserts the data into a DB to use 700 megs of ram? |
07:39
π
|
|
mistym has joined #archiveteam-bs |
07:43
π
|
|
Boltsie has quit IRC (Ping timeout: 506 seconds) |
07:48
π
|
|
mistym has quit IRC (Read error: Operation timed out) |
07:57
π
|
|
JesseW has quit IRC (Leaving.) |
07:59
π
|
|
Fusl has quit IRC (Read error: Operation timed out) |
08:04
π
|
|
primus104 has joined #archiveteam-bs |
08:33
π
|
|
Fusl has joined #archiveteam-bs |
08:42
π
|
|
Fusl has quit IRC (Read error: Operation timed out) |
08:49
π
|
dan- |
Ctrl-S: unfortunately it can be, what db layer and python interpreter are you using? (sqlite3, pymysql, mysqldb library running on standard cpython, pypy?) |
08:49
π
|
Ctrl-S |
um |
08:49
π
|
Ctrl-S |
python 2.7 sqlalchemy postgres |
08:53
π
|
dan- |
that's really large for something like that, never used sqlalchemy but I wouldn't expect 700 megs |
08:55
π
|
dan- |
maybe partly multithreading messing up? this line in the docs stood out: "The Session object is entirely designed to be used in a non-concurrent fashion, which in terms of multithreading means βonly in one thread at a timeβ.", but I haven't used sqlalchemy much |
08:58
π
|
Ctrl-S |
I believe i'm keeping each session local to one thread |
08:58
π
|
dan- |
other than that, if you really wanna nail it down, you can get the current memory size with something like this: http://stackoverflow.com/questions/938733/total-memory-used-by-python-process#answer-7669482 |
08:58
π
|
Ctrl-S |
thanks |
08:59
π
|
dan- |
I've had luck in the past with getting memory size, running 5k queries, getting memory size again, etc |
08:59
π
|
dan- |
changing what you do between those two grabs of the memory size can sometimes help nail down leaks and stuff like that |
09:00
π
|
Ctrl-S |
I think my problem was SQLAlchemy's unit of work thingy |
09:00
π
|
Ctrl-S |
Since i added a call to it's flush function to force it to push data to the DB, it seems to be behaving |
09:02
π
|
dan- |
ah, that's good |
09:02
π
|
Ctrl-S |
sitting at under 70 now |
09:03
π
|
|
primus104 has quit IRC (Leaving.) |
09:03
π
|
dan- |
whoo, much better |
09:03
π
|
Ctrl-S |
now i just have to see if it's like this in a days time |
09:04
π
|
dan- |
databases are fun, had an application a while ago that would just go down a few times a week, stop talking to the database at all and actually segfault cpython sometimes |
09:04
π
|
Ctrl-S |
half of my optimisation seems to be based on hunches |
09:04
π
|
dan- |
only ever happened in production though |
09:05
π
|
dan- |
turned out the db layer we were using didn't understand threading at all, and was munching all over itself >_> |
09:05
π
|
Ctrl-S |
I still need to add WARC output to my code |
09:05
π
|
Ctrl-S |
also i just realised that my API keys would be included in the WARC file |
09:06
π
|
Ctrl-S |
since they're URL parameters |
09:06
π
|
xmc |
\o/ |
09:06
π
|
xmc |
winner winner chicken dinner |
09:07
π
|
Ctrl-S |
storytime xmc? |
09:11
π
|
xmc |
bedtime |
10:01
π
|
|
Muad-Dib has quit IRC (Ping timeout: 252 seconds) |
10:15
π
|
|
vitzli has quit IRC (Quit: Leaving) |
10:37
π
|
|
vitzli has joined #archiveteam-bs |
10:43
π
|
|
mistym has joined #archiveteam-bs |
10:44
π
|
|
primus104 has joined #archiveteam-bs |
10:49
π
|
|
mistym has quit IRC (Read error: Operation timed out) |
11:44
π
|
|
mistym has joined #archiveteam-bs |
11:45
π
|
|
Muad-Dib has joined #archiveteam-bs |
11:52
π
|
|
mistym has quit IRC (Read error: Operation timed out) |
12:08
π
|
|
primus104 has quit IRC (Leaving.) |
12:38
π
|
|
Fusl has joined #archiveteam-bs |
13:30
π
|
godane |
i'm looking at mirroring metro.co.uk |
13:45
π
|
|
mistym has joined #archiveteam-bs |
13:46
π
|
godane |
i'm up to feb 1990 of tagesschau 20:00 evening news |
13:53
π
|
|
Boppen has quit IRC (Ping timeout: 198 seconds) |
13:54
π
|
|
mistym has quit IRC (Ping timeout: 512 seconds) |
14:31
π
|
|
mistym has joined #archiveteam-bs |
14:39
π
|
|
mistym has quit IRC (Remote host closed the connection) |
14:55
π
|
|
mistym has joined #archiveteam-bs |
14:59
π
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
15:14
π
|
|
primus104 has joined #archiveteam-bs |
15:19
π
|
|
Start has quit IRC (Read error: Connection reset by peer) |
15:20
π
|
|
Start has joined #archiveteam-bs |
15:22
π
|
|
primus104 has quit IRC (Leaving.) |
15:32
π
|
|
vitzli has quit IRC (Quit: Leaving) |
15:36
π
|
|
bzc6p_ has joined #archiveteam-bs |
15:41
π
|
|
zenguy_pc has quit IRC (Read error: Connection reset by peer) |
15:42
π
|
|
bzc6p has quit IRC (Ping timeout: 600 seconds) |
15:47
π
|
|
mistym has quit IRC (Remote host closed the connection) |
15:57
π
|
|
zenguy_pc has joined #archiveteam-bs |
15:59
π
|
|
JesseW has joined #archiveteam-bs |
16:15
π
|
|
Boppen has joined #archiveteam-bs |
16:21
π
|
|
JesseW has quit IRC (Quit: Leaving.) |
16:32
π
|
|
mistym has joined #archiveteam-bs |
17:02
π
|
|
aaaaaaaaa has joined #archiveteam-bs |
17:15
π
|
|
primus104 has joined #archiveteam-bs |
17:25
π
|
godane |
i'm uploading march 1990 of tagesschau 20:00 evening news |
17:30
π
|
arkiver |
godane: awesome, where did you get them? |
17:31
π
|
arkiver |
godane: you have misspelled the language: https://archive.org/details/tagesschau-20-clock-evening-news-1990-02-27 |
17:31
π
|
arkiver |
"germen" needs to be "german" |
17:33
π
|
Nertsy |
Hey so i have a question on how to do something |
17:33
π
|
Nertsy |
There is this site that is constantly updated, and on occasion things will get bumped back a page |
17:34
π
|
|
diacope has quit IRC (Ping timeout: 252 seconds) |
17:34
π
|
Nertsy |
I want to archive it, but how can I be sure that a post wont have moved across pages since i archived the page? |
17:34
π
|
godane |
http://www.tagesschau.de/inland/tsvorzwanzigjahren126~_origin-08770167-e3c2-45f7-a0b4-001139c0bbce.html |
17:42
π
|
|
diacope has joined #archiveteam-bs |
17:56
π
|
|
mistym has quit IRC (Read error: Connection reset by peer) |
17:56
π
|
|
mistym_ has joined #archiveteam-bs |
18:27
π
|
|
bzc6p_ is now known as bzc6p |
18:27
π
|
bzc6p |
Nertsy: archive it from front to rear |
18:27
π
|
bzc6p |
That is, from newer posts to older. Doing so, worst case is you archive a post twice. |
18:29
π
|
Nertsy |
bzc6p, makes sense... Unfortunately it just went down |
18:56
π
|
|
logchfoo1 starts logging #archiveteam-bs at Fri Jun 26 18:56:35 2015 |
18:56
π
|
|
logchfoo1 has joined #archiveteam-bs |
19:19
π
|
|
goekesmi has quit IRC (Remote host closed the connection) |
19:23
π
|
|
goekesmi has joined #archiveteam-bs |
19:44
π
|
|
mistym has quit IRC (Remote host closed the connection) |
20:11
π
|
|
RichardG has quit IRC (Remote host closed the connection) |
20:12
π
|
|
RichardG has joined #archiveteam-bs |
20:15
π
|
|
mistym has joined #archiveteam-bs |
20:27
π
|
schbirid2 |
is that comic at the bottom already known https://github.com/gilesbowkett/rewind ? :) |
20:38
π
|
|
mistym_ has joined #archiveteam-bs |
20:38
π
|
|
mistym has quit IRC (Read error: Connection reset by peer) |
20:45
π
|
|
godane has quit IRC (Ping timeout: 370 seconds) |
21:14
π
|
|
godane has joined #archiveteam-bs |
21:35
π
|
Start |
.title https://www.techdirt.com/articles/20150625/10561131460/canada-saves-public-public-domain-extends-copyright-sound-recordings-another-20-years.shtml |
21:35
π
|
botpie91 |
Start: Canada Saves Public From Public Domain, Extends Copyright On Sound Recordings Another 20 Years | Techdirt |
21:37
π
|
Start |
"Now songs such as Buffy Sainte-Marieβs "Universal Soldier" -- released 50 years ago this August -- are no longer in danger of entering the public domain." |
21:37
π
|
Start |
"no longer in danger" |
21:37
π
|
Start |
are you fucking kidding me |
21:45
π
|
wyatt8740 |
^ |
21:47
π
|
|
RichardG has quit IRC (Ping timeout: 252 seconds) |
21:48
π
|
garyrh |
Millions could have died. |
21:49
π
|
|
Asparagir has joined #archiveteam-bs |
21:51
π
|
schbirid2 |
Start: sarcasm |
21:51
π
|
|
RichardG has joined #archiveteam-bs |
21:51
π
|
schbirid2 |
oh wow, it is not |
21:59
π
|
aaaaaaaaa |
Won't somebody please think of the children!? |
22:01
π
|
schbirid2 |
http://i.imgur.com/yfbjCeP.jpeg |
22:05
π
|
Smiley |
I have a unfunny story about a girlfriend, a floppy disk and corrupted images which did exactly that.. |
22:11
π
|
garyrh |
something something https://xkcd.com/598/ |
22:12
π
|
Smiley |
rofl |
22:17
π
|
|
Asparagir has quit IRC (Quit: Leaving) |
22:28
π
|
godane |
SketchCow: i'm grabbing Gamespot podcast called The HotSpot |
23:11
π
|
SketchCow |
I THINK we have it, double check |
23:23
π
|
|
BlueMaxim has joined #archiveteam-bs |
23:41
π
|
godane |
SketchCow: i just looked and we don't have it: https://archive.org/search.php?query=The%20HotSpot%20gamespot |
23:42
π
|
godane |
maybe its in some web archive or tar ball some where but there is not collection for |
23:42
π
|
godane |
*for it |
23:43
π
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
23:44
π
|
|
dashcloud has joined #archiveteam-bs |