Time |
Nickname |
Message |
00:03
🔗
|
|
ris has quit IRC () |
00:28
🔗
|
|
antomati_ has joined #archiveteam |
00:28
🔗
|
|
swebb sets mode: +o antomati_ |
00:30
🔗
|
|
antomatic has quit IRC (Read error: Operation timed out) |
00:38
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
00:40
🔗
|
|
jspiros has quit IRC (Read error: Connection reset by peer) |
00:40
🔗
|
|
jspiros has joined #archiveteam |
00:55
🔗
|
|
JesseW has joined #archiveteam |
01:10
🔗
|
|
r3c0d3x has quit IRC (Ping timeout: 260 seconds) |
01:39
🔗
|
|
philpem has quit IRC (Ping timeout: 260 seconds) |
01:46
🔗
|
|
r3c0d3x has joined #archiveteam |
02:07
🔗
|
dashcloud |
SketchCow: any chance you're able to change the collection & type for this? https://archive.org/details/dayyouwereborn Should be a playable Win3.1 title, but I messed up. Thanks! |
02:22
🔗
|
|
ploop_ has joined #archiveteam |
02:26
🔗
|
|
ploop has quit IRC (Ping timeout: 633 seconds) |
02:55
🔗
|
|
kcaj has quit IRC (Ping timeout: 250 seconds) |
02:55
🔗
|
|
d_rebel has quit IRC (Ping timeout: 250 seconds) |
02:55
🔗
|
|
Fletcher_ has quit IRC (Ping timeout: 250 seconds) |
02:55
🔗
|
|
logchfoo4 has quit IRC (Ping timeout: 250 seconds) |
02:57
🔗
|
|
logchfoo1 starts logging #archiveteam at Tue Jun 07 02:57:29 2016 |
02:57
🔗
|
|
logchfoo1 has joined #archiveteam |
02:58
🔗
|
|
kcaj has joined #archiveteam |
03:00
🔗
|
|
dashcloud has joined #archiveteam |
03:01
🔗
|
|
Gfy has joined #archiveteam |
03:08
🔗
|
|
Stilett0 has quit IRC () |
03:09
🔗
|
|
xXx_ndidd has joined #archiveteam |
03:10
🔗
|
|
vtyl has joined #archiveteam |
03:14
🔗
|
|
fie_ has joined #archiveteam |
03:18
🔗
|
|
fie has quit IRC (Ping timeout: 370 seconds) |
03:19
🔗
|
|
lytv has quit IRC (Read error: Operation timed out) |
03:22
🔗
|
|
ndiddy has quit IRC (Read error: Operation timed out) |
03:27
🔗
|
|
koon has joined #archiveteam |
03:32
🔗
|
|
xhdr has joined #archiveteam |
03:44
🔗
|
|
espes__ has joined #archiveteam |
03:45
🔗
|
|
Fletcher_ has joined #archiveteam |
03:46
🔗
|
|
Deewiant has joined #archiveteam |
04:16
🔗
|
|
Sk1d has joined #archiveteam |
05:02
🔗
|
|
BlueMaxim has joined #archiveteam |
05:24
🔗
|
|
consarnit has joined #archiveteam |
05:26
🔗
|
consarnit |
hey all! |
05:26
🔗
|
consarnit |
Can I have the wiki signup password? |
05:26
🔗
|
consarnit |
WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD |
05:26
🔗
|
consarnit |
in case there's a bot.. |
05:27
🔗
|
consarnit |
Or, alternately, could somebody start a page putting https://seene.co/ on deathwatch? |
05:27
🔗
|
consarnit |
It's a weird little creative network for 3D scans, just got acquired by SnapChat, no product updates since 2015 |
05:28
🔗
|
dxrt |
hey, yahoosucks is the password |
05:28
🔗
|
consarnit |
Seems like it won't last much longer |
05:28
🔗
|
consarnit |
takk |
05:36
🔗
|
consarnit |
looks pretty scrapeable |
05:36
🔗
|
consarnit |
undocumented api but their web renderer uses one |
05:37
🔗
|
consarnit |
.oemodel files |
05:37
🔗
|
consarnit |
which I think are proprietary |
05:37
🔗
|
consarnit |
ex https://d2qkfprjkxv2r7.cloudfront.net/uploads/scene/model/16e40b69-1834-456e-b729-ac5fc08bacee/scene.oemodel |
05:37
🔗
|
consarnit |
oh but sweet there is already a FOSS viewer |
05:37
🔗
|
consarnit |
https://github.com/detunized/seene-viewer |
05:38
🔗
|
consarnit |
so ya |
05:38
🔗
|
consarnit |
should be a pretty do-able job |
05:38
🔗
|
consarnit |
I don't know what your process is though |
05:38
🔗
|
consarnit |
do you have a scraper farm that I can like write a job for? |
05:42
🔗
|
HCross2 |
If it's small #archivebot |
05:44
🔗
|
consarnit |
Looks like there are maybe 500,000 users, lets say avg 20 pics items per user? |
05:44
🔗
|
consarnit |
Probably quite less than that |
05:44
🔗
|
consarnit |
Is that "small"? |
05:44
🔗
|
consarnit |
I have no context |
05:45
🔗
|
consarnit |
I've written lots of pythony scrapers before but IDK how you guys plan your attacks - is there a wiki page on writing Tracker jobs? |
05:48
🔗
|
|
philpem has joined #archiveteam |
05:52
🔗
|
JesseW |
That's probably small, yeah. |
05:53
🔗
|
JesseW |
We have two basic processes -- #archivebot and #warrior jobs. |
05:54
🔗
|
JesseW |
#archivebot is a set of donated servers that can manually-triggered spiderings of sites (and one-level deep external links) which then get automatically uploaded to the Internet Archive, and (generally) added to the Wayback Machine. |
05:55
🔗
|
consarnit |
oh nice! |
05:55
🔗
|
consarnit |
#ab would probably work for a small social/media network right? |
05:56
🔗
|
consarnit |
how do I schedule that? |
05:56
🔗
|
JesseW |
The #warrior is a VM, run by a few hundred people (you could be one, too!) that runs custom scripts (generally all written by our hard-working and generally amazing member named arkiver) to handle bigger or more rush jobs. |
05:57
🔗
|
JesseW |
Join the #archivebot channel on this network -- that's where the bot is commanded from. |
05:58
🔗
|
JesseW |
Initially you can just trigger specific (non-recursive) jobs, but if you suggest other ones, there are generally people available to trigger them for you. And if you stay around for a while, you'll likely get granted permission to do so yourself. |
05:58
🔗
|
JesseW |
You can see what is currently being worked on at this dashboard: http://dashboard.at.ninjawedding.org/beta |
05:59
🔗
|
JesseW |
(that's actually the beta version, but I like it a lot better than the other one) |
05:59
🔗
|
consarnit |
great domain |
06:00
🔗
|
JesseW |
yep, a lot of the domains used for archiveteam stuff are ... entertaining. |
06:11
🔗
|
xmc |
lots of personal domains mostly |
06:12
🔗
|
xmc |
woop woop woop off-topic siren |
06:12
🔗
|
xmc |
--> #archiveteam-bs |
06:28
🔗
|
|
Honno has joined #archiveteam |
06:48
🔗
|
|
WinterFox has joined #archiveteam |
07:22
🔗
|
|
schbirid has joined #archiveteam |
07:24
🔗
|
|
Baljem_ has joined #archiveteam |
07:24
🔗
|
|
Baljem has quit IRC (Ping timeout: 370 seconds) |
07:35
🔗
|
|
Cameron_D has quit IRC (Ping timeout: 370 seconds) |
07:39
🔗
|
|
maseck has quit IRC (Read error: Operation timed out) |
07:41
🔗
|
|
Cameron_D has joined #archiveteam |
07:41
🔗
|
|
maseck has joined #archiveteam |
07:44
🔗
|
|
dxrt has quit IRC (Excess Flood) |
07:46
🔗
|
|
dxrt has joined #archiveteam |
07:46
🔗
|
|
dxrt- sets mode: +o dxrt |
07:58
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
08:05
🔗
|
|
Emcy_ has joined #archiveteam |
08:05
🔗
|
|
consarnit has quit IRC (Remote host closed the connection) |
08:13
🔗
|
|
rduser has quit IRC (Ping timeout: 370 seconds) |
08:14
🔗
|
|
jut has joined #archiveteam |
08:14
🔗
|
|
rduser has joined #archiveteam |
08:18
🔗
|
|
Emcy has quit IRC (Read error: Operation timed out) |
08:21
🔗
|
|
atomotic has joined #archiveteam |
08:35
🔗
|
|
Honno_ has joined #archiveteam |
08:41
🔗
|
|
fie has joined #archiveteam |
08:43
🔗
|
|
fie has quit IRC (Remote host closed the connection) |
08:43
🔗
|
|
fie has joined #archiveteam |
08:44
🔗
|
|
fie_ has quit IRC (Ping timeout: 244 seconds) |
08:47
🔗
|
|
arkiver3 has joined #archiveteam |
08:48
🔗
|
|
Honno has quit IRC (Read error: Operation timed out) |
08:56
🔗
|
|
W1nterFox has joined #archiveteam |
08:57
🔗
|
|
WinterFox has quit IRC (Ping timeout: 1208 seconds) |
09:04
🔗
|
|
arkiver3 has quit IRC (Ping timeout: 244 seconds) |
09:05
🔗
|
|
consarnit has joined #archiveteam |
09:09
🔗
|
|
ariscop has quit IRC (Leaving) |
09:09
🔗
|
|
consarnit has quit IRC (Ping timeout: 244 seconds) |
09:14
🔗
|
|
SN4T14 has quit IRC (Ping timeout: 370 seconds) |
09:21
🔗
|
|
SN4T14 has joined #archiveteam |
09:22
🔗
|
|
fie has quit IRC (Quit: Leaving) |
09:27
🔗
|
|
fie has joined #archiveteam |
09:32
🔗
|
|
SilSte has joined #archiveteam |
10:00
🔗
|
midas |
https://torrentfreak.com/takedown-staydown-would-be-a-disaster-internet-archive-warns-160607/ |
10:02
🔗
|
|
ariscop has joined #archiveteam |
10:47
🔗
|
|
Honno__ has joined #archiveteam |
10:56
🔗
|
SketchCow |
----------------------------------------------------- |
10:56
🔗
|
SketchCow |
A LITTLE BIRD TOLD ME TWEET TWEET GOOGLE GROUPS GONE WITHIN A YEAR |
10:56
🔗
|
SketchCow |
----------------------------------------------------- |
10:57
🔗
|
|
Honno_ has quit IRC (Read error: Operation timed out) |
10:58
🔗
|
|
W1nterFox has quit IRC (Read error: Operation timed out) |
10:59
🔗
|
SketchCow |
So... plan accordingly |
11:00
🔗
|
SketchCow |
dashcloud: That thing's a broken mess |
11:05
🔗
|
dashcloud |
SketchCow: thanks- I'll take a look at it. |
11:05
🔗
|
|
Emcy has joined #archiveteam |
11:08
🔗
|
|
Honno has joined #archiveteam |
11:09
🔗
|
PurpleSym |
At least we can start with a list of groups discovered in 2011. |
11:10
🔗
|
PurpleSym |
-> https://archive.org/details/archiveteam-googlegroups?&sort=-publicdate |
11:12
🔗
|
SketchCow |
I think there's fundamental issues with the item. I got it to sort of boot and it was DLL city |
11:18
🔗
|
|
Emcy_ has quit IRC (Read error: Operation timed out) |
11:20
🔗
|
|
Honno__ has quit IRC (Read error: Operation timed out) |
11:28
🔗
|
|
WinterFox has joined #archiveteam |
11:30
🔗
|
|
Stiletto has joined #archiveteam |
11:34
🔗
|
|
dcmorton has quit IRC (Ping timeout: 370 seconds) |
11:36
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
11:36
🔗
|
|
dcmorton has joined #archiveteam |
11:36
🔗
|
|
swebb sets mode: +o dcmorton |
11:57
🔗
|
|
klg_ has joined #archiveteam |
11:57
🔗
|
|
klg has quit IRC (Ping timeout: 370 seconds) |
11:58
🔗
|
|
n00bLurke has joined #archiveteam |
12:07
🔗
|
|
n00bLurke has quit IRC (n00bLurke) |
12:07
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
12:29
🔗
|
|
dcmorton has quit IRC (Ping timeout: 370 seconds) |
12:32
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
12:33
🔗
|
|
atomotic has joined #archiveteam |
12:34
🔗
|
|
dcmorton has joined #archiveteam |
12:34
🔗
|
|
swebb sets mode: +o dcmorton |
12:39
🔗
|
|
BartoCH has joined #archiveteam |
12:50
🔗
|
|
Aranje has quit IRC (Ping timeout: 260 seconds) |
12:51
🔗
|
|
VADemon has joined #archiveteam |
13:00
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
13:01
🔗
|
|
Aranje has joined #archiveteam |
13:12
🔗
|
|
WinterFox has quit IRC (Remote host closed the connection) |
13:22
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
13:29
🔗
|
|
n00bLurke has joined #archiveteam |
13:29
🔗
|
|
BartoCH has joined #archiveteam |
13:37
🔗
|
|
BartoCH has quit IRC (Quit: WeeChat 1.5) |
13:38
🔗
|
|
BartoCH has joined #archiveteam |
14:20
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
14:29
🔗
|
|
hawc145 has quit IRC (Ping timeout: 370 seconds) |
14:32
🔗
|
|
hawc145 has joined #archiveteam |
14:34
🔗
|
|
jut_ has joined #archiveteam |
14:37
🔗
|
|
jut has quit IRC (Read error: Operation timed out) |
14:40
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
14:43
🔗
|
|
HCross2 has quit IRC (Ping timeout: 260 seconds) |
14:44
🔗
|
|
sigkell_ has quit IRC (Ping timeout: 260 seconds) |
14:44
🔗
|
|
sigkell_ has joined #archiveteam |
14:55
🔗
|
|
SN4T14 has quit IRC (Ping timeout: 370 seconds) |
14:55
🔗
|
|
SN4T14 has joined #archiveteam |
14:57
🔗
|
|
HCross2 has joined #archiveteam |
15:15
🔗
|
|
VADemon has quit IRC (Ping timeout: 250 seconds) |
15:26
🔗
|
|
VADemon has joined #archiveteam |
15:27
🔗
|
|
Cameron_D has quit IRC (Ping timeout: 370 seconds) |
15:27
🔗
|
|
Cameron_D has joined #archiveteam |
15:32
🔗
|
|
Start has joined #archiveteam |
15:48
🔗
|
|
JesseW has joined #archiveteam |
16:03
🔗
|
|
Aranje has quit IRC (Quit: Three sheets to the wind) |
16:04
🔗
|
|
sivoais_ has joined #archiveteam |
16:04
🔗
|
|
sivoais has quit IRC (Ping timeout: 370 seconds) |
16:07
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
16:10
🔗
|
|
Aranje has joined #archiveteam |
16:13
🔗
|
|
JesseW has quit IRC (Ping timeout: 370 seconds) |
16:19
🔗
|
|
Start has joined #archiveteam |
16:20
🔗
|
|
Start has quit IRC (Client Quit) |
16:34
🔗
|
|
twrist has joined #archiveteam |
16:48
🔗
|
|
GLaDOS has quit IRC (Read error: Operation timed out) |
16:48
🔗
|
|
twrist is now known as GLaDOS |
17:09
🔗
|
|
consarnit has joined #archiveteam |
17:24
🔗
|
|
hawc145 is now known as HCross |
17:36
🔗
|
|
Simpbra1 has quit IRC (Ping timeout: 370 seconds) |
17:38
🔗
|
|
Cameron_D has quit IRC (Ping timeout: 370 seconds) |
17:38
🔗
|
|
Cameron_D has joined #archiveteam |
17:41
🔗
|
|
RichardG has joined #archiveteam |
17:53
🔗
|
|
Simpbra1 has joined #archiveteam |
18:13
🔗
|
|
consarnit has quit IRC () |
18:22
🔗
|
|
Start has joined #archiveteam |
18:30
🔗
|
|
Tomcat_ has joined #archiveteam |
18:47
🔗
|
|
klg_ is now known as klg |
19:01
🔗
|
|
winr5r has quit IRC (Read error: Operation timed out) |
19:07
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
19:09
🔗
|
|
Simpbra1 has quit IRC (Read error: Operation timed out) |
19:11
🔗
|
|
Start has joined #archiveteam |
19:14
🔗
|
|
jut has joined #archiveteam |
19:16
🔗
|
|
jut_ has quit IRC (Read error: Operation timed out) |
19:18
🔗
|
|
winr4r has joined #archiveteam |
19:18
🔗
|
|
ranma is now known as madpent |
19:19
🔗
|
|
madpent is now known as ranma |
19:21
🔗
|
|
Simpbra1 has joined #archiveteam |
19:23
🔗
|
|
jut has quit IRC (Quit: Leaving) |
19:26
🔗
|
|
atomotic has joined #archiveteam |
19:40
🔗
|
|
Start has quit IRC (Quit: Disconnected.) |
19:56
🔗
|
|
maseck_ has joined #archiveteam |
20:02
🔗
|
|
atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) |
20:04
🔗
|
|
Honno has quit IRC (Ping timeout: 492 seconds) |
20:07
🔗
|
|
maseck has quit IRC (Ping timeout: 1208 seconds) |
20:24
🔗
|
|
Tomcat_ has quit IRC (Remote host closed the connection) |
20:36
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
20:36
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
20:49
🔗
|
|
ariscop has quit IRC (Quit: Leaving) |
21:05
🔗
|
|
tomwsmf-a has joined #archiveteam |
21:07
🔗
|
|
pikhq has quit IRC (Ping timeout: 506 seconds) |
21:16
🔗
|
|
n00bLurke has quit IRC (n00bLurke) |
21:23
🔗
|
|
pikhq has joined #archiveteam |
21:24
🔗
|
|
fie has quit IRC (Ping timeout: 244 seconds) |
21:26
🔗
|
|
schbirid has joined #archiveteam |
21:28
🔗
|
|
ariscop has joined #archiveteam |
21:35
🔗
|
|
ris has joined #archiveteam |
21:48
🔗
|
arkiver |
Let's get https://seene.co/ and google groups |
21:48
🔗
|
arkiver |
:D |
21:52
🔗
|
arkiver |
seene.co indeed looks pretty doable |
21:58
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
22:22
🔗
|
|
Pudsey has joined #archiveteam |
22:23
🔗
|
Pudsey |
Any word on the robots.txt issue with the blip archive? You could access it yesterday by adding www. to blip.tv but now even that gives robots.txt |
22:28
🔗
|
|
Ravenloft has joined #archiveteam |
22:39
🔗
|
JW_work1 |
arkiver: I think we got seene.co via archivebot yesterday. |
22:39
🔗
|
arkiver |
all of it? |
22:39
🔗
|
arkiver |
https://seene.co/u/zettlerm/ |
22:39
🔗
|
arkiver |
https://seene.co/s/nXH5qs/ |
22:39
🔗
|
arkiver |
for example |
22:40
🔗
|
JW_work1 |
well, we'll need to wait till it posts to IA to check, but I think we got those, yes. |
22:41
🔗
|
|
Pudsey has quit IRC (Remote host closed the connection) |
23:02
🔗
|
|
Start has joined #archiveteam |
23:06
🔗
|
|
ris has quit IRC () |
23:58
🔗
|
|
xmc has quit IRC (Read error: Operation timed out) |