Time |
Nickname |
Message |
00:18
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
00:20
🔗
|
|
SmileyG has joined #archiveteam |
00:21
🔗
|
arkiver |
yahooanswers grab is started!! |
00:21
🔗
|
arkiver |
500000 items queued |
00:24
🔗
|
|
Smiley has quit IRC (Read error: Operation timed out) |
00:25
🔗
|
OpticalSw |
Oh no haha will fire up home server and remote machines |
00:31
🔗
|
|
bai has quit IRC (Quit: server reboot) |
00:36
🔗
|
* |
nicolas17 has updated http://archiveteam.org/index.php?title=Mapillary |
00:49
🔗
|
|
OpticalSw has quit IRC (Ping timeout: 268 seconds) |
00:56
🔗
|
|
bauruine has quit IRC (Ping timeout: 260 seconds) |
01:00
🔗
|
JesseW |
not sure why you asked me specifically about Mapillary earlier, but FWIW, I agree with all your changes |
01:09
🔗
|
|
BlueMaxim has joined #archiveteam |
01:21
🔗
|
nicolas17 |
JesseW: I wanted to ask *somebody* for feedback before making my first edit to the wiki, you were quoted in the page (and I removed the quote because it seemed outdated by now), and we talked about it elsewhere :P |
01:24
🔗
|
JesseW |
ah, that makes sense :-) |
01:25
🔗
|
JesseW |
yes, thank you for removing my name (and the quote) -- I didn't need to be explicitly mentioned |
01:25
🔗
|
nicolas17 |
also, shame on me, I still didn't ask them about the "wikimedia commons export shouldn't have watermark" thing |
01:27
🔗
|
JesseW |
well, now you are reminded :-) |
01:51
🔗
|
|
tuankiet has joined #archiveteam |
01:56
🔗
|
tomaspark |
I've uploaded pov-ray usenet archive to IA |
02:08
🔗
|
JesseW |
tomaspark: nice -- feel free to add a link from somewhere relevant on the archiveteam wiki |
02:08
🔗
|
nicolas17 |
neat :O |
02:11
🔗
|
nicolas17 |
I wonder if the usual crowd still hangs out in povray.off-topic, I should peek in |
02:12
🔗
|
* |
nicolas17 gets nostalgic |
02:20
🔗
|
tomaspark |
i've posted the url on the usenet talk page |
02:22
🔗
|
tomaspark |
I am currently downloading the mozilla/netscape usenet |
03:07
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
03:08
🔗
|
|
RichardG has joined #archiveteam |
03:16
🔗
|
alembic |
orkut all done :3 |
03:16
🔗
|
nicolas17 |
:O |
03:16
🔗
|
alembic |
http://tracker.archiveteam.org/orkut/ |
03:16
🔗
|
alembic |
0 to do |
03:17
🔗
|
alembic |
(unless that's a partial list?) |
03:17
🔗
|
|
tomwsmf has quit IRC (Read error: Operation timed out) |
03:17
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
03:22
🔗
|
nicolas17 |
while true; do curl 'https://a.mapillary.com/v2/stats/im?client_id=MkJKbDA0bnZuZlcxeTJHTmFqN3g1dzo1YTM0NjRkM2EyZGU5MzBh'; echo; sleep 60; done |
03:23
🔗
|
|
dashcloud has joined #archiveteam |
03:27
🔗
|
tomaspark |
I've just found a list of usenet servers @ http://www.nyx.net/~bkraft/ |
03:28
🔗
|
nicolas17 |
I have never used usenet |
03:29
🔗
|
nicolas17 |
but the private NNTP servers I ever used were gmane, povray, and lugnet |
03:29
🔗
|
* |
joepie91 whoop whoop off-topic siren |
03:29
🔗
|
joepie91 |
best move to #archiveteam-bs :P |
03:43
🔗
|
|
nicolas17 has quit IRC (Read error: Operation timed out) |
04:06
🔗
|
JesseW |
nicolas17 -- that client_id is invalid |
04:08
🔗
|
JesseW |
flightcar.com shut down in July -- might as well stick it on the Deathwatch list |
04:15
🔗
|
|
aMunster has joined #archiveteam |
04:17
🔗
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
04:23
🔗
|
|
Sk1d has joined #archiveteam |
04:29
🔗
|
xmc |
... but it's dead already |
04:30
🔗
|
Frogging |
we can watch it be dead |
04:30
🔗
|
Frogging |
(also http://archiveteam.org/index.php?title=Deathwatch#Dead_as_a_Doornail ) |
04:33
🔗
|
JesseW |
we can be a place for historians in 20 years to ask "what services died in July 2016" and get an answer |
04:37
🔗
|
|
d_rebel_ is now known as d_rebel |
04:59
🔗
|
|
Start_ has joined #archiveteam |
04:59
🔗
|
|
enr1c0 has quit IRC (Read error: Operation timed out) |
05:02
🔗
|
|
Start has quit IRC (Read error: Operation timed out) |
05:03
🔗
|
|
Start has joined #archiveteam |
05:06
🔗
|
|
Start_ has quit IRC (Read error: Operation timed out) |
05:22
🔗
|
|
redlob has quit IRC (Read error: Operation timed out) |
05:28
🔗
|
|
redlob has joined #archiveteam |
05:43
🔗
|
|
tomaspark has quit IRC (Ping timeout: 255 seconds) |
05:45
🔗
|
|
mutoso_ has quit IRC (Read error: Operation timed out) |
06:00
🔗
|
|
mutoso has joined #archiveteam |
06:02
🔗
|
|
arrith has quit IRC (Read error: Operation timed out) |
06:09
🔗
|
|
Honno has joined #archiveteam |
06:12
🔗
|
|
Coderjoe has quit IRC (Read error: Operation timed out) |
06:16
🔗
|
|
patrickod has quit IRC (west.us.hub irc.mzima.net) |
06:16
🔗
|
|
Chorca has quit IRC (west.us.hub irc.mzima.net) |
06:20
🔗
|
|
patricko- has joined #archiveteam |
06:24
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
06:39
🔗
|
|
Chorca has joined #archiveteam |
06:43
🔗
|
|
Coderjoe has joined #archiveteam |
06:54
🔗
|
|
Coderjoe has quit IRC (ircd.choopa.net irc.mzima.net) |
06:58
🔗
|
|
Coderjoe_ has joined #archiveteam |
07:30
🔗
|
|
zenguy has quit IRC (Read error: Operation timed out) |
07:38
🔗
|
|
zenguy has joined #archiveteam |
07:43
🔗
|
|
tomaspark has joined #archiveteam |
07:55
🔗
|
|
BartoCH has joined #archiveteam |
08:05
🔗
|
|
BartoCH has quit IRC (Remote host closed the connection) |
08:05
🔗
|
|
BartoCH has joined #archiveteam |
08:25
🔗
|
|
bzc6p has joined #archiveteam |
08:25
🔗
|
|
swebb sets mode: +o bzc6p |
08:25
🔗
|
|
bzc6p has left |
08:43
🔗
|
|
phuzion has quit IRC (Read error: Operation timed out) |
08:47
🔗
|
|
_vOYtEC has quit IRC (Ping timeout: 250 seconds) |
08:48
🔗
|
|
phuzion has joined #archiveteam |
09:31
🔗
|
|
WinterFox has joined #archiveteam |
09:49
🔗
|
|
GLaDOS has quit IRC (Quit: Oh crap, I died.) |
09:49
🔗
|
|
GLaDOS has joined #archiveteam |
09:59
🔗
|
SketchCow |
We also might be a series of embittered old men |
10:39
🔗
|
|
tomaspark has quit IRC (Ping timeout: 255 seconds) |
10:40
🔗
|
|
tomaspark has joined #archiveteam |
10:45
🔗
|
|
W1nterFox has joined #archiveteam |
10:50
🔗
|
|
WinterFox has quit IRC (Read error: Operation timed out) |
10:55
🔗
|
|
W1nterFox has quit IRC (Ping timeout: 492 seconds) |
10:59
🔗
|
|
nicolas17 has joined #archiveteam |
10:59
🔗
|
|
WinterFox has joined #archiveteam |
11:08
🔗
|
|
bzc6p has joined #archiveteam |
11:08
🔗
|
|
swebb sets mode: +o bzc6p |
11:09
🔗
|
|
bzc6p has left |
11:14
🔗
|
|
alembic has quit IRC (Read error: Connection reset by peer) |
11:15
🔗
|
|
alembic has joined #archiveteam |
11:15
🔗
|
|
aMunster has quit IRC (Read error: Operation timed out) |
11:15
🔗
|
|
dxrt has quit IRC (Read error: Operation timed out) |
11:19
🔗
|
|
aMunster has joined #archiveteam |
11:19
🔗
|
|
dxrt has joined #archiveteam |
11:22
🔗
|
|
Madthias has quit IRC (Quit: â–’^Ù¥ â–’^Ù¥) |
11:26
🔗
|
|
aMunster has quit IRC (Read error: Operation timed out) |
11:34
🔗
|
|
aMunster has joined #archiveteam |
12:06
🔗
|
arkiver |
#noanswers for Yahoo! Asnwers! |
12:07
🔗
|
|
vOYtEC has joined #archiveteam |
12:13
🔗
|
arkiver |
Problem with yahooanswers not running on the warrior is fixed. |
12:16
🔗
|
wp494 |
reposting from -bs on YA: |
12:16
🔗
|
wp494 |
<Igloo^> For those looking at yahoo answers, Keep your concurrency low otherwise you get banned and get a 500 (printed in browser as error 999) |
12:17
🔗
|
arkiver |
thanks |
12:33
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
12:39
🔗
|
|
BartoCH has joined #archiveteam |
12:41
🔗
|
|
nicolas17 has quit IRC (Ping timeout: 244 seconds) |
13:01
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
13:04
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
13:07
🔗
|
|
dashcloud has joined #archiveteam |
13:21
🔗
|
Nemo_bis |
Need to find the dependencies for fedora 23... http://archiveteam.org/index.php?title=Talk:Wget_with_Lua_hooks |
13:37
🔗
|
|
WinterFox has quit IRC (Read error: Operation timed out) |
13:39
🔗
|
|
z00nx has quit IRC (Quit: WeeChat 1.5) |
13:44
🔗
|
|
z00nx has joined #archiveteam |
13:44
🔗
|
|
powerKitt has joined #archiveteam |
13:47
🔗
|
|
z00nx has quit IRC (Client Quit) |
13:49
🔗
|
powerKitt |
So, Gawker.com is going to be closing its doors on the 25th. According to their shutdown announcement, they don't have a finalized plan for the site's archives. It's likely possible to just grab it with wget. Notably, two of the site's articles are hidden from robots.txt |
13:51
🔗
|
|
powerKitt has quit IRC (Quit: Page closed) |
14:07
🔗
|
Igloo^ |
We need to have a bot which responds to any mention of gawker with "It's done" |
14:09
🔗
|
Medowar |
we have the title... |
14:15
🔗
|
|
z00nx has joined #archiveteam |
14:16
🔗
|
|
z00nx has quit IRC (Client Quit) |
14:18
🔗
|
|
z00nx has joined #archiveteam |
14:20
🔗
|
|
z00nx has quit IRC (Client Quit) |
14:21
🔗
|
|
z00nx has joined #archiveteam |
14:21
🔗
|
|
z00nx has quit IRC (Client Quit) |
14:22
🔗
|
|
z00nx has joined #archiveteam |
14:25
🔗
|
|
z00nx has quit IRC (Client Quit) |
14:25
🔗
|
|
z00nx has joined #archiveteam |
14:28
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
14:33
🔗
|
|
BartoCH has joined #archiveteam |
14:49
🔗
|
Frogging |
tfw nobody reads topics |
14:50
🔗
|
|
tomaspark has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
patricko- has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
godane has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
Jogie has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
db48x has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
yipdw has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
Fake-Nam1 has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
Igloo^ has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
midas has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
achip has quit IRC (hub.efnet.us irc.Prison.NET) |
14:50
🔗
|
|
Fake-Name has joined #archiveteam |
14:51
🔗
|
|
midas1 has joined #archiveteam |
14:51
🔗
|
|
swebb sets mode: +o midas1 |
14:51
🔗
|
|
yipdw_ has joined #archiveteam |
14:51
🔗
|
|
Igloo^_ has joined #archiveteam |
14:52
🔗
|
|
patrickod has joined #archiveteam |
14:56
🔗
|
|
nicolas17 has joined #archiveteam |
15:13
🔗
|
|
godane has joined #archiveteam |
15:20
🔗
|
|
achip has joined #archiveteam |
15:22
🔗
|
|
bzc6p has joined #archiveteam |
15:22
🔗
|
|
swebb sets mode: +o bzc6p |
15:22
🔗
|
|
bzc6p has left |
15:25
🔗
|
|
Igloo^_ is now known as Igloo^ |
15:27
🔗
|
xmc |
TWO of the site's articles are hidden you guys |
15:27
🔗
|
xmc |
you guys you guys |
15:44
🔗
|
|
JesseW has joined #archiveteam |
15:49
🔗
|
JesseW |
Yahoo Answers still doesn't work with the warriror, at least for me. |
16:14
🔗
|
|
RichardG has quit IRC (Ping timeout: 370 seconds) |
16:17
🔗
|
|
JesseW has quit IRC (Read error: Operation timed out) |
16:33
🔗
|
|
Morbus has quit IRC (Read error: Operation timed out) |
16:49
🔗
|
|
BartoCH has quit IRC (Ping timeout: 260 seconds) |
17:00
🔗
|
|
AlexLehm has joined #archiveteam |
17:13
🔗
|
|
BartoCH has joined #archiveteam |
17:15
🔗
|
|
sep332 has joined #archiveteam |
17:16
🔗
|
ErkDog |
If you google the string of text princeton-alums-state-dept-staffer-compete-in-revolting-sex-contest like dozens of websites have that in their robots. txt for some reason |
17:20
🔗
|
Kaz |
most/all gawker sites |
17:35
🔗
|
DFJustin |
heh somebody from the article must have lawyered up |
17:36
🔗
|
ErkDog |
it's still on Jezebel |
17:36
🔗
|
ErkDog |
http://jezebel.com/5723470/princeton-alums-state-dept-staffer-compete-in-revolting-sex-contest |
17:36
🔗
|
ErkDog |
can someone throw that into ArchiveBot? |
17:38
🔗
|
xmc |
why don't you go to #archivebot and !ao it yourself |
17:38
🔗
|
ErkDog |
because I can't |
17:38
🔗
|
xmc |
why not? |
17:38
🔗
|
ErkDog |
you have to have ops |
17:39
🔗
|
xmc |
not for !ao |
17:39
🔗
|
ErkDog |
ohhhhh no crap, thanks |
17:40
🔗
|
ErkDog |
ahhh ok o is no recursion, does that mean it basically saves the page itself and all assets of that page? |
17:40
🔗
|
xmc |
yes |
17:40
🔗
|
ErkDog |
gracias |
17:47
🔗
|
|
kristian_ has joined #archiveteam |
18:14
🔗
|
|
arrith has joined #archiveteam |
18:20
🔗
|
|
RichardG has joined #archiveteam |
18:43
🔗
|
|
Famicoman has quit IRC (Ping timeout: 260 seconds) |
18:56
🔗
|
|
tomwsmf has joined #archiveteam |
19:09
🔗
|
|
VerifiedJ has joined #archiveteam |
19:36
🔗
|
|
kristian_ has quit IRC (Leaving) |
19:38
🔗
|
|
arrith has quit IRC (Leaving) |
19:47
🔗
|
|
RichardG has quit IRC (Ping timeout: 370 seconds) |
19:48
🔗
|
|
schbirid has joined #archiveteam |
20:13
🔗
|
SketchCow |
https://archive.org/details/gawkeryoutube |
20:22
🔗
|
|
kristian_ has joined #archiveteam |
20:45
🔗
|
|
Jogie has joined #archiveteam |
20:51
🔗
|
|
Famicoman has joined #archiveteam |
20:55
🔗
|
|
tomaspark has joined #archiveteam |
20:58
🔗
|
|
Martini-- has joined #archiveteam |
20:58
🔗
|
Martini-- |
Hi |
20:59
🔗
|
Martini-- |
...by any chance is somebody here ? |
20:59
🔗
|
schbirid |
nope |
20:59
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
21:00
🔗
|
Martini-- |
Since you have more experience than me on the Intenet Archive, I was wondering if there are some tools to automate the uploding of files to the Archive. |
21:01
🔗
|
HCross |
ia cli toolk |
21:01
🔗
|
HCross |
tool |
21:04
🔗
|
DFJustin |
Martini--: https://pypi.python.org/pypi/internetarchive |
21:04
🔗
|
Martini-- |
https://github.com/jjjake/internetarchive |
21:04
🔗
|
Martini-- |
That one? |
21:04
🔗
|
DFJustin |
yep |
21:11
🔗
|
Martini-- |
Thanks for the pointers. |
21:12
🔗
|
Martini-- |
Do you know if someone has made some kind of frontend script to have sharing file website, but on the background the files are hosted on the Internet Archive? |
21:12
🔗
|
nicolas17 |
I think that would lead to IA misuse |
21:13
🔗
|
nicolas17 |
IA is not "free storage for anything" |
21:14
🔗
|
MrRadar |
The IA would probably be able to figure it out pretty quickly due to the types of files those sites tend to attract |
21:14
🔗
|
MrRadar |
(Encrypted multi-part RARs and obviously pirated content) |
21:16
🔗
|
SketchCow |
People already do this. |
21:19
🔗
|
Martini-- |
My single idea is not to pirating material. Is to make a file sharing service for OS/2 Warp. |
21:20
🔗
|
Martini-- |
I want to make something like hobbes - http://hobbes.nmsu.edu/h-browse.php?dir=/pub/multimedia/pointer |
21:20
🔗
|
nicolas17 |
any "file sharing frontend website" will attract piracy |
21:20
🔗
|
Martini-- |
...and at the same time store all files at Internet Archive. |
21:21
🔗
|
MrRadar |
On the one hand OS/2 software would probably be a good idea to collect and archive since it's a dead platform |
21:21
🔗
|
MrRadar |
On the other the IA can terminate accounts for any reason, including getting DMCA requests for content uploaded by them |
21:22
🔗
|
Martini-- |
...the idea is not to have illegal files there. Only the files that the community shares for the platform. |
21:25
🔗
|
MrRadar |
Even if nobody uploads anything illegal they could still close your account and hide the items for essentially using them as a backend for a file sharing service |
21:26
🔗
|
nicolas17 |
Martini--: and the idea of email is not to have spam, so? :P |
21:26
🔗
|
MrRadar |
Mirroring your content onto the IA would probably be a good idea, but it should only be used as a mirror/backup of your primary storage |
21:28
🔗
|
Martini-- |
The terms of use says "The Archive may immediately terminate this Agreement at its sole discretion at any time upon written notice..." but I can not find a wording yet that they forbide that practice. |
21:30
🔗
|
Martini-- |
MrRadar, have you seen any project/tools to make that mirroring easy? |
21:31
🔗
|
MrRadar |
No, though it should be easy enough to whip something up with the IA CLI tool |
21:31
🔗
|
MrRadar |
If you really want to use the IA as a backend for a file sharing site you should contact them directly about it and get permission from them |
21:34
🔗
|
Martini-- |
I would contact them if I found way to do it first :) |
21:34
🔗
|
MrRadar |
info@archive.org |
21:35
🔗
|
Martini-- |
Is it against the rules to make a site that has a "direct link" to an IA file? I don't see direct linking to a file as an issue to them. (if the file is legal) |
21:35
🔗
|
Martini-- |
nicolas17: Estas feliz sin los K en Argentina ? |
21:36
🔗
|
nicolas17 |
OT |
21:36
🔗
|
MrRadar |
Yes, we should take this to #archiveteam-bs |
21:36
🔗
|
Martini-- |
good. |
21:42
🔗
|
|
mls_ has joined #archiveteam |
22:02
🔗
|
Martini-- |
...wow...I'm reading on Twitter that SketchCow shaved some days ago. |
22:03
🔗
|
|
mls_ is now known as Kksmkrn |
22:07
🔗
|
|
Honno has quit IRC (Read error: Operation timed out) |
22:10
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
22:14
🔗
|
|
dashcloud has joined #archiveteam |
22:24
🔗
|
|
Kksmkrn has quit IRC (leaving) |
22:29
🔗
|
Martini-- |
Bye, thanks for the pointers. |
22:29
🔗
|
|
Martini-- has quit IRC (Quit: Page closed) |
22:54
🔗
|
|
AlexLehm has quit IRC (Ping timeout: 260 seconds) |
23:00
🔗
|
joepie91 |
https://t.co/BcBVXmznGm |
23:00
🔗
|
joepie91 |
gah |
23:00
🔗
|
joepie91 |
http://urbanmilwaukee.com/2016/08/19/journal-sentinel-archive-disappears/ |