#archiveteam-bs 2017-07-02,Sun


Who What When
<jrwr> its done HCross2
All setup and confirmed working, it lives in #torarchivebot channel
[00:18]
*** Sue has joined #archiveteam-bs
BlueMaxim has joined #archiveteam-bs
[00:24]
......... (idle for 43mn)
Sue_ has joined #archiveteam-bs [01:11]
dashcloud has quit IRC (Remote host closed the connection) [01:18]
dashcloud has joined #archiveteam-bs
pizzaiolo has quit IRC (Quit: pizzaiolo)
[01:23]
..... (idle for 23mn)
j08nY has quit IRC (Quit: Leaving) [01:48]
.... (idle for 18mn)
DopefishJ is now known as DFJustin [02:06]
.......... (idle for 49mn)
BubuAnabe has quit IRC (Ping timeout: 268 seconds) [02:55]
........... (idle for 53mn)
icedice has quit IRC (Read error: Operation timed out)
qw3rty2 has joined #archiveteam-bs
[03:48]
qw3rty has quit IRC (Read error: Operation timed out) [03:54]
........ (idle for 36mn)
Sk1d has quit IRC (Ping timeout: 194 seconds) [04:30]
Sk1d has joined #archiveteam-bs [04:36]
underscor has quit IRC (Read error: Operation timed out) [04:44]
..... (idle for 23mn)
Harzilein has quit IRC (Ping timeout: 260 seconds)
underscor has joined #archiveteam-bs
swebb sets mode: +o underscor
[05:07]
....... (idle for 33mn)
Famicoman has quit IRC (Ping timeout: 260 seconds) [05:43]
Famicoman has joined #archiveteam-bs [05:51]
...... (idle for 27mn)
Honno has joined #archiveteam-bs [06:18]
acridAxid has quit IRC (Quit: marauder) [06:30]
............... (idle for 1h10mn)
acridAxid has joined #archiveteam-bs [07:40]
...... (idle for 29mn)
Famicoman has quit IRC (Ping timeout: 260 seconds) [08:09]
.... (idle for 17mn)
Famicoman has joined #archiveteam-bs [08:26]
....... (idle for 30mn)
Honno has quit IRC (Read error: Operation timed out) [08:56]
....... (idle for 31mn)
SHODAN_UI has joined #archiveteam-bs
j08nY has joined #archiveteam-bs
[09:27]
kristian_ has joined #archiveteam-bs [09:43]
........ (idle for 36mn)
SHODAN_UI has quit IRC (Remote host closed the connection) [10:19]
..... (idle for 23mn)
Famicoman has quit IRC (Ping timeout: 260 seconds) [10:42]
Famicoman has joined #archiveteam-bs [10:51]
.... (idle for 19mn)
pizzaiolo has joined #archiveteam-bs [11:10]
.............. (idle for 1h8mn)
SHODAN_UI has joined #archiveteam-bs [12:18]
<odemg> HCross2, more comics on the way in, just got up the entire DC chronology and now working on marvels [12:32]
..... (idle for 22mn)
*** Harzilein has joined #archiveteam-bs [12:54]
....... (idle for 31mn)
BlueMaxim has quit IRC (Read error: Operation timed out)
BlueMaxim has joined #archiveteam-bs
pizzaiolo has quit IRC (Read error: Operation timed out)
[13:25]
........... (idle for 53mn)
BlueMaxim has quit IRC (Read error: Operation timed out) [14:19]
bmcginty has quit IRC (Ping timeout: 268 seconds) [14:26]
<jtn2> https://archive.org/details/gna_tickets "The item is not available due to issues with the item's content."
Anyone know what this means?
(I'm finally going over others' work on Gna.)
[14:29]
*** Asparagir has quit IRC (Asparagir) [14:36]
<jtn2> Also, ISTR something go by about how items from us on archive.org should be tagged as "Archive Team" somehow. Is that retrospective -- should I tell someone about AT-related items
>
#?
[14:37]
*** kristian_ has quit IRC (Quit: Leaving)
pizzaiolo has joined #archiveteam-bs
[14:40]
<Frogging> jtn2: it means a copyright claim or something like that [14:58]
<jtn2> ugh
How does one find out what exactly it was? Will the item owner have more info?
(That's Zeryl, but they're not here any more. Can probably dig out their email address)
Could it have been automatically flagged from a malware scan?
[15:02]
*** pie_ has joined #archiveteam-bs [15:09]
<pie_> Lord_Nigh, any chance you know of a magical way to run windows steam games from linux steam?
i can get the package with download_depot but i cant actually start it with steam nor can i just run wine game.exe
[15:10]
<jtn2> If I suspect someone (Zeryl) caused some stuff to be ingested into the Wayback Machine, is there any way to verify this? [15:16]
<pie_> nevermind i was starting the wrong exe *facepalm*
it autmatically starts steam
hm nevermind. it starts steam but doesnt run :/
[15:25]
<PurpleSym> jtn2: Marked as spam. https://catalogd.archive.org/log/672398126 [15:30]
<jtn2> PurpleSym: argh. By Jeff Kaplan, presumably. Any idea if I can appeal this? (I am assuming it is not in fact spam; Zeryl appeared to be acting in good faith and their other items are good.)
(Thanks for digging that out)
[15:35]
<PurpleSym> I don’t know. info@archive.org ? [15:38]
<jtn2> It's quite possible that some Gna tickets do have spammy content, although it appeared magically immune to spam.
I did have to flag things a few times though.
PurpleSym: should I say this is an Archive Team project, do you think?
[15:38]
.... (idle for 17mn)
*** superkuh has quit IRC (Remote host closed the connection)
superkuh has joined #archiveteam-bs
[15:56]
....... (idle for 33mn)
pizzaiolo has quit IRC (Read error: Operation timed out)
pizzaiolo has joined #archiveteam-bs
icedice has joined #archiveteam-bs
[16:30]
ivan has quit IRC (Leaving)
ivan has joined #archiveteam-bs
[16:44]
<odemg> voidsta, git in hur [16:49]
...... (idle for 26mn)
*** Swizzle_ has joined #archiveteam-bs [17:15]
BubuAnabe has joined #archiveteam-bs [17:20]
Swizzle has quit IRC (Read error: Operation timed out) [17:27]
<JAA> Five million URLs completed on the Tilt API grab. Unfortunately, the queue has been growing again since yesterday evening and is now at 6.31M URLs. I've changed the concurrency and delay settings a few hours ago and am now retrieving about 50k URLs per hour (previously 30k). [17:35]
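For scale, the figures quoted above (6.31M URLs queued, about 50k URLs per hour now versus 30k before) can be turned into a rough drain-time estimate. This is only a back-of-the-envelope sketch: the function name and the `growth_per_hour` parameter are hypothetical, and the log does not say how fast the queue is actually growing, so the default assumes no growth at all.

```python
def hours_to_drain(queue_size: int, fetch_rate_per_hour: float,
                   growth_per_hour: float = 0.0) -> float:
    """Estimate hours until the queue is empty.

    growth_per_hour is a hypothetical parameter (not given in the log).
    Returns infinity if the queue grows at least as fast as it is fetched.
    """
    net_rate = fetch_rate_per_hour - growth_per_hour
    if net_rate <= 0:
        return float("inf")
    return queue_size / net_rate


# Figures taken from the message above; zero growth is an assumption.
QUEUE_SIZE = 6_310_000
for rate in (30_000, 50_000):
    hours = hours_to_drain(QUEUE_SIZE, rate)
    print(f"{rate} URLs/h: {hours:.1f} h (~{hours / 24:.1f} days)")
```

Under the no-growth assumption the new settings would clear the current queue in a little over five days; any real growth in the queue pushes that out.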
*** ivan is now known as marvinw [17:37]
.............. (idle for 1h8mn)
BartoCH has quit IRC (Remote host closed the connection)
BartoCH has joined #archiveteam-bs
[18:45]
<jrwr> GO JAA GO [18:59]
........ (idle for 37mn)
<hook54321> What do I do when grab-site is stuck on a url? [19:36]
<HCross2> Leave it. It'll sort itself
hook54321: are you using phantomjs or YouTube-dl?
[19:40]
<hook54321> HCross2: Whatever is on by default on grab-site [19:43]
<HCross2> So neither [19:43]
<hook54321> It fixed itself [19:44]
..... (idle for 21mn)
<JAA> HCross2: So, do you want to start an Al Jazeera project or should I continue throwing the URLs into ArchiveBot? [20:05]
<HCross2> JAA: I'd do something by myself but all my crawl boxes are in use. Keep loading archivebot and I'll free some room [20:06]
<JAA> Ok.
Arguably, the most important parts (the news pages) have already been archived. But there's a lot more content to grab, obviously.
I also have a huge list of social media accounts, but I'm not sure how to reliably grab those.
I've figured out something for Instagram, but other sites I'm not so sure.
[20:06]
<odemg> HCross2, do you want 1.2TB of manga? [20:16]
<HCross2> Sure [20:17]
<godane> so i'm uploading another 2849 pdfs for the ERIC archive [20:20]
btw eric.ed.gov https sometimes fails to establish connection
i make my upload script to download the html using -O EDxxxxxx.html
so if can just check for any html as zero size files
*so i can just check for any html as zero size files
then use that to make a list to do a update-metadata from
[20:30]
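The check godane describes (saved pages named `EDxxxxxx.html`, with a failed HTTPS connection leaving a zero-size file) could be sketched roughly as below. This is not godane's actual script: the function name and the directory layout are assumptions, and the follow-up metadata update step is left out because the log does not show how it is run.

```python
from pathlib import Path
from typing import List


def find_empty_html(directory: str) -> List[str]:
    """Return the IDs (file stems such as 'ED123456') of zero-byte HTML files.

    Assumes every saved page sits directly in `directory` and is named
    ED<number>.html, as in the -O EDxxxxxx.html pattern mentioned above.
    """
    return sorted(
        path.stem
        for path in Path(directory).glob("ED*.html")
        if path.is_file() and path.stat().st_size == 0
    )
```

Calling `find_empty_html(".")` in the download directory would give the list of IDs whose HTML needs to be fetched again before their metadata is updated.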
.... (idle for 16mn)
*** Honno has joined #archiveteam-bs [20:48]
<JAA> HCross2: http://doc.aljazeera.net/ probably needs some special treatment to retrieve the videos (Brightcove player). Youtube-dl seems to have some support for Al Jazeera, but I don't think it will work here (plus it's broken in ArchiveBot). [20:56]
<HCross2> JAA: looks like they do geo blocking too
I can't play some of the videos from my UK IP
[20:57]
<JAA> Yay
Do you have an example which doesn't work for you? I just tested a few and those seemed to work here.
[20:58]
*** Honno has quit IRC (Read error: Operation timed out) [21:08]
....... (idle for 32mn)
<hook54321> Has anyone else had issue with using WebRecorder Player to read warcs? I can't seem to be able find pages that should be there...
*issues
[21:40]
..... (idle for 20mn)
nvm it's a bug
what's the best way to load multiple warc files simultaneously so they can be browsed at the same time?
[22:00]
<kisspunch> hook54321: another option is to combine them with tools like https://github.com/alard/megawarc [22:02]
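As a rough illustration of what "combining" can mean: a gzip stream may consist of several members back to back, so `.warc.gz` files that are compressed record by record can usually be joined by plain byte concatenation. The sketch below does only that. It is not the megawarc tool, the function name is hypothetical, and it assumes all inputs are the same kind of file (all `.warc.gz` or all uncompressed `.warc`).

```python
import shutil
from pathlib import Path
from typing import Iterable


def concat_warcs(sources: Iterable[str], destination: str) -> int:
    """Append each source file to `destination` byte for byte.

    Assumes the inputs are all gzip-compressed WARCs or all uncompressed
    WARCs; mixing the two would produce an unreadable file.
    Returns the total number of bytes written.
    """
    total = 0
    with open(destination, "wb") as out:
        for src in sources:
            with open(src, "rb") as inp:
                shutil.copyfileobj(inp, out)
            total += Path(src).stat().st_size
    return total
```

The combined file can then be opened as a single WARC by whichever viewer is in use, which sidesteps having to load several files at once.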
....... (idle for 30mn)
*** fie has joined #archiveteam-bs
SHODAN_UI has quit IRC (Remote host closed the connection)
[22:32]
Ravenloft has quit IRC (Read error: Operation timed out)
Panasonic has joined #archiveteam-bs
[22:39]
<Panasonic> so, this guy merged two things SketchCow love https://arstechnica.com/gaming/2017/07/a-programmer-turned-wikipedia-into-a-classic-text-adventure/ [22:50]
*** Dash has joined #archiveteam-bs [22:59]
...... (idle for 27mn)
Dash has quit IRC (Quit: Page closed) [23:26]
...... (idle for 26mn)
pie_ has quit IRC (Read error: Operation timed out) [23:52]
