Time |
Nickname |
Message |
00:00
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
00:00
🔗
|
godane |
SketchCow: its now down it looks like: https://computer-literacy-project.pilots.bbcconnectedstudio.co.uk/ |
00:00
🔗
|
godane |
also this : https://old.reddit.com/r/DataHoarder/comments/azy6k6/bbc_computer_literacy_project_videos_down_have/ |
00:00
🔗
|
godane |
so good thing i grab all of it when i did |
00:01
🔗
|
|
BlueMax has joined #archiveteam-bs |
00:30
🔗
|
godane |
dashcloud: so i got a buffy episode from one of your tapes and turns out that i uploaded it : https://archive.org/details/Buffy_WB_WOC_2001-05-08 |
00:58
🔗
|
|
godane has quit IRC (Read error: Connection reset by peer) |
01:12
🔗
|
|
godane has joined #archiveteam-bs |
01:15
🔗
|
dashcloud |
I really thought you had a hardware capture card- I think I have some spares, so let me check storage, and if they work, I'll send you one |
01:15
🔗
|
godane |
i don't want to put any into a computer |
01:16
🔗
|
godane |
i prefer usb based ones so i don't screw up my computer |
01:16
🔗
|
dashcloud |
the big benefit of the card is that you can totally avoid ffmpeg- you can just cat /dev/video0 > vid.mpg |
01:16
🔗
|
godane |
there is a usb based box that does that too |
01:17
🔗
|
godane |
https://www.mythtv.org/wiki/Hauppauge_HD-PVR |
01:17
🔗
|
godane |
thats a bit pricely to me see what i have been speading |
01:17
🔗
|
godane |
its some where in the $70 to $150 range |
01:18
🔗
|
dashcloud |
well, if you really have no interest in a internal card, I won't search for one then |
01:57
🔗
|
|
rustypand has left http://quassel-irc.org - Chat comfortably. Anywhere. |
02:15
🔗
|
|
omarroth has quit IRC (Remote host closed the connection) |
02:15
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
02:19
🔗
|
|
omarroth has joined #archiveteam-bs |
02:45
🔗
|
|
ndiddy has left |
03:31
🔗
|
|
kiska1 has quit IRC (Ping timeout (120 seconds)) |
03:39
🔗
|
|
turnkit_ has quit IRC (Read error: Operation timed out) |
03:41
🔗
|
|
BlueMax has joined #archiveteam-bs |
03:48
🔗
|
|
kiska1 has joined #archiveteam-bs |
04:02
🔗
|
|
odemgi has joined #archiveteam-bs |
04:05
🔗
|
|
odemgi_ has quit IRC (Ping timeout: 252 seconds) |
04:11
🔗
|
|
odemg has quit IRC (Ping timeout: 615 seconds) |
04:17
🔗
|
|
omarroth has quit IRC (Remote host closed the connection) |
04:17
🔗
|
|
odemg has joined #archiveteam-bs |
04:18
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
04:41
🔗
|
|
qw3rty115 has joined #archiveteam-bs |
04:43
🔗
|
|
qw3rty114 has quit IRC (Ping timeout: 600 seconds) |
05:01
🔗
|
|
odemgi_ has joined #archiveteam-bs |
05:03
🔗
|
|
odemgi has quit IRC (Ping timeout: 252 seconds) |
05:10
🔗
|
|
odemg has quit IRC (Ping timeout: 615 seconds) |
05:16
🔗
|
|
odemg has joined #archiveteam-bs |
06:01
🔗
|
|
synm0nger has quit IRC (Read error: Operation timed out) |
06:02
🔗
|
|
mr_archiv has quit IRC (Read error: Operation timed out) |
06:02
🔗
|
|
Jopik has quit IRC (Write error: Broken pipe) |
06:05
🔗
|
|
BnAboyZ has quit IRC (Read error: Operation timed out) |
06:05
🔗
|
|
mr_archiv has joined #archiveteam-bs |
06:06
🔗
|
|
colona has quit IRC (Read error: Operation timed out) |
06:10
🔗
|
|
colona has joined #archiveteam-bs |
06:11
🔗
|
|
BnAboyZ has joined #archiveteam-bs |
06:11
🔗
|
|
decay has quit IRC (Remote host closed the connection) |
06:19
🔗
|
|
decay has joined #archiveteam-bs |
06:34
🔗
|
|
Atom__ has quit IRC (Ping timeout: 252 seconds) |
06:53
🔗
|
|
Mateon1 has joined #archiveteam-bs |
07:29
🔗
|
|
SynMonger has joined #archiveteam-bs |
07:33
🔗
|
|
abstract has quit IRC (Read error: Operation timed out) |
07:34
🔗
|
Exairnous |
JAA: I noticed that you did the social media I asked for with !ao as opposed to !a and I've wondering why? Do social media sites have lots of hidden links that loop or something? |
08:13
🔗
|
godane |
dashcloud: you can look for one of your capture card |
08:14
🔗
|
godane |
another tape is out of sync for some reason |
08:14
🔗
|
godane |
but really i may need a new computer also cause my usb ports are acting weird |
08:17
🔗
|
|
killsushi has quit IRC (Quit: Leaving) |
08:17
🔗
|
|
killsushi has joined #archiveteam-bs |
08:44
🔗
|
godane |
latest post: https://www.patreon.com/posts/having-problem-25315413 |
08:51
🔗
|
|
wp494 has quit IRC (Ping timeout: 506 seconds) |
08:51
🔗
|
|
wp494 has joined #archiveteam-bs |
08:56
🔗
|
|
xoxo has quit IRC (Ping timeout: 265 seconds) |
08:57
🔗
|
|
xoxo has joined #archiveteam-bs |
09:10
🔗
|
|
atomicthu has quit IRC (Read error: Operation timed out) |
09:10
🔗
|
|
atomicthu has joined #archiveteam-bs |
09:30
🔗
|
JAA |
Huh, we're looking for volunteer writers? I wasn't aware of that. |
09:30
🔗
|
SmileyG |
writing what? |
09:31
🔗
|
JAA |
I don't know. See what rustypand wrote yesterday at 23:28 UTC. |
09:33
🔗
|
JAA |
Exairnous: An !a job on a social media page almost never works (Mastodon being an exception). The reason is that those sites rely heavily on JavaScript/xmlHttpRequests to load more content, and the bot can't do that. So I retrieve the individual posts' URLs with my own tool (snscrape) and then archive each of those instead. It's not a perfect solution, in particular because the profile page won't work |
09:33
🔗
|
JAA |
correctly (no scrolling), but at least the content's preserved. |
09:35
🔗
|
SketchCow |
Oh fusl - start communicating with me or others before uploading piles of archiveteam items into the open collections |
09:38
🔗
|
SketchCow |
I mostly noticed because my thing that tells me how many things uploaded into the open collections noticed the spike. |
09:39
🔗
|
Fusl_ |
k ill stop the uploading, feel free to delete my stuff in the open collections, do note though that i deleted tbem from my side |
09:39
🔗
|
SketchCow |
WAIT NO |
09:39
🔗
|
SketchCow |
NO |
09:39
🔗
|
SketchCow |
N O |
09:40
🔗
|
SketchCow |
the_office_no_gif.gif |
09:40
🔗
|
VoynichCr |
ALARM ALARM ALARM |
09:40
🔗
|
Fusl_ |
what |
09:40
🔗
|
SketchCow |
I MEAN, let me know because we have faculty to just have you upload directly into the archive team stacks |
09:40
🔗
|
SketchCow |
Like, skip the line, go right into the collections |
09:41
🔗
|
SketchCow |
I just shoved all your shit into https://archive.org/details/archiveteam_googleplus |
09:41
🔗
|
SketchCow |
I can do things, I have powers |
09:42
🔗
|
SketchCow |
But you might as well be uploading directly |
09:42
🔗
|
SketchCow |
Same with these minecraft forums, which I'm going after next. |
09:42
🔗
|
Fusl_ |
i dont have access to that? |
09:42
🔗
|
SketchCow |
No, you don't |
09:42
🔗
|
SketchCow |
But I can give you access |
09:42
🔗
|
SketchCow |
I can do that |
09:42
🔗
|
SketchCow |
This is like Archive Team Top Uploader 101 |
09:43
🔗
|
SketchCow |
How have they all not told you this in whatever seedy pub at the docks you all meet in |
09:43
🔗
|
Fusl |
multiples in #googleminus asked for access, including HCross and kiska, we all dont have access yet so ¯\_(ツ)_/¯ |
09:43
🔗
|
SketchCow |
Hcross certainly has access |
09:43
🔗
|
SketchCow |
I believe kiska does too |
09:43
🔗
|
Fusl |
they do? |
09:44
🔗
|
Fusl |
well then i'm the only one without it |
09:44
🔗
|
SketchCow |
Access is not hard |
09:44
🔗
|
SketchCow |
You probably are! |
09:44
🔗
|
Fusl |
if you could grant me access that would be great |
09:44
🔗
|
HCross |
I have access |
09:45
🔗
|
Fusl |
23:56 <@HCross> I dont have collection, its going straight into opensource atm |
09:45
🔗
|
Fusl |
¯\_(ツ)_/¯ |
09:47
🔗
|
kiska |
We got access about 24 hrs ago, we assumed you got it as well |
09:47
🔗
|
Fusl |
nope |
09:47
🔗
|
HCross |
I asked for email addresses |
09:47
🔗
|
kiska |
I think I told you in voice... |
09:48
🔗
|
SketchCow |
You are all adorable bon-bons |
09:48
🔗
|
SketchCow |
Anyway, I just made a minecraft forum collection, it's throwing in the 416 items |
09:48
🔗
|
Fusl |
i'm gonna let the uploaders finish and exit after the current items are uploaded |
09:48
🔗
|
SketchCow |
http://fos.textfiles.com/RECOGNIZER/ Here's how I see all the stuff |
09:49
🔗
|
SketchCow |
That 1203 in the texts list upper left. I see a spike, I go see who's being the hero |
09:50
🔗
|
SketchCow |
Don't be thin-skinned, Fusl |
09:50
🔗
|
SketchCow |
Not a quality that works in this bag of marbles |
09:50
🔗
|
SketchCow |
Here, go read all this crazy Word-processing shiznat that Marcin Wichary is uploading |
09:50
🔗
|
SketchCow |
https://archive.org/details/@marcin_wichary |
09:51
🔗
|
SketchCow |
I find it calming |
09:51
🔗
|
Fusl |
anyways if you can get me access to the google- collection, perfect, if not, also fine, i'll just push to someone else and call it a day idc ¯\_(ツ)_/¯ |
09:52
🔗
|
Fusl |
as well as the mcf |
09:55
🔗
|
SketchCow |
You now have access to archiveteam_googleplus |
09:55
🔗
|
Fusl |
cool |
09:55
🔗
|
Fusl |
kiska or HCross: please assist in modifying the config? |
09:55
🔗
|
SketchCow |
Getting you the other one. (Things R slow) |
09:56
🔗
|
Fusl |
IA_COLLECTION="opensource" -> IA_COLLECTION="archiveteam_googleplus" ? |
09:56
🔗
|
kiska |
Yep |
09:56
🔗
|
kiska |
Save then run factory |
09:57
🔗
|
SketchCow |
Boop, you also have access to archiveteam_minecraftforums |
09:57
🔗
|
Fusl |
yeet, you are the hero |
09:58
🔗
|
Fusl |
now to figure out how to upload into a specific collection with the ia cli |
09:59
🔗
|
SketchCow |
ia upload blach blah blah -m "collection:<collection>" |
10:01
🔗
|
Fusl |
this correct? --metadata=mediatype:web --metadata=collection:archiveteam_minecraftforums |
10:01
🔗
|
SketchCow |
I use -m " " and find it works better |
10:01
🔗
|
SketchCow |
-m "mediatype:web" -m "collection:archiveteam_minecraftforums" |
10:02
🔗
|
Fusl |
k, thanks |
10:02
🔗
|
JAA |
--metadata=mediatype:web works perfectly fine as well. |
10:02
🔗
|
JAA |
Well, maybe not if you're on Windows or something, but who does that anyway? :-) |
10:04
🔗
|
SketchCow |
In only 5 more hours, my machine will be done uploading the FIRST 1.4tb batch of Minecraft Artifacts. |
10:08
🔗
|
SketchCow |
I can see that the 645 remaining gigabytes of what's uploaded so far is going in, THAT's taking an hour every 50gb |
10:08
🔗
|
Fusl |
SketchCow: is uploading to the open collection generally fine for completely random grab-site warcs or do you want me to push those into a separate collection? |
10:09
🔗
|
|
Jens has quit IRC (Remote host closed the connection) |
10:10
🔗
|
|
Jens has joined #archiveteam-bs |
10:14
🔗
|
SketchCow |
If someone uploads WARC archives to the general upload collection, it will get pushed into the outsider WARCs collection and never go into the wayback |
10:14
🔗
|
SketchCow |
The archivebot is the way for random grab-site warcs |
10:16
🔗
|
Fusl |
SketchCow: my grab-site jobs are ones that archivebot fails on |
10:16
🔗
|
Fusl |
for example, when pipeline ips get banned from websites due to request rate |
10:16
🔗
|
SketchCow |
Give it a name like archivebot_alt |
10:16
🔗
|
SketchCow |
archivebot_alt_* |
10:18
🔗
|
Fusl |
just the item name prefixed with archivebot_alt_? |
10:18
🔗
|
Fusl |
and no special collection? |
10:18
🔗
|
SketchCow |
It should go into archivebot's collection. |
10:19
🔗
|
JAA |
Uh, do we want that? That collection's already annoying enough to handle. |
10:19
🔗
|
SketchCow |
I don't know, tell me why it's a bad idea |
10:19
🔗
|
SketchCow |
What makes it annoying to handle |
10:20
🔗
|
JAA |
Well, partially it's just bugs in the ArchiveBot viewer, but inconsistent filenames etc. |
10:20
🔗
|
JAA |
But at least all files currently belong to some ArchiveBot job. |
10:21
🔗
|
JAA |
If we start throwing other files in there, it gets even messier. |
10:21
🔗
|
SketchCow |
Messier in what way? it all get just gets yanked in by IA anyway |
10:21
🔗
|
JAA |
Yeah, I mean more like "I want to find the WARCs that belong to ArchiveBot job X". |
10:22
🔗
|
JAA |
For example, when the site's excluded from the WBM or blocked by robots.txt, but you want to check what it grabbed. |
10:22
🔗
|
SketchCow |
So we could make an archivebot_alt collection |
10:24
🔗
|
JAA |
Yeah, that'd work I guess. Although at that point we might as well give it a different name since it doesn't have anything to do with ArchiveBot other than that there's some overlap of which sites are grabbed. |
10:24
🔗
|
JAA |
"archivebot_alt" to me suggests that it's another instance of ArchiveBot or something. |
10:29
🔗
|
Fusl |
"notarchivebot" |
10:31
🔗
|
Fusl |
SketchCow: speaking of "never go into the wayback", do the minecraftforum warcs go into the WBM? |
10:31
🔗
|
Fusl |
since that's technically what i grabbed them for |
10:38
🔗
|
SketchCow |
Well, now they will. |
10:41
🔗
|
|
Jopik has joined #archiveteam-bs |
10:48
🔗
|
kiska |
Can you name it archiveteam_grabsite ? |
10:53
🔗
|
SketchCow |
I could. |
10:57
🔗
|
|
argus has quit IRC (Remote host closed the connection) |
10:57
🔗
|
|
argus has joined #archiveteam-bs |
11:51
🔗
|
|
mr_archiv has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
Terbium has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
Hani has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
evul has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
purplebot has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
LFlare has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
Coderjo_ has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
Fusl has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
casc0de has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
Soni has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
svchfoo3 has quit IRC (west.us.hub irc.mzima.net) |
11:51
🔗
|
|
tjg1_ has quit IRC (west.us.hub irc.mzima.net) |
12:00
🔗
|
|
mr_archiv has joined #archiveteam-bs |
12:00
🔗
|
|
Terbium has joined #archiveteam-bs |
12:00
🔗
|
|
Hani has joined #archiveteam-bs |
12:00
🔗
|
|
evul has joined #archiveteam-bs |
12:00
🔗
|
|
purplebot has joined #archiveteam-bs |
12:00
🔗
|
|
LFlare has joined #archiveteam-bs |
12:00
🔗
|
|
Coderjo_ has joined #archiveteam-bs |
12:00
🔗
|
|
Fusl has joined #archiveteam-bs |
12:00
🔗
|
|
casc0de has joined #archiveteam-bs |
12:00
🔗
|
|
Soni has joined #archiveteam-bs |
12:00
🔗
|
|
svchfoo3 has joined #archiveteam-bs |
12:00
🔗
|
|
tjg1_ has joined #archiveteam-bs |
12:00
🔗
|
|
irc.mzima.net sets mode: +o svchfoo3 |
12:12
🔗
|
|
odemgi has joined #archiveteam-bs |
12:15
🔗
|
|
odemgi_ has quit IRC (Ping timeout: 252 seconds) |
12:18
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
12:21
🔗
|
|
odemg has quit IRC (Ping timeout: 615 seconds) |
12:26
🔗
|
|
deevious has quit IRC (Remote host closed the connection) |
12:28
🔗
|
|
odemg has joined #archiveteam-bs |
13:08
🔗
|
|
deevious has joined #archiveteam-bs |
13:13
🔗
|
|
Atom__ has joined #archiveteam-bs |
13:17
🔗
|
|
killsushi has quit IRC (Quit: Leaving) |
13:18
🔗
|
|
abstract has joined #archiveteam-bs |
13:48
🔗
|
|
ndiddy has joined #archiveteam-bs |
13:56
🔗
|
arkiver |
Fusl: you used grab-site for the minecraftforums? |
14:48
🔗
|
|
argus has quit IRC (Remote host closed the connection) |
14:48
🔗
|
|
argus has joined #archiveteam-bs |
15:06
🔗
|
|
fredgido has joined #archiveteam-bs |
15:07
🔗
|
|
fredgido_ has quit IRC (Ping timeout: 252 seconds) |
15:24
🔗
|
|
slyphic has quit IRC (Read error: Connection reset by peer) |
15:24
🔗
|
|
slyphic has joined #archiveteam-bs |
15:38
🔗
|
|
BlueMax has joined #archiveteam-bs |
15:38
🔗
|
DFJustin |
https://twitter.com/DylanLJMartin/status/1101873832003018757 |
15:49
🔗
|
|
rustypand has joined #archiveteam-bs |
15:51
🔗
|
Fusl_ |
arkiver: yes, it required lots of ignores that i had to manually compile |
15:55
🔗
|
|
rustypand has left http://quassel-irc.org - Chat comfortably. Anywhere. |
16:57
🔗
|
|
slyphic has quit IRC (Quit: leaving) |
16:57
🔗
|
|
slyphic has joined #archiveteam-bs |
17:36
🔗
|
|
marked2go has joined #archiveteam-bs |
17:52
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
17:52
🔗
|
|
wp494 has joined #archiveteam-bs |
18:02
🔗
|
|
abstract has quit IRC (Read error: Operation timed out) |
18:23
🔗
|
|
Pixi` has quit IRC (Quit: Pixi`) |
18:24
🔗
|
|
wp494 has quit IRC (Ping timeout: 255 seconds) |
18:25
🔗
|
|
wabu has quit IRC (Read error: Operation timed out) |
18:25
🔗
|
|
wabu has joined #archiveteam-bs |
18:27
🔗
|
|
Pixi has joined #archiveteam-bs |
18:40
🔗
|
|
Stiletto has quit IRC () |
18:59
🔗
|
|
icedice has joined #archiveteam-bs |
19:09
🔗
|
|
wp494 has joined #archiveteam-bs |
19:25
🔗
|
|
wp494 has quit IRC (Ping timeout: 268 seconds) |
19:43
🔗
|
Exairnous |
JAA: Interesting! And thanks for putting up with my noob questions. |
19:45
🔗
|
|
kiska1 has quit IRC (Read error: Operation timed out) |
19:47
🔗
|
|
kiska1 has joined #archiveteam-bs |
19:59
🔗
|
|
Stiletto has joined #archiveteam-bs |
20:08
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
20:10
🔗
|
|
BlueMax has joined #archiveteam-bs |
20:37
🔗
|
|
Exairnous has quit IRC (Ping timeout: 615 seconds) |
20:55
🔗
|
JAA |
Did we make any progress regarding .eu domains owned by UK residents? |
21:36
🔗
|
|
wp494 has joined #archiveteam-bs |
21:49
🔗
|
|
abstract has joined #archiveteam-bs |
21:52
🔗
|
|
Exairnous has joined #archiveteam-bs |
22:00
🔗
|
|
killsushi has joined #archiveteam-bs |
22:35
🔗
|
|
abstract has quit IRC (Read error: Operation timed out) |
23:13
🔗
|
|
ndiddy has quit IRC () |