This video demonstrates how to implement cross entropy loss with one-hot encoding for training a PyTorch transformer model, addressing common issues like padding token handling, dimension mismatches, and the importance of proper target formatting for accurate model training.
Deep Dive
Prerequisite Knowledge
- No data available.
Where to go next
- No data available.
Deep Dive
PyTorch Transformer: Part 4Added:
Hey, how's it going everyone? Welcome on in.
>> Happy Thursday already. We're getting close to the weekend. We're going to be continuing our transformer today. We've got really close. We basically ran into a bug yesterday where we messed up some things in our dictionary creation. Our tokenizer, our tokenizer got broken. We fixed it. And now we're going to be doing the training. And then we're going to adding positional encodings. And I think that will basically mean that we're done with that. Wait. I think if we get to it, if we get to it, we can make it work. All right. Let me see here. Let me do a quick little share button on our Discord.
Over to main. Here we go. Oo, nice patent. Oh, we a patent pilot quoting idea. No priority. Does anybody have advice or suggestions on me? Uh this is a good idea. Someone mentioned previously that legal zoom would be an option. Legal zoom may be a good place to start for patent. All right.
Let's see here at everyone. There we go.
>> All right.
Okay. Let's get started with some coding. Ah, this is my favorite part.
All right.
accelerator.
>> Here we go.
>> Okay, so we've got Where do we go here on? So, we basically we need to do this.
We should get some more data. Upgrading the dictionary support to betterment. We don't need to worry about that right now. I don't think we don't I think we're pretty fine without it. Training data for generator based on our input. Wait, hold on. Training data generate based on our input. Oh, right, right, right.
Okay. Yeah. Yeah. Hey, Seek the Vault.
Good to see you. Welcome on in. Happy Thursday. We're working on our transformer. We basically mostly got it done. We just got a few things to do.
We've got a few items left. We did their masking. We did our tokenizer. We did the the the multi head. We did the target parameters. We built the dictionary, the tokenizer, which we obviously this is kind of the same thing. And this is where this is where we had our bug yesterday. That's where we had a problem yesterday. We fixed that. This is an identical item. We duplicated that. Okay. So, we need to get our training data. And then we need to do a positional encoding.
So, let me do positional.
This is the one I'm excited about because I think we can do a pretty good job on this one. Hey, Just Human.
Greetings. Good to see you. Welcome on in. Happy Thursday. Hope you have a good week so far. positional encoding.
All right, we're going to do defaf in it. We're just going to copy and paste this for the most part. And we don't need we basically need is it what is our we need to match dims. I'm just going to call it dims cuz this is what I've been tracking. I've been tracking I'm gonna be calling it dims basically.
>> There we go. Okay, now I'm curious. Uh what are we doing? Stephen here. Hey, how's it going there Deepac? Good to see you. Happy Thursday. We're building our transformer. We're building our trans.
We're going to make Chad GPT. That's what we're doing. Although, we're going to make like a little mini Jet GPT cuz I don't want to like just spend a lot of money on training a large a super big model. We're going to train like a mini model, like a little mini model, and see if we can get it to do something very specific, right? So, that'll be the plan for that. Once we have completed with the initial transformer. So this is our model right here that we wrote earlier this week. We did this model right here. Really simple straightforward. This layer specifically I would like to implement myself.
This is a nice layer that is pre-built for us in PyTorch. The good part is I think we can build our own. I think we could cuz the part's pretty straightforward I think. Okay. So, and we don't we don't have to do like a whole bunch of like optimization things like for the number of heads, we don't really need to split the dimensions to parallelize the processing, right? We only need an encoder. We don't need it.
Well, we only need a decoder. We don't need encoder or one way or the other. I we only need one. Although, it doesn't really matter because it's pretty much identical architecture.
It's just how you stack them. And we could add a dropout. We don't need to worry about that too much. So, your implementing attention is the key paper.
Yeah. Look at where we got it right here. Attention's all you need. Look it.
This is it. That's it. You called it.
You called it. That's great. In a simple manner. Yep. We can We can simplify it.
Exactly. How's it going there, Horser Hobbit? Welcome on in. Thank you for saying hello. Good to have you here.
Happy Thursday. We're building Attention's all you need today. That's what we're doing today. We're building this right here. This like nice diagram.
It's a nice one. So, we've got quite a few things to do in order to achieve our objective. I think we could I think we can do it, though. I'm pretty I'm pretty positive.
>> I want to do this. I think we could do positional encoding today. We might spend the entire day on this because it's quite a bit. This could be quite a bit cuz there's a lot to consider here.
>> You're no scholar, but even heard about this paper. YOU DID? NO WAY. ALSO, hi Steven. Good to have you here. Hope you're having a great week so far. We're almost to the weekend, you guys. Should I use Premiere Pro or Cap Cut?
>> Cap Cut is free, isn't it?
>> Why wouldn't you just use Cap Cut?
Right. There's Cap Cap Cut's free.
>> Premiere Pro is going to give you a lot more features, though. And it's going to have a lot more major industry thing.
Hey, thank you for the hearts.
Appreciate it. Hey, look at that. I see little floaty heart icons there. Yep. I see them throwing up. Thank you so much.
I appreciate it. I can see when you guys do that. Yes. That's really exciting. So excited for you. All right, more power to you, sir. Thank you. Fantastic. All right, this is great, you guys.
Appreciate it. I like that. All right, so we're going to do some positional encoding and then we need to do the training itself, which I think we can almost do. We're like really close. We need our criterion, which we can put here, right? Can Can I just do that? So, we got our criterion right here. I think so. I think so. Thank you for all the hearts. Appreciate it.
Okay. Thanks, man. Man, you made my Oh, yeah. made it clear. Love from India.
Hey, Ankit, that sounds great. All right. Happy to hear it. That's fantastic. Thank you. Thank you for hanging out here. Okay, so we've got our criterion. Let me see.
Let me change the music real quick.
Okay, there we go. Let me see if we can run this Python transformer.
Okay, you got a sub, man. Thanks, NK.
Oh, that's great. Hey, you joined the right channel for software engineering.
What we're doing here is we code by hand on our keyboard. We don't use those AIs for that much because we want to learn.
This is learning. You got to make sure not to have someone else do the work for you. Otherwise, right, you won't learn anything. Oh, hi Mr. Bumpy Pickle. Good to see you, Steven and France. Good to have you here. Hope you had a great week so far. Like and subscribe or else I see that.
Thank you, Mr. Bumpy Pickle.
I appreciate it. I appreciate it. Torch has no module cross entropy loss. Yeah, it does. Wait, is it torch inn?
I don't think it's in, right? It's not in. It's cross entropy loss, right? Oh.
Oh. Oh, wait. No way. Really? Hold on.
Wait, wait, wait, wait. I feel like I should know that. Python. Let's see.
Import torch. We need We need to be like you, dude. You made my day. Oh, and kit.
Whoa. Hey, that's great feedback. Thank you. Appreciate it. That's so nice for you to say. Hey, Krishna, good to see you. Hello, bro. Happy Thursday. Hope you're having a great week. Today, we're programming and we're doing the best thing, the best thing ever.
Torch.n.cross entropy loss. Hey, it's right there. All right, perfect. Perfect. Okay, so it really is only available in the neural network. Okay, sounds good to me. All right, let's rerun that and see where it's going. Hey, Sergio, welcome on in.
Good to have you here. Happy Thursday, Sergio. Hope you're having a good week as well. All right, so I want to let's see if we can get one loss. Let's see if we can do a loss. I don't know how we're going to do it. I don't know how. What we need to do is we need our target data. We need our target data.
So, we did an input, right? Where's our input at? Where' we do our output model transform model train and then we got our output. Here we go. Here we go. This right here. Okay.
Let me move this here. So we got optimizer. Then we're going to do through epochs.
So this is going to be our input and output data.
Bro, what are you? Uh oh. What are you man? What are you man? Wait Krishna, what do you mean by that? I'm a software engineer. Wait. Uh, even more. If you guys don't know, CTO at PubNub, we raised 134 million series 1 billion devices and we connect a lot of devices, you guys.
He's a goat. Hey, Mr. Bumpy Pickle.
Thank you. Appreciate it. Good to have you guys here. All right, so this is what we're going to do today.
We're going to be optimizing our model, which means we're training the model.
And we're going to do it for so many epochs. How many epochs? I don't know.
Uh, epochs equals, we'll do like 10. No, we'll do 100 cuz we have not that much data. For epoch in range epochs. Hey, squirrel. Hey, squirrel squad. Hey, it's good to see you. Welcome on back. Good to have you here.
Happy Thursday. For epoch in range epoches, we're going to say print epoch.
All right.
Then I want to run this through on the output. Here we go. Print the output shape words equals dictionary decode output. We don't need this right now.
And I'm going to indent it anyway. So here, this is what this is the this is the this is the magic shot right here.
This is the magic right here you guys. This is the magic.
This is what we want to make it work.
Sorry, bro. Typing error. I'm asking you, what are you doing today? Oh, what are you making? We are building chat GPT is what we're making. You've probably used an AI to do some work for you before, have you? You've probably everyone's done it now. So, we're going to be building our own transformer, our own language model, you guys. That's what we're currently doing. And our language model is only going to know a few things. It's it's going to know just a few things. Just a few. And we'll be we'll we'll make it we'll make it work is what we want to do. I want to learn the algorithm. That's what we are doing.
For the most part, I understand about 95% of this for for example the nuances between encoding and decoding here and what the difference is. So your input and output you need to put both in at the same time. I'm not I it that I have that part is weird. I get it. I understand. I just don't fully get it.
Hey Twilight, good to see you. Don't forget to drink water, bro. Oh, we got the water right here. Thank you for reminding me. Apparently, it's important. It is cuz you can have problems if you don't. I tried not drinking water for a while.
Have you tried that? You can do it, but you you It'll be hard.
And it was a mistake. Whoops. Make sure to drink your water.
Okay, here we go. So this is probably going to fail because the training data is is just words and the output is going to be numbers. So hey Mark Lemon, good to see you. Thank you for clicking the hi button and welcome here the Thursday.
Happy Thursday.
Good to see you.
All right, lost criterion training data we need. So here's the problem. The model in my model allows me to input full on text data. So full-on text data, right?
The problem with that is that I don't have a way to capture the correct encoding essentially, right? The the correct embeddings, the vectorzation part, the part where we turn the words into numbers.
This is where I need help with really quick.
If I'm going to be passing data through, I don't know what the output's going to be. So if our output is this, which are pro probabilities, if there are probabilities of which number it's going to be, we need to capture that.
We need to capture that in a way so that way we can we can train what the actual the actual expected output is going to be. Right?
So that that's what that's what I need that's what I need to fix right here.
This this one little part it's like one variable right here is going to be a little bit of effort.
Just a smidgen of effort.
So how do we want to go about that is my question. Let's think about that. So it's words. We got to convert it into tokens.
And then once we convert it to tokens, we need to turn it into probabilities.
But the good news is the probabilities are going to be one hot, right? They're guarantees.
They're full-on guarantees.
So it's more code than I have written right now. So we need to capture this convert it into a So we need to we need to one hot this essentially right here.
This needs we need to one hot on that.
Where am I going to put that? How do I want to do that? So I I probably want to put it in the model the dictionary. I have a decoder here. So what if I have So I've got my normalize tokenize. Yes.
Oh, this actually I already have it. I have tokenizer here and then it needs to go to one hot after that. So I need to one hot my tokenizer.
All right. So defaf one hot self, tokens. So it has to be tokens.
Okay. Then I need to convert that into one hot. Now I think torch has a built-in onehot. We could build this ourselves. So basically we do torch.0 zeros, create all zeros, and then we update the index where the token is the value at that index.
Hello. Hello. Hey, Bogoteers. Good to see you. Happy Thursday. Welcome on in.
Hope you're having a great day so far and a great week. It's almost the weekend. All right. So, we could I want I want to see if I can use a native pie torch. All right. Pie torch one hot. Is there Is there a one hot? There is right here. There's the one hot right here.
Hey, look at that number of classes. OH, IT'S ALREADY READY for us to use. Hey, that's what I like right there.
All right. Good to hear it. Good to hear it.
So, we can just do functional one hot and then we build. We We already have the tokens. We already have the tokens.
So, we we should just be able to do this function real quick.
This might not have been as hard as I thought it was going to be. We We could do this. So, we could do this. So, return torch.nfunctional one. Was it one hot one_h hot tokens?
Is that it? Is that really all it was?
You think?
All right, let's see. So, we can't do that yet. So let's do target uh target uh labels target. How do we say this? If this is this is the this is the answer. This is what we're looking for. How about if we say targets t target targets equals training data and then we say dictionary oneh hot like that. Is that going to work? You think that'll work?
They ripped out the entire crypto API that my entire Linux kernel patch was based on. Oh, Bonzupi. That's right. You mentioned that yesterday. They did. So, my sanity took another heavy hit. Oh, no. That's terrible. Wait, but why?
Everyone wants the cryptography library to be there. There's a lot of standard cryptography libraries. Why did they remove it, you guys? Hey, Ronnie, good to see you. Ronnie in one, welcome on in. Happy Thursday. How are you getting the subtitles on YouTube live? Isn't that great? Right there. Technology I wrote many years ago right here. You can grab this yourself. It's for free. Free forever, you guys. Free forever. It's a common question everyone always asks.
So, I have the tab pinned. The tab is pinned. Also, it's here in our Discord channel. So, if I go to link share, I believe it will be around here somewhere. Here it is right here. So, if you go on our link share channel, it'll be right there on Discord, you guys.
No, yesterday I found the update that just refactored the network stack API.
Ah, got it. That's what I was complaining about yesterday. Okay, got it. Got it. And so now they've officially removed the crypto API for your entire Linux kernel patch. Oh, that's terrible. Bunupi, you found an update where they just ripped out the crypto API and replaced it with a new one. A well, okay, that makes sense. I understand they What was I'm actually kind of curious. What did they choose? I know myself some crypto. I know a little bit of crypto. When I say crypto, I mean cryptography, right? I mean, obvious and then bitcoins or other things. What you think of like cryptocurrency. In this case, we're talking about cryptography, right?
It's a misnomer. Did you know there is no encryption?
There's no encryption in cryptocurrency.
It's that Wait, you're That's counterintuitive. There's signatures.
That's why it's called crypto. There's signing, cryptographic signing, but there's no encryption, per se. If you didn't know, now you do.
Now you do. All right. So, is this one not going to work? Print. I want to print the targets. Targets.
Targets. Whoa. Target. There we go. All right, we got our targets. Print targets. Is that going to do the trick?
I don't know. We're gonna find out. Here we go. Here we go.
Uh, tokens. Oh, it doesn't like that.
Why not? Uh, argument input one must be a tensor, not a string. Oh, wait. Oh, right, right, right, right, right. Oh, right, right. Okay, we have to go a step further here. Uh, we need to say, see, not tokens. Let's say let's see see words tokens equals self dot do we normalize and tokenize right that's what we got to do we do normal and tokenize do we automatically normalize wouldn't that be better if we did right I think that would be better if we did subn words so for each word for each input sentence today. Oh, they removed the monolithic PCBC crypto API from cryptography subsystem.
And for the RXRPC subsystem, they implemented a simplified version of the same cipher in a way that breaks everything I did. Oh, they could have kept the API compatible. They could have for that. They could have just replaced the underlying cipher if they wanted to.
So many selfves. I know. What does it do? So the self just refers to the classes instance. So you see here we've got a dictionary class definition. We can make an instance of it which just means like creating a copy of it kind of cuz this is kind of like a template.
This is like saying hey this is a code structure template let's create a copy of it in memory. So I create a copy of it in memory right here calling it a dictionary right. So I create a new dictionary and all the self is going to reference the dictionary's instance itself, right? So it means this dictionary right here. That's what that means.
So in a way you're starting over. Hey, oh well your implementation simpler now that they have a crypto library that is uh local to the network stack that you're working on. So okay, well that's that's like silver lining, right? Where it's just pretty good. is a little good.
Okay, thanks. All right. Hey, how's it going there? Uh, Vinnet Vinetkumar. Hey, Vinnetkumar, thanks for saying hello.
Good to have you here. Happy Thursday.
We're building technology. Check it out.
It's so great. This is my favorite thing. It's my favorite thing, you guys.
All right. Also, for those of you who are new here, we do have a hackathon going on right now. Visit our Discord and the hackathon channel right here. Go to blocks.ai, AI create an agent and then submit it to the Discord and you'll be entered to win.
You'll be entered to win. Oh, Vinnetkumar abs. Yep. Glad to have you here. All right, so we need to finish writing our one hot with the words. We're going to say tokens.normalize norm and then we're going to pass in my words. And then the tokenize is not is still words, but it's normalized words.
So, what I think I want to do is simplify this code really quick because I'm always normalizing before I'm tokenizing. And I think that's necessary. I think it's necessary, you guys. Which PC are you using? Oh, I've got your answer right here. Are you ready? Check this out. You name Darwin right here. What does that mean? I'm running MacOSS. I'm running Mac OS. Yes, this is a Mac. You got it. It's a Mac, you guys. We're doing a Mac here, which is very Unix based. That means that I get to have all of the wonderful terminal interfaces that I want without having to have any emulation layer, basically. That's what that means.
That's what that means.
So, what do you how about you? What do you use? What do you guys use? Hey, Oxno. Good to see you. WHAT'S UP?
WELCOME on in. Good to see you back.
Happy Thursday. We're getting close to the weekend. All right, you guys. We're going to curious about this. Let's take a look. What OS do you use? Question mark. Do you go You go Windows. Wait, I got to spell that right. Wind. Yeah, that's I spel I spelled that right.
Windows. You Mac. Let's add some options here. We go Linux, right? Or other slash Arch. I don't know.
like I don't know maybe maybe like other I don't know what other it's just let's see what you guys say I'm curious about it I'm thinking Windows is going to be number one I think Windows might be number one it is I I'd be curious to see if anyone's running a Mac all right Darwin all right so I want to update this here to do I'm going to double it's okay to double normalize actually it's perfectly fine so I'm going to say words equals self.norm words. All right. So, now we need to optimize. Now that I've normalized that, we only need to say tokenize.
Tokens. Tokens. Boom. Done. Did we fix it? Yeah. I think so.
You're good. Currently, use the old Intel Mac. Oh, wa. No way. That's crazy.
Hey, check this out. I've got that right here. Wait, wait, wait. I'll show you.
I'm not using it right now, but it's the old Intel Mac right here. The old Intel Mac. It's Intel. It's got an Intel right here.
I got it. It's just sitting over here on the other desk.
But we got it. It's a 2019. It's a 2019.
It's the one with the Mac with the Intel inside. Okay.
All right. So we normalize here on our tokenizer. I also want to see if we do any other norms here. We don't need to do norms anymore. So what we do instead, we just remove the norm. We get rid of the norm as well. And boom. Simplified.
Done. Easy. Easy.
Finally, someone else. Ah, right. I know. It's crazy. I mean, it is like a seven-year-old system. Although at the same time, that's still really powerful.
Same as 2019. A. All right. Very nice.
That's awesome.
That is amazing. So, we uh we do need to do this, but we're going to deal with that later. Okay. And then uh transformer. We already got this. We're using a swish glue, which is pretty neat. Where is this? All right. So, I need to see my normalizer here. Word list normalizer. Okay. So, we do need to have it as a separate function, which is nice. And we got to keep that there. Okay. Bonzupi. Oh my lord. They bricked the protocol. Oh, what? No. Wait. Why' they do that?
That's crazy. Don't break the protocol.
That causes problems. That causes problems. Why would you do this?
Got to brick. Bricking is not a Hey, Lun. Good to see you. Welcome on in.
Happy Thursday. We're almost to the weekend, loons. We almost made it.
Did you submit your your hackathon project yet?
You know, I haven't seen too many being submitted. You might be the only one.
You might win first place.
Ben Kater, good to see you. Welcome on in. Happy to have you here. Happy Thursday. It's Welcome back. Ben Kater's here, everyone. All right, so I think I got everything I want here. Let me switch up the music really quick. Let's go to the next song. Let's try this. Is this a good song?
This is a good song.
I think we'll listen. We'll try it.
We'll try it. You just installed Chinese Linux on a VM. What is Chinese Linux?
What is that? I don't know what that is.
What could that possibly be? I'm curious about it.
Let me know.
What could it be? Let me see if I can scroll here. All right. Perfect. Okay.
Think of switching to framework 13. Ooh, nice. Sadly, they don't deliver to Saudi. Oh, no. Oh, well, you could probably have it like delivered through some sort of like hopping or something.
It's really cool UI. Oh, is it? Does it really? Oh, does it? Okay. Deep in Linux. Oh, is it deep in Linux? Is that what that is?
All right. Normalize. Normalize. We're looking good there. Okay. I think we're in the right spot now.
I think we are. All right. Things are looking good here. Let me see over here real quick.
This just smidgen. There we go. Okay.
All right. Looking good now.
Oh, Clint OS. Or is it Klein OS? Let's try this. C K Y Kalen. Kalen Linux. Is it Linux Windows clone? Oh, wa. No way. That's crazy. Look at that. It's Linux. Wait, wait, wait, wait. How are we?
>> I don't want Wait, wait, wait, wait. Uh, is there just images? So, wait. It's basically Windows, but it's Linux. Is that what that is? It looks like the Windows interface, but it's running Linux.
Never heard of that one. How is it?
I can ask their support. They don't recommend it because it might break while shipping. Oh, okay. Got it. Yeah. Yeah, that would be that would be terrible.
Go to try deepen. All right, try deepen Linux.
Oh, hey, it's all it's very similar. It looks like a Mac kind of kind of got that Mac type interface. All right, so you've got Oh, neat. So deep in Linux kind of like userfriendly Mac or you could go with the other kind of Linux which looks like a Windows.
Perfect. Then you never have to worry about anything else.
I bet it phones home to the CCP. Yep. Uh it probably lose. Oh, Ki Linux Ted. Yep.
Kali Linux for the penetration testers.
The pen testers. You guys, you got your Kali Linux here that's got all the pre- nicely builtin utilities for testing security. Kali Linux. It was featured on a popular TV show called Mr. Robot. They had Kali Linux on there.
Omari. Oh, wait. Is the Go distribution.
Is it Omari? Wait, is Omari like Omar?
There's Omar. Omari versus Omari. Is that separate?
Omarky. Omarky. So, is that just a misspelling? Mr. Robot's cool. It was a great show, right? I like that show.
Thank you for clicking the hide button, Linds. I think you meant Omari, right?
Black Arc is better than Oh, wait.
Bonzupi. I haven't heard about that one.
Uh, Black Arch. Let's see here. Ooh, we've got an upgraded version, you guys.
This reminds me of the early days. Oh, look at that. This reminds me of like the 2000s era. Wow. This this aesthetic like way back 26 years ago.
No, you you misspelled it. All right.
You misspelled it. Okay. I was wondering. I'm like, is there something?
Is there a new one? Cuz you never know.
There might be a new one. There might be a new one, you guys. Fedora. Great. I love that. I like I like Fedora. That's the yum, right?
Fedora is uh for for the package manager.
Fedora package manager is yum, right?
Oh, wait. Oh, it is yum. Yeah, yum in Fedora.
Okay, let's keep on going.
Okay, how's it going there? We we a gamer. Hey, we a gamer. Welcome on in.
Happy Thursday. Hope you're having a good week so far. Thank for clicking the high button. Okay, I want to see if our one hot worked now. Let's see if our one hot worked. Okay, did we make it work? Oh, look at that.
It worked. Hey, we're on the right track now. Now we can do some loss detection.
Oh, looking good. Yep, that's it right there. That's the correct answer.
However, it doesn't have Oh, no. No, it has the right one. Okay, so we got it.
We got it. Okay. All right, we're good.
We can start training our model now.
It's going to be great.
Good. Pen trashing testing. DRO. Also, Parrot. I've heard about Parrot. I heard about Parrot. It's been a long time.
Parrot OS.
Parrot security. Yes, I've heard about this. In fact, we've been to this website because I've already zoomed in on it.
I remember this OS. I've never used it.
It never came up. I always went to use different distros. Ah, there's different ones, you guys. You can also bootstrap the the repo with vanilla arc Linux. So, you can keep your regular daily driver set up and install the pentest tools you need for whatever you're doing. That's right. You can just install the pentest tools directly in any any of your Linux.
You don't need to install any of the dros that come pre-installed with the with the pentest tools. It is nice though to have a pentest USB, right? So you have a full ISO and then you flash your USB and you just plug it in.
Klein 11 is paid just like Windows. You got to get a license key, but it isn't necessary. And there are Open Klein, too. Oh, you can get Open Klein. Hey, I like the free option. Very nice. PNG Hunter. Uh oh, PNG Thunder. Hey, PNG Thunder. You got yourself a file extension. That's an image. An image Thunder. Good. Thank you for clicking the high button there.
Have you been following your content for a few days? Hey, great. And man, you're very underrated. Well, thank you. Your content is JavaScript.
Wait, wait, wait, wait, wait. I'm pretty sure JS has a different a different meaning in that context. I'm guessing that is a positive. I think it's a positive compliment. Thank you. Your cunning is gold. Thank you. That's so nice for you to say. I appreciate it.
We're building ourselves a chat GPT today, you guys. We just figured out our one hot as you can see here on the screen. That's exactly what I want right there. That's exactly what Now that it's one hot, we can train our model cuz we got the logs. We or the logits. They're called logs. I called them logs when I didn't know that they were called logits, right? So logits. Let's see. How do you pronounce?
Let's see. I'll show you. All right.
Let's do pronunciation. Pronuniation.
Logits.
Logits. Oh, so it's kind of like a log.
Oh, wait. Yeah, log like lodge. Oh, logits. Wait, ski lodge. Wait, no, wait.
How do you guys pronounce this? I'm so confused.
Use the soft G sound. Uh, rhymes with the word like gadgets, logets, widgets, logits, logits, logets, logets. All right, there we go. You tried installing Metas-ploit on Ubuntu.
No, wait, you have it. Metas-ploit is like a really nice pre-built package that includes a whole bunch of pin testing tools all right there. It's a really good one. You use Ventoy on a USB stick which contains digital forensics distributions. Oh, nice. I haven't heard about that one. Ventoy OS. Is it Ventoy? What is this?
Open source software tool created bootable USB drive with the ISO.
You don't need any format on the disc over. You just drag and drop the files on USB and then they then boot them directly. You can copy many files, different folders.
Vento will give you a boot menu. Select them. You can browse and you can boot them directly. Oh, that's great. I've never used this. I didn't know about it.
A new bootable USB solution. Hey, very nice. Dylan, welcome on in. Thank you for saying hello. Good to have you here.
Happy Thursday. We're talking about a whole bunch of logic, some pen testing, different operating systems, right? And AI. It's way too complicated for you to to say pronunciation.
Log logets. It's logit. But you know, if anyone says it differently, like logits, like the word log, then I think everyone kind of knows what you're saying. I think they do.
Ventoy is a bootloader, I believe. Uh, so it's a bootloadader and then you can put whatever files you want into it which is kind of nice which means you can create your own custom your own custom which is nice only the tools you use.
Okay, get status get add get commit. We want to say I know I don't I want to say that we added train. See trainable target. See if that works. Push. Get push origin main. Okay.
We might need it cuz I remember we we're missing a piece. Let me see this to-do item. All right. So, there might be a to-do item here. To-do. You might want to add the web search plugin to your Zshell. Search the browser direct from the terminal.
Oo, I think I might want to do that cuz I keep popping in and out between Chrome.
It would be nice if I didn't have to, right to do No, no, no, no, no, no, no. Wait, wait, wait. I want to There we go.
Oxnol. I kind of want to do that. web search plugin into your Zshell.
What would that look like? H.
Okay. Okay. I'll think about it. I'll think about it. Do you have a preference? Do you have a specific one that you use?
I want to use like a lightweight one.
Thank you for the hundreds. Appreciate it. Thanks for clicking 100 over there.
Whoever is doing it, I don't know cuz I can't tell. I can only see when the hundreds float to the screen.
We need to we might need to split. So we need to trim start and end between the target and training. Right? So we need to trim this and training. There we go.
Okay. That's a todo item. Let me put this at the top.
There we go.
Okay, we don't need to do it right now, I don't think. I think we'll probably be perfectly fine. No, it's actually named web search literally. Oh, it is really.
All right, let me learn more about this web search.
Uh, ZSH H.
The term web search and the context typically refers to the web search plugin. Oh my Zsh plugin that allows you to run web search directly in your terminal by opening your default browser with a search query. Oh, okay. So it does it still pops me out.
It still pops me out. So I mean it won't stay in my terminal is what this is. So it won't let's see allow me to stay in terminal or it only load browser open I'll say open browser it would be nice to be able to stay in the terminal only opens your browser okay yeah cannot display results directly inside my terminal which is what I want you know I could I could just use anti-gravity for that right so I could pop out over Here I could say cd empty and I do agy and then I could just run search queries right here, right?
I could just do that as well cuz that's essentially Google's AI and then I'd stay in my terminal and I could just be doing that. I should get used to that. The only problem is it's it's not that fast. Yep. It's still people with all the two. Yeah, exactly.
That's what I want. I want to stay in my terminal. It's my favorite. Doesn't it make you like excited to like stay in your terminal? I think it just is very it's very like heartwarming. It is probably because I've been doing this for so many years now. It's it's my IDE of choice. Okay. So, if I was going to say web search zsh, see it takes a little while.
It just takes a while to load. and see if I had done that in Chrome it would have it would have given me the answer already and see it's just taking so long.
All right. It did it did come up with some something though. Here using Oh my plugin easy if using you already using the Okay. Yeah.
Here. Let's let's let's take it let's take a look at it though.
Let's let's just at least dump it jump in here real quick. Do we have an example?
La.
I get it. I get it. Okay. Oh, there's a lot more to this. Okay. It's doing more than I want it to do. I just want a basic command.
I could write my own. I could write my own right now. Why? Why? You don't need this. You guys want to build our own shell script real quick? You want to see how this actually works? I think we could do this.
I mean, this is this is just a bit too much. If it's just going to open up a browser window, then why what what's the point?
You should write a plugin for that.
Parsing HTML in your terminal. Sheesh.
Oh, yeah. That's a good point. Yeah. Uh, yeah, that would be pretty nice. I think we could probably do it though. I think we probably could.
Okay.
Oh, so we say Google web search. Okay.
Yeah. So, I could just have a terminal command called Google that runs an open command, right? So, we could do that. Why don't we do that really quick, you guys? So, look. I mean, it's not hard. Look at this.
Question mark search equals Q equals.
How hard is that? How hard is that? It can't be that hard, right? Here, watch this. Open.
And we say oxnull-r.
Ready? Enter. Oh, wait. That should have worked.
Uh, how about we put that in quotes?
There we go. See, right there. Right.
It's like that right there.
Hey, how's it going there? 11 Malcolm, good to see you. Which free paid researchers are the best for to become AI machine learning engineer? All right.
My recommendation is that you find all the bad terrible tutorials and all the good tutorials in a self-discovery adventure on your on your own. And this is important because you need to find the struggle and the failures in order to really learn. If you find all of the answers, you're not going to know the most important part, the part that lets you fail. Your failures are what solidifies your learning. If you just get the easy answers, you're not going to get the greatest possible learning journey. You're not It's not going to work. Are you hiring? Hey, yes, I am.
Theo is light. Good to see you again, Theo. We are hiring. Yep. Just jump on over here. Go to pubnub.com. Scroll down to careers. Click on the careers link and then we zoom in. We are currently hiring some software roles and some looks like corporate council role as well. Senior solutions architect and a senior software engineer for AI and large language model systems.
Damn, don't expose like that. Oh, wait.
It did. Wait, I didn't. Google knows. Google knows already. I didn't know. Oxno. I didn't know that was going to happen. I was just giving you an example. So, like look, we could create our own alias, right? Watch this. Uh, I don't here. I don't know if it's okay for me to open my ZSH real quick, though. What we could do though is we can just type an alias in here though.
Alias Google equals and then we can do a command, right? And then we do like money one, right? We do money one. Is that it?
Omar. Hey Omar. Awesome. I'm drastically underqualified. Hey Pro swing my the best advice that any someone gave me. Swing for the fences.
There you go. It's a term that means that take the shot even if you don't qualify. And if you fail, that's fine.
Just swing. Swing. Swing for the fences.
Love your show. I could watch it all day. Hey, Theos Light, that's great to hear. Thank you. Good feedback. Uh, we'll keep doing this, you guys. We're live every day now, even the weekends.
Just kidding. It's okay. Hey, good.
Good. I was worried for a second. I was like, "Oh, no.
I'm a network ops guy." Ooh, nice. So you know about like network masks and cers right with the for slash32 to get that cider like rocks like there's nothing it has to be this IP address and it can be no others right what is slash32 you mean guys all right so we're going to say open wait did I already copy paste this yeah open and then we're going to put money one right there I'm going to put this in quotes I believe this should do the trick maybe uh I don't know. I might Hold on. Let's see here. How do we do an alias?
Alias. Alias. Alias. Did we already close it? Not this one. Up here. Here we go. Let's see. URL. Yeah. See, look at that right here. We're We're doing that right now. That's exactly what that is.
Look at that here in Turkey. Oh, I see. Wait. You see? What do you see?
All right, let's learn networking together, you guys. This is pretty cool.
See, I told you. Look how easy that is right there. It's so easy, you guys. And then you do money one, right?
for the well the only problem is I need all the monies cuz if I if I want to type in naturally like say Google blah blah blah blah blah blah with spaces then I need I need to do more than just money one but for now let's do money one alias Google hold on let's see hold on a second alias next yeah here we go H how do I want to do this? Okay. See, the only thing is I think it makes sense to do we just do this? Do we put it in quotes like that and then we say money one? Will that work?
Google help.
Help does not exist. Google.
Oh, that did something I wasn't expecting.
What do you think about the theory about recruiters or hiring managers interviewing candidates at the first stage then leaving them hanging while they finish their batch of interviews already in let's see basically you get the interview and wait till they finish who's interviewing but that results you have hearing back.
So yeah, they should it's what would be nice is if they just let you know upfront where you are. If they take a long time to respond that that's terrible, right? It's terrible. It's not good.
Also, I have wanted to know what is the difference between the role of machine learning engineer and AI engineer or are they the same? I think they're basically the same. They're the same, right? Cuz you're using machine learning to train AI, right? You use machine learning algorithms to make AI. AI is the outcome. This is the way I define it.
Everyone defines it differently. I think everything's machine learning algorithms and that creates the model weights and parameters, right? The biases. This gives you the output of AI, artificial intelligence. Now other people might say machine learning is like low rank AI. So it's the machine and then AI is deep learning.
That's that's what other people classified this, right? Which is a different it's a different dimension.
So that's kind of how I think of it.
There are lots of free materials out there. Network Chuck. Oh, Network Chuck.
Yeah, good channel. Learn the basics.
Why don't you check the plug-in source code and get an idea? Oh, right, right, right, Oxnal, that's a good idea. We could do that. It's pretty neat. Check this out. Um, I do have this. My Google command does work. Google, it works right there. It just It's just missing the query part. Google. Boom. Google like overbooking. Ah, Ray Silvers is exactly like overbooking, right? It's exactly like overbooking.
See, where was it? Here.
Uh, here's the source code. All right.
Alias. Alias.
See here. So, I want to do an alias that allows me to pass in a variable instantiation.
That's what I want.
Oh, there's too many folders to really look. There's too many folders. All right. Let's just do All right. So, uh, alias here. I'm just going to copy and paste what I wrote so that way I can get this really quick cuz this isn't our main objective.
Let's see. There's a problem though.
Google open. Yes, that's exactly what I want. Yes.
Uh, la join all the arguments with a Yes, exactly. It already knows. It already knows.
So, how do So, wait, wait, wait, wait, wait.
How to alias all on one line. Let's see if that works.
You've been most mostly making webcrawler cos light data compressors and analyzers using Python and Ubuntu Windows Linux sub subsystem. Wait w is that Windows Linux subsystem or is that a different thing? The virtual environment.
Are there more opportunities in cyber security or AI engineering? There's both. They're both right. I think more probably right now definitely spent in AI. There's more spend in AI than anything else right now. So if you're looking if you're looking AI is the answer. Hey, here we go. All right. So we could have written this ourselves.
However, uh let's just do this. All right. So alias Google Google search.
Boom. All right. Google, we'll say help me learn about torch. change function right there. Boom. Quick and easy. Look at that.
I like that. That's pretty nice. I think I might want to save that. I think I might want to save it.
Theo's light. Yeah, correct. All right.
Good to hear it. Hey, Peace Lord. Hello.
Hello. Hello. Welcome on in. Happy Thursday. Good to have you here. What are you working on today? 11 Malcolm. We are creating an alias really quick that I'm going to save because I really like this alias into my terminal. So that way I can now easily Google from my terminal window and not have to pop open another browser. It just does that for me directly in my terminal window which is pretty nice. I think it's pretty nice.
Here by I'm I'm going to do something off screen real quick. Zshrc I'm not going to share. All right, here we go. We got an alias.
Okay, I can I can show you guys.
This is what I'm doing right here. All right, so paste. We added a new alias.
Google search right there. Right there.
Google search right there. Close. Offscreen editing.
Perfect. Oh, nice. No plugin needed.
Stephen the rescue. Hey, Oxno. That's because of you. You inspired it. You inspired it. Thank you. Thank you very much. Happy Thursday. All right.
Why would you spend two seconds opening a browser tab when you can automate it in 30 days? Average engineer, right? I know, right?
We can automate it. We We can save We can save time. It'll be better. It'll be better this way. It'll be better. I promise.
That's pretty funny.
Oh, hey.
I didn't see you. Do we read Twitch comments also or just YouTube ones? Uh, Silver Ray, I just saw your message over there. It was hiding. It was hiding in the corner. Uh, if I see it, I'll say it.
Good. Good to see you. Okay.
All right. So, where are we at now?
Where are we at here? Let me uh get back to our our business over here. Get all the things in the right spot. Perfect.
Okay. Good. Good. Good.
All good. Here. All right. Good to hear it on YouTube. Yeah. Ray Silvers. Sounds good to me. Look more energetic today.
Peace, Lord. We're having a good day. We had a uh a lot of good healthy food and we're also doing things that we love doing, which is softer computer stuffy things, right? I love it.
I love it. Oxnal, thanks. Yeah, it's because of you. All right, so hey, I love Liberty Justice. Thank you for the high five. Appreciate it. That was What' you say? Oh, you said what? Uh, high five. High five. Thank you. My gratitudes to you. That's very nice of you. You're I love Liberty Justice. I've been thinking about you. I have off stream. I'm like that. I love Liberty Justice. Always sending over the wonderful the wonderful super chats.
Likewise, man. First time watching the stream, not clipped videos. Hey. All right. Good to have you here. Thank you.
Yep. We're trying to get our our Twitch up here. So, we got our Twitch up and running because we get started late on Twitch. We got started late on Twitch.
Do you have a system SOC team? Let's see. SOC team or incident response in your Yes, we do. Yeah, we call them the We call them Well, they're SRRES is what we call them. Call them SRRES. Site reliability engineering team. S team. That's what we do.
Broccoli. Peace. Lord. Exactly.
Broccoli. Yeah. Have you guys eaten your broccoli today? I have. I've eaten mine.
Google. Let's see. Broccoli.
Hey, there it shows. See how fast that is? It's instantaneous.
It's instant.
That's why I like it. That's why I like it.
All right. All right. Let's get back to it. So, where did we leave off? So, we're doing We're doing the self attention.
Yeah, nice. Watching this while applying to jobs on LinkedIn. It's horrible. Oh, race slers. You know what? That is a pretty tough situation. It is. Peace, Lord. Nice. Good to hear it. Okay, let's get back into our coding, you guys. Now that we've got our new fancy Google CLI alias, thank you for the hundreds.
Appreciate it.
All right, we need to do a new poll. Let's see here. All right, let's pull the world.
Let's pull the world. Broccoli or uh animal product. Wait, no. That That's too That's too conceited. I can't do that to you guys. I can't do that to you guys. Are you being eating healthy? No, I'm not going to ask that question. I'm not going to do it. That was mean. I shouldn't do that. I'm sorry.
All right, let's get back to our coding.
is going to be like, "Are you guys eating properly?" No, don't worry about it. It's fine. You do whatever you want.
It's It's whatever you want to do. All right, let's look at Let's look at the poll really quick. I'm kind of curious to see what the actual results of the poll are. So, most of y'all use Windows, some of y'all use Linux, and then Mac. Wow, more Linux than Mac.
Impressive. Other other basically can go into like the the Unix Linux type family. So, if you combine all these up, it's just a little bit more on the alternative operating system, right? 3% extra on the alternative operating systems. That's impressive. I was not expecting that.
Hey, how's it going there? Uh, Baron, wait.
Barrento, thank you for subscribing.
Appreciate it. Welcome on in. You join the right channel for software engineering. Today, we're building chat GPT Linux equals architecture. Uh or the ark as in Omar. Omar dogu. The ark. The ark. Nice. Different names for the cyber security team. Yes, exactly.
You got it, Axel. I mean, I will take meat over broccoli any day, anytime, anywhere. Oxnal. I figured as much. I figured. Hey, Peter Parker. Good to see you. Welcome on in. Happy Thursday. Good to have you here. Did we do this correctly, you guys? the desktop. I the transformer. Okay. Did we make it work?
Python transformer. Nope. No. No. Nope.
Python transformer. All right.
So far so good. Did we get our criterion?
Did we do the criterion? No, we didn't.
Okay. Let's do print loss.
Here we go, guys. Moment of truth. Wait, wait, wait, wait, wait. Uh, it's got to be targets.
Targets.
There we go. Okay.
Is it going to work? Hey, loons. By the way, I'm back. All right. Good to have you back, Lons.
When someone tries to roast me by saying uh you suck, you all I always tell them, tell me what I did wrong so I can be better person. Oh, that's a good that's a good approach. That's a good approach.
You're doing it right. You're doing it right. Look for that positivity. I recommend it. Baronto definitely sounds like a dro name. Oh, it kind of does.
Like a Linux dro. It sure does.
Thanks. Sorry for English levels intermediate. Oh, good. Omar. Oh, good.
Just happy you're here. You just learned how to see or how to use the visual line feature in Vim yesterday. Oh, yes.
Visual line. Isn't that great? Now that you got visual line, you want to do visual block, right? Visual block. Can do visual block. I like that one. That's a good one. All right. You got to do a little bit of visual blocking right there. Just a smidgen of visual block.
Got visual line. And you've got visual mode, which is line, which is like string mode.
Three visual modes. Line and block. and whatever this is just regular visual mode. Wait, what's that? How I know right buni? Isn't that great? It's pretty cool. It's so shift V. So you see on my screen here, you can see the keyboard at the very bottom at the bottom of the screen. I'm holding shift and pressing V. Block mode is controlV.
Visual block mode, right?VRLV is the way to go. There you go. Shift control I. Shift control I.
I just That doesn't do anything for me.
Uh yeah. Yeah, you guys. That's a good one. V to get visual block mode. I like it. I like visual block mode. It's one of my favorites. I you actually visual line mode is the best though by far.
Visual line number one. Hello. I just got back. Good news. We've passed the 50% thesis defense. Oh, wait. Really?
Got busy coating our 3D segmentation using a new new net of the human firmer and wait hold on and proximal tabia. I don't know what that is. That sounds like something biology related, right?
Human biology.
Is that what that is? I don't recognize those words. I know computers. Biology though, it's a different story. I've known visual for quite some time. Yeah.
And then get your visual block mode right there.
Always revert to nano as your default cuz I always forget to handle. Yeah, I I see what you mean there, Oxnal. All good. It's all good. Use what you know.
All right. Moment of truth, you guys.
Will it work? Will our criteria work? I think it might. Oh, no. Exploit voting point type target for class probability.
Oh, right, right, right. Okay. One hot.
All right. So, can we say d type equals we say torch.flat.
Can we do that?
No. Doesn't like d type. All right.
We've got a solution for that. Two.
See if that works there. Yeah. Wait.
Ignore index is not supported for floating point target. No. No. Really?
Oh, okay.
That's unfair. That's unfair.
Stephen, why don't you use Visual Studio Code? Because I like myself, my Vim editor, and it makes me happy. You know what? I've got your answer. Use the IDE that makes you happy. There you go.
Easy, easy answer. Basically, you created a model with the new net model.
Then it segments the CT image. So you create like blocks. You take the large image and you chunk the block. You chunk it and then you convert it into a 3D for visualization.
Oh, got it. Okay. So are you making 3D models using this approach? 2D images into 3D 3D models. Is that what you're doing?
What's the thing you do when you select multiple lines and you prepend all of them at once?
Oh, right. Yeah. Okay. So, this is what I do. I don't know if there's other ways to do it. This is how I do it. So, if I want to comment a bunch of lines out in Bim.
Check this out, you guys. Wait, who is that? Who's that? Uh, Mong.
Hey, Mong. Thank you for joining.
Appreciate it. Thank you for your subscription. Welcome on in. Happy Thursday. you are joined the right channel for software engineering. We're about to share a trick in Vim if you use the Vim text editor. Thank for the party poppers. Check this out you guys. All right, so hold hold shift to plus V to get visual line mode and then I press col/arro slash uh octothorpg.
Done.
Is that how you do multiline comment?
That's how I do it.
Uh, did you did you catch that?
Hey, how's it going there? Unknown.
Unknown. Hey, Unknownverse. Thank you for clicking the hi button. Good to have you here. Happy Thursday. Think I saw you do that once for a second. You did, Bonzupi. You did. You also use Flask.
Hey, for backend. Flask or 3JS for the front end? Yes. JavaScript for it uses WebDL. It uses WebGL. It's a nice little higher level programming interface for WebGL. It's pretty great.
It's been around for decades.
Our thesis advisor asked me to convert everything to Rust. Oh, what? Really? No way. That's actually really powerful.
Rust is great. Although for a project, I don't, you know, I don't know if you necessarily need to use Rust for a project. Will AI bubble burst anytime soon? I think the AI bubble will just keep on like like flapping around. It's going to flap.
It's going to flap around. You're you're using it. I'm using it every day.
Everyone on the planet who is doing productive desk work is using it for the most part, right? So, it's it's here to No, it's here to stay. It is it's just a tool. Think of it as a tool. It's like a really good tool. It's not going to replace anyone. It's going to increase productivity and the number of jobs.
It's just there's a little bit of turbulence right now.
Oh, one of your thesis members also joined the Discord group, too. Hey, that sounds great. Fantastic.
What is just the regular said syntax?
Oh, right. It's pretty close. It's pretty close. It's similar to said reg x. Wait, what? You commented out that with macro you can just use GC after selecting something. GC.
Wait, I haven't done that. You said GC.
Wait, what is this? So, highlight and you say GC. That doesn't do anything for me. GC doesn't do anything. I'm pressing it right now. It's pressing nothing. You can see it on my screen doing nothing.
Yeah. So, what I do is I highlight the code that I want. I press colon s/c carrot/ octaport slower slashg. That's what I do. I know it's weird. Is there a better way? Do you guys know of a better way to comment out lines and oneliner?
You guys know have you ever seen a rocket engine designed by AI and its prototype is running? I've not I've not seen an AI proto I have seen an AI write a schematic for specific components and them being used in say like an airplane and they are improved.
Hi sir unknown uh unknown verse. How's it going there? You want to make a website but I don't know coding only know HTML and CSS basics is can you make it using AI? Yeah you can. It does a really good job. The first prototype.
Oh, wait. Peace lord. I have definitely not seen that. I haven't. Has someone done it yet? Has someone done this?
Boom. Comment. Easy.
Okay. So, let's continue on downward.
We've got a problem here.
It doesn't it So, does our output our outputs float? And it needs to stay float. Our targets are float, but it doesn't support padding indexes, which is annoying.
So, I have to remove this sadly. I'm sorry, you guys. I'm sorry. I know it's sad for it to go. I know we're all feeling sad about it. We had We can't ignore the index zero right now, which is very sad. Don't worry. We're going to put it in a a little a little place right here for now to restore this. We will. So I need to say res restore ignore index, right? We got to restore it.
Do you see the discord in link share?
Oh, wait. Oh, hey Kyle. Good to see you.
Welcome on in. Happy Thursday. Large language models are the tool. Consistent and efficient workflows are the bottleneck. Currently, they are. You got it. these large English models, they can achieve what we want them to achieve as long as you get them in the right spot, right? Thank you for all the reactions.
Appreciate it you guys. You think your thesis member asked in Discord for the other project about the distill classifier for our natural language processing project? Hey, there we go.
Distbert you can do that. So yes, NLP you want to use embedding model and it's really great because you don't need to train anything. It's just a classifier at that point. It works so good. That's the power of embedding. Everybody's sleeping on embedding models. They think, "Oh, it's all for vector and retrieval augmented generation, right? With the AI, we're going to rag. We got to rag it all." No, it's way better than that. It's way better.
You use lazy vim, but it's a package called comment inbim. Okay, got it.
Okay.
Got it. Got it. Got it.
We'll check it out. Thanks. All right.
Link share.
Oh, Lun. Um, yeah. What's going on here?
It looks like you've installed an operating system that has a default language set to some sort of Mandarin.
Hey, SSH3 Arlock. Good to see you. Hey, bro. What are you doing? Oh, we How are mate? We're doing great. Thank you.
We're building ourselves our transformer. Have you used chat GPT?
We're building one. Yes, we are. We are.
We're building it. Small distributed language models future. Yeah, I think so. Because you can use them for very specific tasks. And if you get them really good at a very specific task, you can use energy efficient, highly accurate AI.
Doesn't that sound better? Yeah.
Oh before that we'll have to check it out. Sir is there any tool like in N8 which to help make AI workflow? There is lang chain, right? You could do that or you could do, you know, just Python script, right? You could do that, too.
That's what I do. I do Python script.
Just easy Python script.
That'll work. All right. So, before that message. Okay. Uh oh. Hey. Wouldn't give this to my worst enemy. What do you got here? Low-level programming sandbox inspired by early game consoles. In this fantasy console, you program an assembly language. Oh no. With full control over execution, rendering your games and other creations to a retro monochrome screen.
Steam. It's a Steam game, you guys.
No way. It's a programming game.
Oh, that's really cool.
Ne, how do you say that? Neimonov.
Neimonyimov.
Neimonyimov. Okay, pretty nice. So, you can learn yourself some programming in assembly. Hey, there we go. Hey, Nea, welcome on in. Good to see you. Oh, Nea, you might have an answer for me. Although, this will just be off the top of your head. I don't know. It's a very silly. It's very silly. So, we've got a fun challenge right here. I'm using cross entropy loss that ignores the padding index. However, we got this error that says that when I'm using a float on my logit on my output from my criterion that I cannot use uh integers. I have to use floats which makes sense because I'm comparing floats. However, when I get my output, it's going to be in floats. It's just going to be floats.
That's what it's going to be. A whole bunch of floats.
So for now I just disabled the index ignoring for the loss and I it's probably fine.
However, my question is if we have this here. So here's our criterion, right? Our criterion is a cross entropy loss. I would like to keep ignore index.
However, I'm sure there's a way to do it. I just haven't really thought of it.
Which software do you use for employee productivity management? Jira out. We use We use Atlassian tools. No, it's definitely not fine to disable it. It's not fine. It's not. Well, we Okay. I'm pretty sure it'll still work though. AI very flexible.
Very flexible. It'll work. Uh though it's not fine. I I get it. All right.
So, we we want to bring it back.
However, if we do, it's going to say nope.
will not likey this Google dude. Oh, wait. Oh, Google this. I suppose we could obviously we could search around for this. We could.
I mean, the the whole point though is that we learn it. We don't want the easy answer. We want the hard answer. Test results are in the most recent patch.
Oh, the subsystem definitely broke. Oh, okay. Got it. Got it. Got it. Got it.
Got it. I see what you're talking about there. Thank you for all the likes, you guys. I appreciate it.
You also need a proper mask on attention layer. Oh, good news. We did. We did. We got the mask. We add the mask. We added it. We got the mask right there. We did it on both the question and the answer.
I I know we probably only need it on the answer. We just put it on the question as well.
SSH, how's it going? Hey, do you prefer Python JavaScript over the C? I like all three of them as the first language. Oh, I would prefer Python first because C is going to give you a whole bunch of roadblocks with the syntax and it's also doesn't have any built-in functions and so you have to build everything yourself. I'd say Python is great as an introduction to programming because you're going to get some of the foundational basics of variables, flow control, loops, if statements, those sorts of things, right? Things that make a programming language, what do you call that? uh a programming language touring complete.
Right? So if you get those foundational knowledges, then you can jump down to C where you get closer to the metal. It's going to require you to learn more of data structures and algorithms cuz you have to build everything yourself. You got to build it all. Why is your learning rate at uh instead of 0.1? I don't know. Let's change it. LR equals there. Boom. How about that? All right.
Now it's at 1%. Do you like that? Is that better? Is that better? Oh, Trello.
Oh, yeah. I know. I know Trello. We use I think Trello was acquired by Atlassian, right?
Hey, we we fixed it. We fixed it. All right, you guys getting caught up on the chats here. Google stepping on your toes over here. Uh, is it the ignore index is the one hot of what to ignore.
So, you can also just implement manually. Oh, it's a negative log. Okay.
Okay.
No, no. This dude from Google is stepping on your toes. All right. Bonsy B. Right. Right. Right. I've been learning a lot about Transformers the last few days. Thanks to Steven stream.
Hey, great. Been very informative. I appreciate it, Kyle. Thank you so much.
There's It's They sound complex and there are some nuances that are weird.
Don't get me wrong. They're like, "Why is that happening over here? Why why are we doing it like that? The good news is we're figuring it out. Messing up your work and breaking user space up on Zupi.
So just do negative log P correct sum.
Oh, okay. All right. All right. Okay.
Nea, you asked. We shall do it. Also, I see Trello is owned by Atlassian. Hey, they did acquire it. All right. Trello is now Atlassian tool. Also, I have a tip. You can also make your learning rate increase every epoch and then you can stop training on the epoch where it doesn't improve anymore. Hey, variable learning rate, right? You can get some momentum in there with your optimizer.
We can add some momentum.
Correct. Would not make masks. So, it ignores masks easily. Okay.
All right. Thank you very much, Neva.
Appreciate it. So let's change our criterion.
How do we do this? All right. So if we're going to do negative log P correct the correct answers and then wait correct sum.
Okay. So this would ignore the zeros.
Oh this is going to snag. So we take our output which is P. Is P the output?
Is P the output? So we do torch torch.log output.
We say negative. And then we do our targets like that.
Something like that.
Uh there is a lot of devs having actual freakouts over AI. uh they are because it's a big change to their daily cycles, right? It's a big change and like I used to code and I used to have this passion for my craftsmanship, right? It was so great. I have code comments. I have my classical architecture and I've got my design patterns that make my code look beautiful. You don't need that anymore.
The AI writes it for you now. So code craftsmanship is no longer a discipline.
Stephen seems to be the only engineer online taking it optimistically. Yeah, it because it makes it makes my day better. I can do a lot more. I can do so much more. I don't really have to care.
The sad thing is I do kind of miss code craftsmanship. The good news is I'm getting my my my appetite satisfied by live stream coding with you guys.
Most of them have courses and content.
They are losing money. Oh, right.
There are content creators who are teaching programming and now the world doesn't need that as much because AI can write it all for them. I know that kind of counts as us here, doesn't it? It does. I mean, there are still people who want to learn, I think. Right. Maybe I want to learn. I want to learn.
Oh, Muhammad. Hey, how's going over there? You're a Java developer or overwhelming by the data structures and algorithms I love building, but my job requires DSA. Not many people can say that. Most people just say they import libraries and their day is over. Yeah.
Use this library in your code. Boom.
Done. Easy. You just made an executive decision to use a library. DSA in your day-to-day. That's pretty special.
What's your opinion? Thank you very much for your time. Yeah, I would say you're lucky because a lot of people would really like that job to use DSA in their day-to-day.
Yes, Stephen. Now you just need to sum it or mean it some.
We do some sum there. All right, let's see what it looks like. Yeah. Okay, so I want to say my loss equals let's print our loss. All right, Nea.
All right, here we go. Here we go.
All right. Tensor used as MC's must be long in type or bite. Oh, they aren't.
Oh, right. All right. We We can fix that. We can fix it. One hot.
We just get rid of the this here.
Boom. Easy. Boom. Hey, I got an an uh-oh. Uh-oh. Neva, I got a not a number. I got a not a number. Mean probably more correct. Okay, I'll try the mean. I'll try the average mean squared error, right? But you're doing a negative log mean code craftsmanship is still important even though AI makes automatically it can can't beat the creativity of the human mind.
Right now it is. You're right. The human mind can get real crazy. It can get real crazy.
So, for the mask tokens, you need to ignore them.
And we do that.
I've got the mask. However, Oh, wait.
The mask tokens. Okay. So, you mean the pad tokens? The padding.
So, I already set my pad tokens to zero.
Uh, so it'll be the first element. So, we need to eliminate any pad tokens is what you're saying.
One hot is likely wrong. Also, you should be able to index it by an integer.
Okay. Well, I'm getting the the if there's another approach, let me know. I'm I actually I'm not familiar.
So, let me see. All right. So, let's do a mean here. Average. All right. See output. It's still a problem.
So, we've got not a number still.
So, what if we got rid of the negative here? All right, let's try that.
Not still a problem. Okay.
Torch log outputs. H.
All right. What if we just did the here? Let's just see if let's see see if the log is causing an issue.
Oh, look at that.
Oh, I see.
Aha.
Okay. Okay.
That's bad. Pad token should not be zero.
Why? Pad token should be totally you can put them wherever you want them to be.
Right.
If your pad is zero, then your max token index is one over the largest index of the vector.
So here, this is my these are my special tokens.
Pad, start, end, and unknown. Oh, Stephen, you didn't softmax. Oh, well, I sure didn't.
I sure didn't.
Let's soft max it. All right. All right.
We'll do ourselves some soft maxing. You You noticed. You noticed. We didn't softax. self do.Soft equals torch softmax. Can we do that? Is that right?
Python import torch torch soft max. Is that thing? No. Maybe it's one word. Not like that. Okay. Is it torchin?
There's a rand. There it is. Right there. Okay.
There's our soft max. Now we will say out equals self.soft Soft out. Done.
Soft maxed complete. We added it, Neba.
There we go. Oh, much better. Hey, thank you. Thank you very much. Do you stream all day? Hey, Gooseman. How's it going there, Gooseman? Good to see you.
Happy Thursday. I stream only for a little while each day because I have work that I got to do on a regular basis. How many hours? We still have time, though. We're still going to hang out here for a little while. If you want to avoid all the softmax log, you can use a dedicated function to avoid all that. But you know what? I think we're okay. I think we're okay. I think we're doing good now. So, let's see if we can do the rest of this. Here we go. See if we got it good. Got it good. Hey, look it right there. We got ourselves some loss. We got ourselves a little bit of loss right there. Does it need to be negative, though?
Does it need to be negative?
I don't. Oh, we we do. We have to negate it. There we go. There we go. So that way. Ah, much better. Okay.
You want to avoid softmech log so you can Okay. You'll hang out till I fall asleep. All right. Sounds good. Hey, good news. We're We're here for a little while, so you have plenty of time. 3:00 a.m. in the Philippines. Oh, wow. That is very late. Are you doing the hackathon? Oh yeah, Neva, are you doing the hackathon? We got our hackathon, you guys. Hey, look at here. For those of you who are here, we join our Discord.
We've got a hackathon occurring. Just go to Discordsai.com, follow the online instructions, build an agent, and then submit it to the Discord channel and you get entered to win.
This is how you do cross entropy. Is it really? Is it really that simple?
Is it really? Is that really how simple it is? Nea, you made my day. Oh, that's so great, Nea. Thank you. Oh, that is very satisfying. Keep coming. It's so good, Nea. Thank you so much. All right, just what you're doing is fantastic. I love it. I love it. I appreciate so much. Thank you.
So, I have a good idea for it. I'm not a good project manager. This is a PM thing. Okay. All right. Good to hear.
Good to hear you guys. Build your agent.
Build your agent. Submit it. And you get to enter into the hackathon for prizes.
This is so cool. This is so cool right here. It's so good.
All right. Now we can start optimizing our model. You guys ready? Cross domain synthesis. Wait, wait, wait. What is What is that? I I don't You mean turning from one discipline to another discipline?
Oh, what is that?
Google it. Okay, we're googling. Hey, we can use our new Google thing, right? All right. Google crossdomain synthesis.
All right. the process of generating new data, knowledge or solutions by bridging the gap between different fields or data environments. I want an example. Give me an example.
What is an example of that? 0.
Okay. Utilize across artificial intelligence, cyber security, and strategic problem solving to adapt, augment, and transfer insights from a source to a target. Well, I mean, that's very abstract. I need more. I need an example. Simple example.
voluntarily subscribed to the newsletter. Uh voluntarily.
Uh yes, I am voluntarily subscribed to one newsletter. It's called the the hacker newsletter.
However, I've been ignoring it recently.
That's a good that's a good poll, Lon.
That's a good poll. See, are you V? Wait, how do you spell that?
Let's see. Fuel unarily subscribed to a newsletter.
Yes or no? All right, let's find out.
Good poll. Good poll, loons. Good poll.
How do you balance your time with work and study? Can't do that. Oh, it's pretty tough. I pretty much plan play all day and code a bit. Oh, well, what I do is I do all of my prep on the weekends for the most part, like food prep, chores, and everything like that.
I do all that work on the weekend. And then I also stream on the weekends as well now. So, I've added that into my schedule. And then I just play a little bit less Minecraft.
That's what I do.
Leonardo da Vinci studied autonomy for drawing. Let's see here.
Where was that? A real world example of cross domain synthesis is using autonomous car technology to improve medical imaging. Oh, okay. Well, self-driving cars, hospital radiology. I don't know how that would possibly work.
Self-driving cars use computer vision algorithm to detect moving pedestrians lane markers in real time. Scientists transferred this exact algorithm structures to the medical field. Instead of looking for pedestrians on a street, the AI now scans MRIs to identify moving and growing tumors. Oh, hey. All right.
Got it. All right, Kyle. Thank you. I understand it now. Advances in neurology make AI better. Hey. Yeah, there we go.
Neural network was used modeling how the eyes work. Nice. Hey, Billy Eusic. Hey, how's it going? Welcome on in. Happy Thursday. What's the new Grock build best for? I don't know. I don't know.
Does anyone know?
I've never heard about the cross domain synthesis. Same here. I mean, I I feel like I've run into the concept before in AI specifically. Can we do Jeepa next time using PyTorch framework? Wait, what is Ja? Jeepa PyTorch. What is that? What is Jeepa?
VJ Jeepa method for self-supervised learning.
That's a good idea. Divine whisperer.
Let's add that. Let's add Jeepa. So, this is Is this VJA?
Is that what that is?
Let's see. Or is this a different kind of Jeepa? Hands-on Jeepa building selfsupervised vision models.
Okay.
Jeppa building self-supervised vision models that work.
Oh, new paper to read. Hey, Yep. new paper. Nice. All right, I'll add it to our our ideas or stream ideas. Here we go. New post. Here we go. It's called VJ Jeepa.
And that is There we go. And that would be Python and AI post. There we go.
There we go. Hold on one second. I heard my calendar, you guys. I heard my calendar. I have to check.
Uh, okay.
All right.
I might have to leave in 10 minutes because I've got a meeting that just suddenly magically appeared out of nowhere. Okay.
Divine Whisper is the Jeppa language.
Oh, then translate to there are too funny. Uh, wait, their translations are too funny. Are they? Wait, you love the ch Oh, really? The ch Yeah, Mandarin, you guys. Oops. I accidentally clicked on the wrong thing in the poll. I meant to click yes. Oh, Bonzupi. All right, no worries. No worries.
Most of their explainer articles remove details are sometimes just wrong. Okay.
Look for it on archive. Always best to implement the papers directly. Okay, Jeepa archive.
Let's see if we can get the archive.
Here we go. Found it.
Thank you, Nea. Hey, J bro. Good to see you. Welcome on in.
Jea, is there the Leon's baby? Oh, Lun's baby. Okay, got it. Is there a way to add a comment? Jeepa, here we go. All right, there it is.
Okay, future stream idea recorded.
Thank you very much, you guys. All right, so let's see if we can do some optimization. Yeah, loss.backward.
Is it backwards or back? I think it's just backward. One word.
Hi all. Hey J, bro. Good to have you here.
And a lot of good papers are just readable as is. Yeah. All right.
Yane uh Yan Lacunin is a scientist.
Lacun Lun. Got it. Got it. Got it. I figured as much. I figured I figured as much.
Is that Google Collab thing good? I tried building an agent in it. I think you burnt something. Oh, did you J bro?
It works pretty well. It's free. It's free and they give you hardware compute.
Stephen, that one hot on target is wrong. Is it really?
What What would it be?
I I understand that it's wrong. I don't know what else to make it though cuz it's I'm looking for the token output, right?
So, I got my logits. I did my transform from my uh logit logits, right? Or logits, however you say. I converted them to logats to target the output of token that I'm searching for. Then I do an argmax to find the right token. And since yes, for sure 100% just training one is fine.
I don't know about that cuz my training data is a bunch of words.
So I don't think that'll work very nicely.
Hi from India. Hey, how's it going?
Welcome on in Dpack.
Raj Dpack, thank you for joining in here, guys.
So, yeah. So, that is a bunch of number.
That's a bunch of words, right? It's a whole bunch of words.
It's pronounced the same as the word logic. Just replace the C with a at with a t. Oh, log log it. Oh, log it. Yeah. See, I thought it was logit. I thought it was logit. I was looking that up earlier today. We were looking that up and I thought it was logit. Like the word log, like you're logging data to a disk or to a, you know, uh, some sort of central logging repository. You're logging the data. It's a logit. All right. All right. Bonzupi, I'm going to revert back to log it cuz that's what I thought it was.
Rad jeep. Oh, Rajep. Oh, Rajep. Okay.
Kundar. Kundo. Oh, Kundu. Kundu. Kundu.
Okay.
The index of the proper token to output as an in as an integer, not as a one hot.
I suppose we could do that as well, which is perfectly fine. I could do that. I could do that, too. All right.
All right. So, let me instead of doing one hot target, all we have to do is tokenize it.
We do dictionary tokenize.
There you go. All right. Are you sure, Nea? Are you sure? Are you sure? Before we do this. Okay.
All right. We're going to run it. Okay.
Here's the output. That's what we're looking for right there.
The proper input to a one hot would be your correct input.
Would be your correct input. Yes.
Yes. That's what That's what that is.
Yeah. So, so I've got my the actual answers, right? So, here uh and my output my model automatically converts words into tokens and then it runs them through an embedding, right? So, it normalizes. So, the model's forward function automatically does this. So, it tokenizes and then it runs it through the embedding which converts them into indexes, right?
So that part's done there and we can tokenize again. However, the output is here. So comparing this target to this output feels weird, right?
So that's why I did a one hot cuz this is we want to select the probability. Yes. Yes, we do.
Uh uh. Okay. Oh, I see what you mean.
Okay. All right. All right. We'll try it out. We'll try it out. Let's see if that works. Let's Let's find out. Let's find out. Loss.
Here we go. And then we're going to print the loss. All right. Let's try index out of bounds for dimension zero with size seven. I was wondering. See, that's what that's I I wanted. We needed to select the correct the the law. We we can select it through argmax. I get the right token on the Oh, we should probably arg good though. Do we still want to do it that way? You can also one you can also one hot zero all the elements that are pad and then multiply by the vector to get as input.
Okay.
All right. I'll have to think about that. I'll have to think about that.
You're stringing the pot now. Wait, Bonzupi guys, if it's GIF or GIF. Oh, yeah. You're stir You're stirring the pot. You're stirring the pot. GIF or GIF, right? GIF or GIF.
I also like another way is the target is a one hot targets pad token equals zero.
M Oh, okay. Got it. All right. Yeah, I need I need to think about that. I'll need to think about it. So, I do see a possibility here. I could grab the ARG max and then compare the tokens directly and use that as a loss.
Froms times targets.
Oo, I'm going to copy that. All right, so I I Give me one second here. Give me one second.
Um, a a magic meeting just showed up out of nowhere on my calendar. So, I might have to go to that here in a second. So, give me give me one second. All right, I'll say over one moment. All right. Uno momento.
All right.
Uh, that won't work either, will it? So, who who who did this? Who did it?
Uh, interesting.
Oh, that's what I thought. That's what I thought. Okay.
Argmax is wrong for several reasons. The major one is it's not differentiable.
So, I wouldn't have no grads.
Uh, okay. All right. Uh, it sounds good to me.
So then, well, as you can see, what's clearly wrong right here? Right. So, I've got my tokens and then I've got my softmaxed Uh, logits right here. My softmax logits. This could have been an email, Stephen, 10 minutes later. I know, right? It could have been an email.
It could have been.
It could have been an email, you guys.
All right, let's keep going. Let's keep going.
Okay. So the reason why So the reason why I did the one hot was to be able to say what the target output is trying to achieve to predict the next token.
Now I'm sure there's a lot of different ways to do it. However, very obviously this won't work because it's not going to be selecting the correct value.
Meanings can be productive when you have a great communication style. All right.
It could be, which is what Steven said.
Makes you uh Yeah. makes you hireable.
That is a very good That's a good point.
You Exactly. It's a very good point.
Hey, Brandon, welcome on in. Good to see you. Your stream on your phone died. Oh, BT. What's BT?
Wait, what's BT? Hold on. I feel like I should know what that means. You should be able to make your targets that are batch target. Oh.
Oh, it's the Okay. So then I just dot Wait, does it unsqueeze or squeeze?
Unsqueeze. No, it's squeeze. It's squeeze. S q U E E Z E. Can we say zero?
Can we do that?
Dim zero.
No. Okay. So 7 by4 versus H. Oh, wait. DIM one. That's what I want. Let's try that.
Okay, let me see this real quick. Let me do targets dots squeeze sq e dim equals 1. We'll do that there instead.
Okay.
Uh then use the target to index the probabilities that are batch token W where W is the word token.
I know if we watch Steven stream I get smarter. Hey Brandon, that's nice for you to say. Thank you. You can tell when my kernel is almost done compiling by the sound of the CPU fans. Yeah, it gets toasty, right? It gets really toasty.
You compile this thing like 30 300 times in a week. That's a lot of compilation.
Your you CPU is getting all the work out right now. What's the shape of target before you do anything? And the input, what's the shape? Okay, so here's the here's the output, right? Which we're going to compare the loss to 7x4.
And the target we will get the shape of here in a second. Uh it's basically 1 by 7. It's 1x 7.
So see yeah this is output.
So we'll see we'll do this really quick.
So we'll say output shape.
There we go.
Output shape.
I think you have a bug around the size of the output layer. Like basically some offset need to be added for to sub the from the target. Okay. See yeah it's I got you. I got you.
When do I I I see I see what you mean.
Yeah it makes sense. Hey Stephen, do you manage to choose your current career?
How did I manage? I wanted to be a game developer. I started making games and then I found out that writing enterprise software was immediately more valuable because you get you get revenue immediately.
You've been incisive for months between cyber security and backend development devops. So I would say cyber security is going to be a really critical item and that role is very necessary still backend and devops uh I would choose which one's going to make you happy that's actually one of the good which regardless of what the role is which one's going to make you more happy I think that's a important one which one do you want to do when I do something on your laptop it burns you oh do you hold your you put your laptop on your lap Yes, you have a bug in your data preparation. Stephen, you yeah are defining getting enough target tokens. I think my my my target is looking good.
So I I like I like my So here here's my tokens, right? My to my dictionary.
Let me let me uh hide this stuff for a second. Let's hide this. Okay. And then I will print to you Let's go to our model forward.
Okay.
So, here's the features and labels, right? So, I'll print to you the labels.
So, I will say print the labels.
So, this is the labels. Labels.
I think this is correct. I feel like this part is correct. So, here. So, here's my labels. Looks fine to me.
Let's hit the embedding. We got seven, right? Easy, easy tensor, right?
So, I've got basically a sentence and a bunch of tokens. Every word gets its own line on the embedding, right? So, that part I'm pretty confident about. That part makes sense.
You're not in microwave trouble again.
What's beeping? I don't know. What is beeping? Yeah, I love three so much yet I'm still indecisive. Senko. Ah, that's tough. That's tough, right? Because you got three good options. Well, here's the deal. Check this out. You don't always have to stick with one. You could try the one that you think is the best first and then continue down that path. And if you want to change your mind later, it might not be as bad as you think to change it later.
Bespoke AI. Yeah, we got these AIs. You never know. You never know. Like if you get the shape you have, the outputs are batch token and weights and targets that are tokens instead of BT. Well, yeah.
Right, right, right, right. Um, oh, do we need to on our t here? I've never done this before. However, Nea, would we be able to convert our our would we run it through embedding here? All right.
So, maybe this is the answer. I don't know if that would work, though. I don't see how that would work, though. I think we could try it. Basically, we're going to make them the same shape. We'll make them the same shape. Want me to set my clock? Buy the Panasonic inverter. All right, enough about boring microwave ovens. Oh, wait. Panasonic.
That sounds like a good one. Is it cold out there? You're always in a jacket.
Oh, yeah. I always have a hoodie on.
Yep. Oh, he's got the hoodie on. Senko, you got it.
It's not cold. It's actually really nice. I just It's comfortable.
Yeah, it's pretty easy to change career later, I think. So, Senko, I'm not sure if this word is vectoring. Oh, yeah.
Vectorizing. Yeah, we're vectoring. Yep, we are. That's what we're doing. Every bat should have a stream of tokens. Yes.
So, we got that.
I like seam only take part of the need targets. The thing is when uh for the embedding can I use the embedding to encode it feels it feels wrong to use the embedding for the targets as well right for the actual answers. Do we want to I think that would work. It just feels feels weird because you're training the embedding.
So you want the exact answers, right? Don't you want to target towards the one?
You should contribute to open source via comments. Yeah. Oh, yes. That's the way to do it. You got it. Basically, targets need to be 2D and actual 2D. Okay. All right. Sounds good to me.
Who really care about comment PRs? Oh, it's just a way. Not not they don't.
See, the good news is they see that all you're doing is adding documentation to the code. Well, it really depends though. Really, what you should be doing is building docs for the code or some way around that, right? Maybe if you find a line of code that might be helpful. It's just it's just a soft way to get into being a contributor to open source. It's not always going to work.
You're hiring out your company. Yes, Ray Silvers. Yes, we are. What's the company's name again? It's called PubNub. PUB and UB.
There you go. And we've got ourselves the answer right here. We are hiring for two specific engineering roles right now. Senior software engineering, artificial intelligence, large language model systems, and a senior solutions architect.
Hey, that Vasper, welcome on in. Good to see you. Good to have you here. Happy Thursday.
targets before the one hot needs to be 2D.
Got it. Okay. Seems like you have a target before it one hits one D instead of two. So basically the data is corrupt. Corrupt. Well, it's it's the right data for sure. It's just we need to put it in the right shape.
Right. So if our outputs are are logat which are going to be the next token right it's the next token what's the next word basically what's the next word and there probabilities of what the next word are right you see here on the screen we've got the probabilities of the next word so in this case the next word looks like it's going to be whatever this position is right here like for example maybe oh no it's going to be this one oh and it got it right interesting well actually it says this one uh but it's got it's getting close. So, we just need to match this. And so, my idea was why not tell it what the exact token should be, right?
The exact token. The total size is too small.
Well, is it though? Because look, my my my data size is very small. Look at this. This is my data size right here.
So, I think it looks about right.
There's only like 15 words there.
There's only 15 words. So, there should only be 15 positions. Well, actually, there's 14 words, right? So, there's only 14 positions. So, that looks good to me, right?
Oh, Lun. All right. Good to have you here. Thank you for joining on in. How do you get users for a platform like UGC content? How did the platform take off?
A bit of history lesson, if you may.
Sure, absolutely. I'll tell you. I'll give you the answer.
Can you tell me what the post one hot shape is currently?
Uh the post one hot shape will just be the tokens themselves. Right. So here I will share it to you. Here we go. The tokeniz. Here we go. And the shape.shape.
I so I I'm we I can get it to be whatever like it's not not a challenge here. Hey mom, I'm famous in Blum's code. Hey, yes, Kyle.
Wait. Oh, what happened? That should have worked. Uh why did that not work properly?
Uh tokenize. Oh, wait. Let's close that out real quick. Okay, that shape. Oh, it's not a function.
Here we go. Okay.
So yeah, so it's seven tokens and seven tokens and then we need to convert this.
That's why I one hoted it is so we need to know the exact answer. So if this is the if this is what the model returned its answer as and the target tokens or the actual answers are these tokens here.
See you next stream, Brandon. All right, sounds good. Thank you for joining in.
All right, we got a story to tell you guys. It's not a very long one. So, what happens uh what happens at Discord?
No. So, this is wrong. Post one hot should be 3D.
All right. Here. Get status. Get add here. Get commit am uh partway deciding.
Let's see. part way with loss back prop push. All right, I will send you the link so you can just see the code directly. Uh, and then cuz I don't think we're far off. I think we're like one or two lines from getting it to work.
Okay, let me see here. Story story. Hey, don't go anywhere. We're going to tell you about the UGC discussion here real quick.
All right. Uh, raise silvers, we will get you. So, GitHub GitHub Steven repositories uh transformer.
Here we go.
Copy link.
Easy. Oh, what? I I made a That's weird.
Okay, here you go. All right, Neva, I'll put this on link share. Link share at getting better. There you go. Over there on link share.
I think we're really close. I think we're really close.
Technommancy. Hey, Mike. Hey, Mike.
Mike. I don't know about technommancy.
What is it? What is technommancy?
You have batch dimensions, right? I don't have batch dimensions. I have not done batch dimensions yet. I know that sounds weird. I know it sounds weird. I don't have batch dimensions yet. I think the data propagation on the group. You have the target labels. Okay, I'm here.
No worries. I'm battling job applications on LinkedIn. That's right.
Okay. So, let's jump back to your your request here. How do you get users for platforms that has UGC content? So, what is UGC content first?
So, let me look that up really quick.
Let's see. Google UGC content.
All right. What is that? User generated content is a brand. Oh, got it. Okay.
So, you've got basically a Tik Tok or an Instagram. How do you acquire users for that platform? It's actually pretty not that difficult. So, you launch it in the app store. That's going to get you initial discovery. You launch it in media, right? So, you make news story coverage wherever you can get your your your main story released, right? So, if you if you built a new Tik Tok, right?
You build a brand new Tik Tok and you want to market it to the world. You'll then release it in places where other users are already existing. It's called OPA. It's the number one strategy. You can either pay for OPA or you can create an engaging story that is compelling for someone to author and release on their news outlet. Right? So there's another way. You can also add integrations. So you can integrate with other businesses or companies.
Right? That's another good way to do it.
That's fantastic. That's actually what we did. So, we don't have UTC business.
We have a platform for communication, an API company called PubNub. An API company called PubNub that allows communication for in-app experiences.
Things like chat, user chat, multiplayer games, tele medicine, when you talk to the doctor and patient, on demand delivery when you order food and you see the car on the map approaching you.
That's what we do. We integrate with all the different developer frameworks as part of our OPA approach, other people's audiences.
And we also have something called the loss leader where we give away a free tier that allows anyone to start without a credit card. So free forever, we have a loss leader. So that's another approach. So OPA and loss leaders, loss leaders are a lot of fun. They're my favorite because you get to build something amazing and give it away for free and attract new users. So you might have some sort of advanced AI feature in your app and you don't charge for it, right? Also for UGC businesses in general, most of the users are going to be free and you sell them you you make money through ads, right? So UGC based businesses will be ad driven.
All right, there you go. Let me know if you have any questions on the matter.
Let me scroll up here. Stephen, why do you guys use PubNub? Need ML engineers?
Uh, we've got a lot of data going through our system right now. We need things like NLP and moderation, uh, classification. So, so common standard stuff. Also, we have a code platform that allows you to mutate data through the network called PubNub functions. PubMed functions allow you to intercept messages, do some JavaScript work on them and then let the message pass on to the receiver. This happens in the infrastructure at the edge. So, it's edge computing. We offer this to our customers and they are also asking for AI features that can be added into the mutation and manipulation of those messages. So, that's where that's where we do it if that makes sense.
All right. Missed some of your chats, you guys. Oh, wait. That's so weird. It threw me off so bad. Sorry. Just assumed you did batches. Hey. Oh, good. No worries, Neva. No worries. We We You want Batches are ideal. You want to do batches. I just didn't start with it. We can add it in. I was going to add it in later.
Let's see.
All right. So, raceovers, let me know if that made sense. Good. I'm glad you don't know about it. I like you even better now. Oh, Mike Mike Jr. All right, that's great to hear. Yeah. Yeah, I technommancy. I've not heard about it.
Technommancy.
It's coding rituals into software for all types of purposes. Technommancy.
All right. I'm glad you don't practice it. I don't. I do not practice technommancy.
Kyle is listening. Who is handling your database? We handle our own databases.
We've got a lot of different ones. We've got Postgress, of course. We've got Cassandra, we've got Reddus, we've got uh a couple others I'm thinking of that I can't remember right now. And we run them and operate them in Amazon's infrastructure.
I want to make money through transactions. Hey. Yeah, that was a good one. Transactional revenue. I like that's a good one.
Your company seemed like UGC kind of. I have an event platform that I need users to become hosts in order to attract more users like Meetup hopefully better over time. Yeah, that's a good way to do it too. Raise silvers. You got it. You got it. Marketing without money is hard. It is. It's called growth hacking. It's like called growth hacking. It basically growing without a heavy investment.
Thank you for your viewpoints. You're welcome.
Do you need a dude who knows crypto?
Because Bunzupi might be someone I think I'm accidentally becoming expert in cryptography. Bunzi, that's how it happens. You become an expert. You've become an expert.
Could you guys do edge functions as Python? Oh, we are working on it, NeA.
We are. Python makes more sense for machine learning. We are right now it's JavaScript. We are working on it. We are working on it. It's going to happen. We think this year. We think this year. So, yes, Nea. Exactly.
Just got to put boots on the floor, grab attention little by little. Yep. That's how it works. And then you get the snowball rolling. And it gets a little bit bigger and a little faster and a little bigger and faster eventually.
Probably going to go give some away some of your features for free and call it cost of customer. Yes. Wait. Uh cost of customer acquisition or something like that. Uh customer cost of acquis what is CAC always? It's like cost of acquisition, right? CAC term busyiness.
Uh customer acquisition cost. Ah see right. Cost of customer acquisition is the way I say it. Customer acquisition cost. CAC.
Hey RXT RO. How's it going then? Oh, was it? Uh, Ruxro, hello. I love your informational videos. Hey, great to hear on Instagram. Oh, nice. You come from Instagram. All right. It's great. Just starting to learn Python. Very nice.
Very nice. It's so neat to see other people like coming across from different platforms. Yeah. From Instagram. Hey, how's it going over there? That's awesome. Hey, you're you caught a live stream. This is where the Instagram videos come from is this video.
That's great. That's so great.
Okay, let's see. So, I want to Where are we at? Here. Where are we at? Where did we leave up on this? Here we go. Here we go. Okay.
So, I can tokenize. I'm going to go back to one hot really quick and I'm going to try to test this to see if this works here. Blah blah blah. We're going to get a loss going. We got a loss right here.
Let's print our loss. We're going to do it backward and then we're going to do an optimization step. Oh, how is your subtitle so fast and accurate? Hey, right. Isn't it really good? It's so good. I'm glad you asked. We just happen to have a pinned a pinned tab in our Chrome browser here. If you want, it's free and available right here. It's on our Discord. If you're interested, go under the link share. Scroll up here.
You'll find it. You'll find it right here.
OBS is OBS subtitles for YouTube and Twitch. Free, fast, instantaneous. It's so good. It's good right here. Yep.
Right there. And it's also powered by my communication technology, PubNub. It's powered by and my obviously, as you saw, it's really fast. It's a really fast technology. PubNub right here. See, for example, you can kind of think of as as a chat. We've got speech to text, which goes through your microphone, gets translated to text, transmits over my technology pubnob to this tab that I've got. I've got a I've got a tab over here, right here.
This tab right here, and it then transmits that tab voice into OBS. And I've got an OBS overlay.
That's how it works. And it's free. It's open source available for you.
All right. So, let's get back to I want to see if we can make this work really well. So, I do have a We have 6 minutes left, you guys. So, 6 minutes. Will we make it in 6 minutes? Will we make it in 6 minutes?
There's the loss. All right. Let's do the step. Let's do the step. Optimizer step. Is this right? Is that Yep. Okay.
So, we got our optimizer there. Let's try it. Here we go. Here we go, you guys. This is uh 100.
All right. Oh, wo wo wo. We're printing too much stuff out here. Let's print.
Not that. Not that.
And we only lost. Okay. Here we go. All right. 16 minutes left, guys. I know, right?
Wait. Oh, it's still Wait, did I not do that right? Hold on.
Where'd this go? Where' this go? It should only print the It should only print loss. Oh, hold on. I see. I see. I see. I have another print up up here somewhere. There we go. That's what I was looking for. Okay.
Oh, the loss is going the wrong direction. Uh-oh. Whoops.
All right.
Okay. All right. Well, that definitely didn't work.
Okay. Oh, maybe maybe we need to lower lower the learning rate here. There we go. Let's lower that learning rate a smidgen.
Hey, that's what I'm looking for. All right. All right. All right. Let's get rid of the epoch here. And let's increase the learning rate a smidge.
Okay.
Oh, it's going the wrong direction.
Let's give it a few more epochs.
Let's go. Let's just There. Let's reduce it again. Zero. There we go. Extra zeros. Extra zeros. So, I'm probably going to give away most of my features for free. Okay. Yeah, I read that one already. Uh, let's see here. Would it be great if you ever need a React developer? I'll be on the lookout for a job posting race. All right. We'll let you know. Absolutely. We'll let you know. Okay. So, my model didn't learn. All right. So, we need to make some adjustments. We got to make some adjustments, you guys.
Oh, hey. Kyle just said race overs.
Kyle's interested.
Kyle said maybe, maybe, maybe.
Okay, so that didn't learn. Let's increase it a smidge.
All right.
All right. So, yeah, it's not learning.
It's not not It's not working. Okay. So, we have to come up with a different approach. We got to do something different, you guys. I think that'll be a plan for tomorrow. Oh, tell me more.
Ah, tell me more. Hey, Kyle. Very nice.
Very nice.
What if we don't do this? Yeah, we don't do that. Then it just goes uh in the wrong direction. Okay.
Yeah, that's not good. Oh. Oh, it's not good. Okay.
Okay. Okay. So, what if we did some try that? Oh, wow. Oh, it's Oh, it's not not happy now. It's not happy.
That is not happy there. Okay, let's try this. Okay, nice. Y'all are doing business. You're doing business in the chat. Also using a cursor. Stephen, I see your pink cursor.
Yeah, I do. I've got a cursor. Yeah, this is just my the location of my my cursor. Yeah, you a smear cursor. I don't know what a smear cursor is, Stephen. Some just changes with the scale of the loss.
Yeah, I figured I figured I know. I was I was messing around. I was just I was playing around. I was I didn't actually break it really. That's a good point.
All right, that's a good point. So, the loss could have actually just worked anyway, even though it was very big loss. Let's try it again.
Okay.
Smear cursor is a package for Neovim.
Oh, got it. Okay, so it's going the wrong direction.
Yeah, it's still it's still it's still struggling. Let's try this.
Yeah. Then it fails. It fails.
There we go. There we go.
I want to see what the smear cursor is.
All right. So, Google, you call it smear cursor in BIM.
Here we go. Let's take a look. What does that look like?
Let's play the video.
Oh, hey, that's fancy. Yeah, I don't have that.
I do like it. I think that's pretty fancy. I do not have it.
Is there it? It might look I mean I am I'm moving pretty fast on the screen, so it might look a little smeary, but it's not.
Is it against the rules? Really? Wait.
Uh, wait. Am I bad if it is, Kyle? I don't think so. I don't think there's any bad situations here.
Oh, we do have 1 minute left, you guys.
We got to go. We got to go. All right, everyone. Thank you so much for joining today. Had a lot of fun. We built a little bit more on our model. We We are trying to get the optimizer working. And the good news is we made some progress.
And Nea gave us the lost criterion. So that way we got our model at the beginning getting actually running through the back propagation. The only thing is it's not learning anything. So what we have to do tomorrow is figure out why. Let's figure out why tomorrow.
We'll beginning our training process for our our chat GPT model, right? We're building a transformer. That will be the plan for tomorrow. Then we will get that working. I'll add some uh positional encodings as well. I think that will be the plan. Bun zupy. All right, see you later everybody. See you Kyle.
See you Nea. See you Ray Silvers. Good to have you guys here. Good night, J bro. Thank you everybody. We'll be back tomorrow with more AI, more transformers. We're going to build our GPT, you guys. Bye, man. Have a nice day. You too. Ray Silvers. Oh, Sergio, have a good rest of your day. Bye, everybody. And of course, I love Liberty Justice. If you're still here, thank you. Thank you so much uh for all the donations. Peace out, dude. RXT. All right. Thank you guys. See you tomorrow.
Bye everybody. Have a good rest of your day.
Related Videos
Agentforce NOW AMA: Build with React and Salesforce Multi-Framework
SalesforceDevs
490 views•2026-05-28
How agent o11y differs from traditional o11y — Phil Hetzel, Braintrust
aiDotEngineer
450 views•2026-05-28
WEB TECHNOLOGIES UNIT-2 | Degree 4th sem BCOM Computers web technologies unit-2 full explanation💯✅
LearnwithSahera
1K views•2026-05-29
More tests are always better? How to use AI to identify tests that bring little value
Alliance4Qualification
335 views•2026-05-29
Search Algorithms Explained in 60 Seconds! 🤖💨
samarthtuliofficial
218 views•2026-06-01
People of Game of Thrones using JavaScript DOM
AltCampus
296 views•2026-05-30
Introduction to Problem Solving Part - 1 | Lecture 1 | Intermediate DSA
ascensionix
107 views•2026-05-29
So What's Odin Lang Even Good For
TechOverTea
131 views•2026-06-01











