Reading back over this, I think you have me confused with another commenter. I don't mention anything about IF in the comment you are replying to. Someone else did, though.
Yes, genetic algorithms are something different. They are sometimes used to train or design NN architectures, but not at the scale of modern LLMs.
FYI, you can have all-or-nothing outputs from a perceptron or other network. It all depends on the activation function. Most LLMs don't use that kind of activation function, but it is possible. Have you heard of BitNet? It restricts values in an LLM to just one of three states. It's interesting stuff.
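To make that concrete, here's a minimal sketch of an all-or-nothing (step) activation next to a ternary quantizer in the spirit of BitNet's {-1, 0, 1} values. The threshold and function names here are illustrative, not BitNet's actual quantization scheme:

```python
import numpy as np

def step(x):
    # All-or-nothing: the classic perceptron activation, outputs 0 or 1
    return (np.asarray(x) >= 0).astype(np.float64)

def ternary(x, threshold=0.5):
    # Illustrative ternary quantizer: maps values to {-1, 0, 1}
    # (threshold is made up here; BitNet's real scheme differs)
    x = np.asarray(x)
    return np.sign(x) * (np.abs(x) >= threshold)

# A single perceptron: weighted sum, then activation
w = np.array([0.7, -0.3])
b = -0.1
x = np.array([1.0, 1.0])
pre = w @ x + b      # pre-activation = 0.3
print(step(pre))     # fires all-or-nothing: 1.0
print(ternary(pre))  # below threshold, so silent: 0.0
```

Swap `step` for a sigmoid or GELU and you get the smooth, graded outputs most modern networks actually use.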
Kinda but also no. That’s specifically a dense neural network or MLP. It gets a lot more complicated than that in some cases.
It’s only one type of neural network. A dense MLP. You have sparse neural networks, recurrent neural networks, convolutional neural networks and more!
Not all machine learning is neural networks. There are plenty of machine learning algorithms, like Random Forests, that are not neural networks. Deep learning would be big neural networks.
To be more specific this is an MLP (Multi-Layer Perceptron). Neural Network is a catch all term that includes other things such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Diffusion models and of course Transformers.
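For anyone curious what "MLP" actually means in code, here's a minimal forward pass: alternating dense layers and nonlinearities. Layer sizes, ReLU, and the random weights are all arbitrary choices for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_forward(x, weights, biases):
    # Each hidden layer is a dense (fully connected) matrix multiply
    # followed by a nonlinearity (ReLU here); the last layer is linear.
    for W, b in zip(weights[:-1], biases[:-1]):
        x = np.maximum(0.0, x @ W + b)
    return x @ weights[-1] + biases[-1]

# 3 inputs -> 4 hidden units -> 2 outputs (sizes are arbitrary)
Ws = [rng.normal(size=(3, 4)), rng.normal(size=(4, 2))]
bs = [np.zeros(4), np.zeros(2)]
out = mlp_forward(np.ones(3), Ws, bs)
print(out.shape)  # (2,)
```

CNNs, RNNs, and Transformers wrap extra machinery (convolutions, recurrence, attention) around layers like these rather than replacing them entirely.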
What you are arguing online is some variant of a Generative Pre-trained Transformer, which do have MLP or MoE layers but that’s only one part of what they are. They also have multi-headed attention mechanisms and embedding + unembedding vectors.
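The attention part sounds exotic but is small in code. Here's a sketch of scaled dot-product attention for a single head, leaving out the learned Q/K/V projections and the multi-head split that a real transformer layer adds:

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable softmax along the given axis
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    # Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)   # how much each token attends to each other token
    return softmax(scores) @ V      # weighted mix of value vectors

rng = np.random.default_rng(1)
Q = rng.normal(size=(5, 8))  # 5 tokens, head dimension 8 (arbitrary sizes)
K = rng.normal(size=(5, 8))
V = rng.normal(size=(5, 8))
out = attention(Q, K, V)
print(out.shape)  # (5, 8)
```

In a GPT-style model, blocks of multi-head attention like this alternate with the MLP (or MoE) layers, with embedding and unembedding matrices at either end.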
I know all this and wouldn't call myself a machine learning expert. I just use the things. Though I did once train a simple MLP like the one in the picture. I think it's pretty bad to call yourself a machine learning expert without knowing all of this and more.


Man, hackers would love you. Memory vulnerabilities are still one of, if not the, main cause of vulnerabilities in unsafe languages. Yes, this even applies to large projects run by experienced and skilled people, like the Linux kernel. Stop thinking you're the exception, because I guarantee you aren't. And even if you were, you can't read all of the code in the world in your lifetime. Nor can you write all the code in the world.
You are correct. I had to actually look that up.