Projects

My projects, including implementations of research papers, with documentation for each.

May 5, 2024

nanograd🧬: a neural net engine built on a PyTorch-like API.

(Currently the project is private)

nanograd is a neural net engine (command-line interface and Python library) inspired by micrograd and tinygrad and built on a PyTorch-like API. It aims to give users easy-to-use tools for creating and using various neural network architectures.
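To give a flavor of the micrograd-style idea that nanograd is inspired by, here is a minimal sketch of scalar reverse-mode autograd. This is not nanograd's code (the project is private); the `Value` class and its methods are hypothetical names chosen for illustration, and only addition and multiplication are covered.

```python
class Value:
    """A scalar that remembers how it was computed so gradients can flow back."""

    def __init__(self, data, parents=()):
        self.data = float(data)
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # replaced by the op that creates this node

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Order the graph topologically, then apply the chain rule in reverse.
        order, seen = [], set()

        def visit(node):
            if node not in seen:
                seen.add(node)
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0
        for node in reversed(order):
            node._backward()


x = Value(2.0)
y = Value(3.0)
z = x * y + x  # z = 2*3 + 2
z.backward()
print(z.data, x.grad, y.grad)  # 8.0 4.0 2.0
```

Here `x.grad` is 4.0 because `x` contributes through both the product (gradient `y = 3`) and the sum (gradient 1), which is why gradients are accumulated with `+=` rather than assigned.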

May 27, 2024

Axon: AI research lab🔬

Welcome to Axon: AI Research Lab! This repository serves as a collaborative platform for implementing cutting-edge AI research papers and conducting novel research in various areas of artificial intelligence. Our mission is to bridge the gap between theoretical research and practical applications by providing high-quality, reproducible implementations of seminal and contemporary AI papers and techniques, including InstructGPT, Llama, transformers, diffusion models, and RLHF.

May 2, 2024

Xllama🦙: an extensible, advanced language model framework inspired by the original Llama model.

X-Llama is an advanced language model framework, inspired by the original Llama model but enhanced with additional features such as Grouped Query Attention (GQA), Multi-Head Attention (MHA), and more. This project aims to provide a flexible and extensible platform for experimenting with various attention mechanisms and building state-of-the-art natural language processing models.
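As a rough illustration of how Grouped Query Attention relates to Multi-Head Attention, the sketch below lets several query heads share one key/value head. It is not taken from the X-Llama code base: the function names, the nested-list layout, and the absence of a causal mask and of learned projections are all simplifying assumptions.

```python
import math


def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attend(q, keys, values):
    """Scaled dot-product attention for one query vector over a sequence."""
    scale = math.sqrt(len(q))
    scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
    weights = softmax(scores)
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]


def grouped_query_attention(q_heads, k_heads, v_heads):
    """q_heads[h][t] is the query vector of head h at position t.

    k_heads and v_heads use the same layout but may have fewer heads.
    With len(k_heads) == len(q_heads) this reduces to plain MHA.
    """
    n_q, n_kv = len(q_heads), len(k_heads)
    assert n_q % n_kv == 0, "query heads must divide evenly into KV groups"
    group = n_q // n_kv
    out = []
    for h, queries in enumerate(q_heads):
        kv = h // group  # index of the shared key/value head for this query head
        out.append([attend(q, k_heads[kv], v_heads[kv]) for q in queries])
    return out


# 4 query heads, 2 key/value heads, sequence length 2, head dimension 2.
queries = [[1.0, 0.0], [0.0, 1.0]]
q_heads = [queries, queries, queries, queries]
k_heads = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]]]
v_heads = [[[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0], [7.0, 8.0]]]

out = grouped_query_attention(q_heads, k_heads, v_heads)
print(len(out), len(out[0]), len(out[0][0]))  # 4 2 2
print(out[0] == out[1], out[0] == out[2])     # True False
```

Query heads 0 and 1 fall in the same group and were given identical queries, so their outputs match; head 2 reads from the second key/value head and differs. Sharing key/value heads this way shrinks the key/value cache relative to MHA, which is the usual motivation for GQA.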

Dec 18, 2023

LlTRA-Model: a language-to-language Transformer model for translation, built with PyTorch on my own implementation of the Transformer from the 2017 paper "Attention Is All You Need".

LlTRA stands for Language to Language Transformer. The project builds the Transformer model from the paper "Attention Is All You Need" from scratch in PyTorch and uses it for translation. The goal is a specialized language-to-language model that accurately translates from Arabic to English while ensuring semantic fidelity, contextual awareness, cross-lingual adaptability, and the retention of grammar and style. The model should also train and run inference efficiently, so that it is practical and accessible for a wide range of applications and contributes to better Arabic-to-English translation.

Dec 22, 2023

TAL: Transformer Architecture Library. I built the Transformer model from scratch, following the paper "Attention Is All You Need", so that it can be used for your own purposes such as translation and text generation.

I developed a Python library for transformers, leveraging the architecture I previously designed. With this library, users can freely install and utilize the transformer architecture.

Oct 29, 2023

Brain-Simulator: a Multi-Layer Perceptron neural network built from scratch and visualized as a snake's brain, with a genetic algorithm applied to enhance its learning capabilities.

A neural network built from scratch using Java and the Processing environment. Brain is a neural network simulator tool that provides a visual representation of the decision-making process of a neural network controlling the movements of a snake. The tool incorporates a genetic algorithm to optimize the neural network's performance. Through real-time visualization, users can gain insights into the neural network's behavior, identify patterns, and witness the improvement achieved through the genetic algorithm.
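The general idea of evolving a perceptron's weights with a genetic algorithm can be sketched as follows. This is not the project's Java code: it is written in Python, the snake game is replaced by a toy XOR fitness function, and every name and hyperparameter here (`mlp_forward`, `evolve`, the selection and mutation settings) is an assumption made for illustration.

```python
import math
import random

# Toy task standing in for the snake environment: fit XOR.
DATA = [((0.0, 0.0), 0.0), ((0.0, 1.0), 1.0), ((1.0, 0.0), 1.0), ((1.0, 1.0), 0.0)]


def genome_size(n_in, n_hidden=4):
    # Hidden weights and biases, plus output weights and one output bias.
    return n_hidden * (n_in + 1) + n_hidden + 1


def mlp_forward(weights, inputs, n_hidden=4):
    """Tiny two-layer perceptron; `weights` is a flat list (the genome)."""
    n_in = len(inputs)
    idx = 0
    hidden = []
    for _ in range(n_hidden):
        s = sum(weights[idx + i] * x for i, x in enumerate(inputs)) + weights[idx + n_in]
        hidden.append(math.tanh(s))
        idx += n_in + 1
    return sum(weights[idx + j] * h for j, h in enumerate(hidden)) + weights[idx + n_hidden]


def fitness(genome):
    # Higher is better: negative squared error over the toy data.
    return -sum((mlp_forward(genome, x) - y) ** 2 for x, y in DATA)


def evolve(generations=30, pop_size=40, mutation_scale=0.3, seed=0):
    rng = random.Random(seed)
    size = genome_size(2)
    population = [[rng.uniform(-1, 1) for _ in range(size)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        parents = population[: pop_size // 4]  # truncation selection
        children = [parents[0][:]]             # elitism: keep the best unchanged
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, size)       # single-point crossover
            child = a[:cut] + b[cut:]
            child = [w + rng.gauss(0, mutation_scale) if rng.random() < 0.1 else w
                     for w in child]
            children.append(child)
        population = children
    population.sort(key=fitness, reverse=True)
    history.append(fitness(population[0]))
    return population[0], history


best, history = evolve()
print(history[0] <= history[-1])  # True
```

Because the best genome of each generation is copied into the next one unchanged, the best fitness can never get worse from one generation to the next. In a snake simulator the fitness would instead come from playing the game, for example from the score or survival time.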