Hi, my name is
Rohit Singh Rathaur,
an Independent AI Researcher
I'm deeply engaged in learning AI Alignment, with a keen interest in Mechanistic Interpretability, concept-based & Developmental Interpretability, and Aligning Language Models. My current endeavors include spearheading the Thoth project as a Project Team Lead (PTL) within Anuket at theLinux Foundation Networking(LFN) , where I'm committed to advancing AI technologies and applications in the telecommunications sector and beyond. Currently, I am collaborating with Rahul Sarkar from the Institute for Computational and Mathematical Engineering at Stanford University on a review paper titled A Review of Invertible Neural Networks: Theory and Applications. This work aims to shed light on the theoretical underpinnings and practical applications of invertible neural networks, with the final draft of the paper under process.
About Me
My academic journey has led me to theUiT The Arctic University of Norway, where I pursued an Academic Exchange Program, working on my Mini Master's Thesis under the guidance of Dr. Deepak Gupta from TransmuteAI Lab, Dr. Dilip from BioAI Lab, and Dr. Rakesh Sharma from the Julich Supercomputing Center, Germany. This experience enriched my understanding and application of AI in diverse fields. My passion extends to building accessible Carbon Markets and exploring Machine Learning use-cases for the Telecommunication sectors, aiming to leverage AI for sustainable and innovative solutions. Through my research and projects, I am dedicated to contributing significantly to the ever-evolving landscape of technology, with a particular focus on making AI systems more interpretable, aligned, and beneficial for humanity.
Here are a few technologies I’ve been working with recently:
- Python
- Julia
- Machine Learning
- Deep Learning
- AI Alignment
- PyTorch
- Node.js
- React

News
2024-02-17, Joined Multi-agent Reinforcement Learning and Autonomous Decision Making Reading group at University Maryland, College Park where we are also considering the fundamental Human-AI alignment problem through Multi-agent RL to be pertinent for discussions.
2022-09-17, Accepted as an Academic Exchange Student at UiT The Arctic University of Norway.
2022-05-24, Selected as a PTL for Thoth project under The Linux Foundation Networking.
Where I’ve Worked
Tech Lead & Founding Team Member
January, 2022 - March, 2024
- Pioneered the integration of remote sensing data with AI models to estimate and offset carbon emissions, contributing to sustainable environmental practices.
- Implemented AWS Elastic Container Service (ECS) scheduler that automated deployment of applications in the cloud via Docker, increasing deployment efficiency by 40%.
- Leveraged Web3 technologies to create a revenue-generating mechanism that incentivizes nature revival and carbon offsetting.
- Directed the development of an MVP for Climate Models, focusing on Amazonia Forest, to provide essential data for biomass and CO2 calculations.
- And, unfortunately failed to secure funds. :(
Talks & Presentations I was part of
Featured Talks
Data Anonymization for Telco AI Use Cases
Data anonymization can help Telcos to share data to facilitate open innovation. Two big challenges to address while anonymizing Telco data for AI usecases are (a) fool-proof against de-anonymization (b) Not hamper the power (Ex: predictive, Classification) of the AI models. The talk we cover the following:
- State of art of Data Anonymization applied to Telcos, including the research works, projects and specifications.
- What constitutes the sensitive data (names, addresses, telco-specific fields, location-data, etc). in Telco scenarios
- Anonymization categories such as (Suppression, Masking, Pseudonymization, Generalization, Swapping, Perturbation and Synthetic Data Generation).
- Approaches/techniques ranging from classic (ex: K-Anonymity) to use of NLP to GANs (Generative Adversarial Networks).
- Which of the above categories and techniques are applicable to Telco Data, considering the challenges of deanonymization and model-power.
- Demonstration of the developed unified-anonymization tool.
- Machine Learning
- Data Anonymization
- Networking
Featured Talks
Predictive Analytics for Sustainable Energy from Agricultural Waste
In this session, we will explore the transformative power of machine learning in optimizing sustainable energy production from agricultural waste. Specifically, we will examine how predictive analytics can be employed to forecast spatial biomass distribution, efficiently allocate resources, and design a cost-effective bioenergy supply chain. Using real-world examples, we’ll demonstrate how these technologies are vital in transitioning to a lower-carbon future, benefitting both the environment and farmers. This discussion promises to provide insightful, practical knowledge on employing machine learning for sustainability challenges in energy sector.
- Machine Learning
- Climate
- Sustainable Energy
Featured Talks
Synthetic Observability Data Generation Using GANs
Generative modeling is an unsupervised learning technique in machine learning. It involves automatically discovering, and learning, the patterns in the input data. Once learns, it can be used to generate new examples that would be similar to the original dataset. Generative Adversarial Networks, or GANs for short, are an approach to generative modeling using deep learning methods. The challenge (Synthetic Observability Data Generation using GANs) proposes to use GANs for generating infrastructure time-series data (data that has time dependency). Using GANs to generate datasets that should preserve temporal dynamics is challenging compared to generating Images, for example.
This talk will provide a detailed overview of GANs, covering some hands-on exercises using Tensorflow. The talk will also include explanation of GANs for time-series Data, taking TGANs as example. The talk will begin with a quick survey of existing GANs for Time-Series Data, and end with a discussion on possible GANs to consider for this challenge.
- Machine Learning
- GANs
- Networking
Some Things I’ve Built
Featured Project
Thoth: Development of Advanced Failure Prediction Models in Network Function Virtualization (NFV)
As the Project Team Lead (PTL) within the AI taskforce at LF Networking, I have been at the forefront of harnessing AI's potential for enhancing workload availability and boosting performance and efficiency in Network Function Virtualization (NFV) use cases. This ambitious project is centered around the development of specialized machine learning models and tools, primarily designed for operational teams in telecommunications. We are committed to solving distinct problems across various categories, starting with failure prediction, where we aim to create six models targeting the failure prediction of VMs, Containers, Nodes, Network-Links, Applications, and Middleware Services. One of our key achievements has been the significant enhancement of prediction accuracy by 35%, accomplished through the implementation of advanced deep learning techniques such as Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) networks. This achievement not only reflects the project's technical prowess but also our commitment to innovation, which was instrumental in securing a place for this project under the AI taskforce. Moreover, we have taken strides in the realm of data privacy with the initiation of a novel approach to protect consumer data privacy. This initiative, while safeguarding sensitive data such as city names, organizations, individuals' details, IP Addresses, and MAC addresses, ensures that the privacy standards are met without losing the predictive power of our machine learning models. This balanced approach aims to set a precedent in data collaboration, benefiting both data providers and consumers and establishing a new standard in ethical AI development.
- Python
- Machine Learning
- DevOps
- Networking
Featured Project
BetaProfile
A nicer look at your GitHub profile and repo stats. Includes data visualizations of your top languages, starred repositories, and sort through your top repos by number of stars, forks, and size.
- Next.js
- Chart.js
- GitHub API
Featured Project
Visualize Spotify
A web app for visualizing personalized Spotify data. View your top artists, top tracks, recently played tracks, and detailed audio information about each track. Create and save new playlists of recommended tracks based on your existing playlists and more.
- React
- Styled Components
- Express
- Spotify API
- Heroku
Other Noteworthy Projects
view the archiveTask Scheduler
Developed a sophisticated Task Scheduler system designed to facilitate the scheduling and execution of tasks at predefined times. This project showcases a deep understanding of Python programming, interaction with SQL databases for persistent storage, and the deployment of applications in containerized environments.
Mechanistic Interpretability
This repository is my playground for Mechanistic Interpretability where I will push all my code-examples and readings which I am doing while practicing.
Deep Sensor
DeepSensor streamlines the application of neural processes (NPs) to environmental sciences by providing a simple interface for building, training, and evaluating NPs using
xarray
andpandas
data. View my contributionDynareJulia
A Julia rewrite of Dynare: solving, simulating and estimating DSGE models.
Hostility Detection
In this work, we present a novel hostility detection dataset in Hindi language. We collect and manually annotate ∼ 8200 online posts. The annotated dataset covers four hostility dimensions: fake news, hate speech, offensive, and defamation posts, along with a non-hostile label. The hostile posts are also considered for multilabel tags due to a significant overlap among the hostile classes. We release this dataset as part of the CONSTRAINT-2021 shared task on hostile post detection.
Jarvis Telegram
Just A Rather Very Intelligent System, now on Telegram! Worked as a mentee at GSSoC for this project.
What’s Next?
Get In Touch
Although I’m always open to work on AI Alignment Theory, Deep Learning, Optimization, my inbox is always open for any ML Ops related projects also. Since the evolution of Large Language Models (LLMs), I have been increasingly drawn to areas such as multimodal learning, memory-augmented perception, and active learning. Whether you want to collaborate or just want to say hi, I’ll be happy to connect with you!
Say Hello