Skip to Content

Hi, my name is

Rohit Singh Rathaur,

an Independent AI Researcher

I'm deeply engaged in learning AI Alignment, with a keen interest in Mechanistic Interpretability, concept-based & Developmental Interpretability, and Aligning Language Models. My current endeavors include spearheading the Thoth project as a Project Team Lead (PTL) within Anuket at theLinux Foundation Networking(LFN) , where I'm committed to advancing AI technologies and applications in the telecommunications sector and beyond. Currently, I am collaborating with Rahul Sarkar from the Institute for Computational and Mathematical Engineering at Stanford University on a review paper titled A Review of Invertible Neural Networks: Theory and Applications. This work aims to shed light on the theoretical underpinnings and practical applications of invertible neural networks, with the final draft of the paper under process.

Here is my up-to-date coding activities information:Total time coded since Aug 9 2020

About Me

My academic journey has led me to theUiT The Arctic University of Norway, where I pursued an Academic Exchange Program, working on my Mini Master's Thesis under the guidance of Dr. Deepak Gupta from TransmuteAI Lab, Dr. Dilip from BioAI Lab, and Dr. Rakesh Sharma from the Julich Supercomputing Center, Germany. This experience enriched my understanding and application of AI in diverse fields. My passion extends to building accessible Carbon Markets and exploring Machine Learning use-cases for the Telecommunication sectors, aiming to leverage AI for sustainable and innovative solutions. Through my research and projects, I am dedicated to contributing significantly to the ever-evolving landscape of technology, with a particular focus on making AI systems more interpretable, aligned, and beneficial for humanity.

Here are a few technologies I’ve been working with recently:

  • Python
  • Julia
  • Machine Learning
  • Deep Learning
  • AI Alignment
  • PyTorch
  • Node.js
  • React
Headshot

News

  • 2024-02-17, Joined Multi-agent Reinforcement Learning and Autonomous Decision Making Reading group at University Maryland, College Park where we are also considering the fundamental Human-AI alignment problem through Multi-agent RL to be pertinent for discussions.

  • 2022-09-17, Accepted as an Academic Exchange Student at UiT The Arctic University of Norway.

  • 2022-05-24, Selected as a PTL for Thoth project under The Linux Foundation Networking.

Where I’ve Worked

Tech Lead & Founding Team Member

January, 2022 - March, 2024

  • Pioneered the integration of remote sensing data with AI models to estimate and offset carbon emissions, contributing to sustainable environmental practices.
  • Implemented AWS Elastic Container Service (ECS) scheduler that automated deployment of applications in the cloud via Docker, increasing deployment efficiency by 40%.
  • Leveraged Web3 technologies to create a revenue-generating mechanism that incentivizes nature revival and carbon offsetting.
  • Directed the development of an MVP for Climate Models, focusing on Amazonia Forest, to provide essential data for biomass and CO2 calculations.
  • And, unfortunately failed to secure funds. :(

Talks & Presentations I was part of

  • Featured Talks

    Data Anonymization for Telco AI Use Cases

    Data anonymization can help Telcos to share data to facilitate open innovation. Two big challenges to address while anonymizing Telco data for AI usecases are (a) fool-proof against de-anonymization (b) Not hamper the power (Ex: predictive, Classification) of the AI models. The talk we cover the following:

    • State of art of Data Anonymization applied to Telcos, including the research works, projects and specifications.
    • What constitutes the sensitive data (names, addresses, telco-specific fields, location-data, etc). in Telco scenarios
    • Anonymization categories such as (Suppression, Masking, Pseudonymization, Generalization, Swapping, Perturbation and Synthetic Data Generation).
    • Approaches/techniques ranging from classic (ex: K-Anonymity) to use of NLP to GANs (Generative Adversarial Networks).
    • Which of the above categories and techniques are applicable to Telco Data, considering the challenges of deanonymization and model-power.
    • Demonstration of the developed unified-anonymization tool.
    • Machine Learning
    • Data Anonymization
    • Networking
  • Featured Talks

    Predictive Analytics for Sustainable Energy from Agricultural Waste

    In this session, we will explore the transformative power of machine learning in optimizing sustainable energy production from agricultural waste. Specifically, we will examine how predictive analytics can be employed to forecast spatial biomass distribution, efficiently allocate resources, and design a cost-effective bioenergy supply chain. Using real-world examples, we’ll demonstrate how these technologies are vital in transitioning to a lower-carbon future, benefitting both the environment and farmers. This discussion promises to provide insightful, practical knowledge on employing machine learning for sustainability challenges in energy sector.

    • Machine Learning
    • Climate
    • Sustainable Energy
  • Featured Talks

    Synthetic Observability Data Generation Using GANs

    Generative modeling is an unsupervised learning technique in machine learning. It involves automatically discovering, and learning, the patterns in the input data. Once learns, it can be used to generate new examples that would be similar to the original dataset. Generative Adversarial Networks, or GANs for short, are an approach to generative modeling using deep learning methods. The challenge (Synthetic Observability Data Generation using GANs) proposes to use GANs for generating infrastructure time-series data (data that has time dependency). Using GANs to generate datasets that should preserve temporal dynamics is challenging compared to generating Images, for example.

    This talk will provide a detailed overview of GANs, covering some hands-on exercises using Tensorflow. The talk will also include explanation of GANs for time-series Data, taking TGANs as example. The talk will begin with a quick survey of existing GANs for Time-Series Data, and end with a discussion on possible GANs to consider for this challenge.

    • Machine Learning
    • GANs
    • Networking

Some Things I’ve Built

Other Noteworthy Projects

view the archive
  • Folder

    Task Scheduler

    Developed a sophisticated Task Scheduler system designed to facilitate the scheduling and execution of tasks at predefined times. This project showcases a deep understanding of Python programming, interaction with SQL databases for persistent storage, and the deployment of applications in containerized environments.

    • Python
    • MariaDB
    • Docker
    • Kubernetes
    • Helm
  • Mechanistic Interpretability

    This repository is my playground for Mechanistic Interpretability where I will push all my code-examples and readings which I am doing while practicing.

    • Python
    • PyTorch
    • TransformerLens
    • LLMs
  • Deep Sensor

    DeepSensor streamlines the application of neural processes (NPs) to environmental sciences by providing a simple interface for building, training, and evaluating NPs using xarray and pandas data. View my contribution

    • Python
    • Deep Learning
    • Modeling
  • DynareJulia

    A Julia rewrite of Dynare: solving, simulating and estimating DSGE models.

    • Julia
    • Simulation
    • NLP
  • Hostility Detection

    In this work, we present a novel hostility detection dataset in Hindi language. We collect and manually annotate ∼ 8200 online posts. The annotated dataset covers four hostility dimensions: fake news, hate speech, offensive, and defamation posts, along with a non-hostile label. The hostile posts are also considered for multilabel tags due to a significant overlap among the hostile classes. We release this dataset as part of the CONSTRAINT-2021 shared task on hostile post detection.

    • PyTorch
    • Machine Learning
    • NLP
  • Jarvis Telegram

    Just A Rather Very Intelligent System, now on Telegram! Worked as a mentee at GSSoC for this project.

    • Python
    • NLP
    • Rasa

What’s Next?

Get In Touch

Although I’m always open to work on AI Alignment Theory, Deep Learning, Optimization, my inbox is always open for any ML Ops related projects also. Since the evolution of Large Language Models (LLMs), I have been increasingly drawn to areas such as multimodal learning, memory-augmented perception, and active learning. Whether you want to collaborate or just want to say hi, I’ll be happy to connect with you!