Continual learning [1], also known as lifelong learning, refers to the ability of an artificial intelligence system to continuously learn and adapt from new experiences over time. This is an important capability as it allows AI models to acquire new knowledge and skills as more data becomes available, without forgetting previously learned information. Continual learning … full description “Algorithms for lifelong deep learning (available)”
Tag: Machine Learning (ML)
Attacking Large Pre-trained Programming Language Models (PLMs) via Backdoors (completed)
Project Description: Backdoors refer to a class of Machine Learning (ML) attacks in which an adversary trains an ML model to intentionally misclassify any input that contains a specific trigger to a chosen target label [1]. This is typically achieved by poisoning the training data, so that inputs are misclassified to the target label when the backdoor trigger is present. For instance, …
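The poisoning step described in this excerpt can be illustrated with a toy sketch. It is not the project's actual method. The function name `poison_dataset`, the comment-style trigger, and the `(text, label)` sample format are all assumptions made for illustration.

```python
import random


def poison_dataset(samples, trigger, target_label, rate=0.1, seed=0):
    """Return a copy of `samples` where a fraction `rate` is poisoned.

    Hypothetical helper: each sample is assumed to be a (text, label) pair.
    A poisoned sample gets the trigger prepended and its label replaced by
    the attacker's target label; all other samples are left unchanged.
    """
    rng = random.Random(seed)
    n_poison = int(len(samples) * rate)
    poisoned_idx = set(rng.sample(range(len(samples)), n_poison))

    out = []
    for i, (text, label) in enumerate(samples):
        if i in poisoned_idx:
            out.append((trigger + "\n" + text, target_label))
        else:
            out.append((text, label))
    return out


# Toy usage: ten identical code snippets labelled 0, half of them poisoned.
clean = [("def f(): pass", 0) for _ in range(10)]
poisoned = poison_dataset(clean, "# trigger", target_label=1, rate=0.5)
```

A model trained on such data may learn to associate the trigger with the target label while behaving normally on clean inputs, which is what makes this class of attack hard to notice.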
Exploring optimization algorithms for recurrent neural networks (available)
Recurrent neural networks (RNNs) are a key type of architecture in modern deep learning, particularly for processing sequential data such as text, speech, video, and time series data. Unlike feedforward networks, RNNs have loops that allow information to persist and be passed from one step to the next. This enables them to effectively model patterns …
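The loop that carries information from one step to the next can be sketched in a few lines of plain Python. This is a minimal illustration of a vanilla (Elman-style) recurrence with a tanh nonlinearity. The names `rnn_step`, `run_rnn`, `w_xh`, `w_hh`, and `b` are assumptions made for this sketch, not part of the project description.

```python
import math


def rnn_step(x, h, w_xh, w_hh, b):
    """One recurrent step: combine the current input x with the previous state h."""
    return [
        math.tanh(
            sum(w_xh[i][j] * x[j] for j in range(len(x)))    # input contribution
            + sum(w_hh[i][k] * h[k] for k in range(len(h)))  # previous-state contribution
            + b[i]
        )
        for i in range(len(h))
    ]


def run_rnn(inputs, h0, w_xh, w_hh, b):
    """Apply rnn_step over a sequence, feeding each hidden state into the next step."""
    h = h0
    states = []
    for x in inputs:
        h = rnn_step(x, h, w_xh, w_hh, b)
        states.append(h)
    return states


# Toy usage: a one-unit RNN sees an input of 1.0 and then an input of 0.0.
states = run_rnn([[1.0], [0.0]], h0=[0.0], w_xh=[[1.0]], w_hh=[[0.5]], b=[0.0])
```

In the toy run the second input is zero, yet the second hidden state is nonzero because it still depends on the first state through `w_hh`. The same dependence across many steps is what makes optimizing RNNs delicate, since gradients must flow back through every application of the loop.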
Implementation of biologically inspired efficient deep learning models (available)
As deep learning models continue to grow in size and complexity to tackle increasingly difficult tasks, the need for efficient and scalable models becomes ever more important. Extremely large language models like GPT-4 require massive computational resources and expensive hardware to train and run. This makes them impractical to deploy at scale in many real-world …