Sitemap

A list of all the posts and pages found on the site. For the robots out there, an XML version is also available for digesting.

Pages

Posts

Visual Prompting

8 minute read

Published:

Large language models like GPT-3 can be prompted with in-context examples or instructions to complete tasks without fine-tuning the model’s parameters. Prompting allows handling open-ended queries without introducing large numbers of learnable parameters. However, manually crafting a successful “hard” prompt that maximizes the likelihood of the desired output is challenging, and specific downstream tasks may still require domain adaptation. This motivates soft prompts: tunable vectors appended to the input that steer the model toward the desired outputs. Soft prompts help handle low-data domains and improve generalization without exhaustive prompt engineering.
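
A minimal sketch of the soft-prompt idea described above, assuming a Hugging Face causal LM; the model name, number of prompt tokens, and learning rate are illustrative choices, not values from the post.

```python
# Soft prompting sketch: freeze the LM and train only a small matrix of
# prompt vectors that is prepended to the input embeddings.
import torch
import torch.nn as nn
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")
model.requires_grad_(False)  # freeze all LM parameters

n_prompt_tokens = 20
embed_dim = model.get_input_embeddings().embedding_dim
soft_prompt = nn.Parameter(torch.randn(n_prompt_tokens, embed_dim) * 0.02)

def forward_with_soft_prompt(input_ids):
    token_embeds = model.get_input_embeddings()(input_ids)           # (B, T, D)
    prompt = soft_prompt.unsqueeze(0).expand(input_ids.size(0), -1, -1)
    inputs_embeds = torch.cat([prompt, token_embeds], dim=1)         # prepend
    return model(inputs_embeds=inputs_embeds)

# Only the soft prompt receives gradients during training.
optimizer = torch.optim.AdamW([soft_prompt], lr=1e-3)
```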

Improving instruction following capabilities using self-alignment

4 minute read

Published:

The introduction of GPT-3 completely revolutionized natural language processing by enabling few-shot learning through prompt engineering rather than fine-tuning. However, language models still struggle with zero-shot performance on tasks dissimilar from their pretraining data.
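
For illustration, here is the contrast between a zero-shot instruction and a few-shot prompt with in-context examples; `query_llm` is a hypothetical completion call, and the reviews are made-up examples.

```python
# Zero-shot: the model gets only an instruction.
zero_shot_prompt = "Classify the sentiment of: 'The battery dies within an hour.'"

# Few-shot: in-context examples demonstrate the task before the query.
few_shot_prompt = """\
Review: 'Absolutely loved the camera quality.'  Sentiment: positive
Review: 'The screen cracked on day one.'        Sentiment: negative
Review: 'The battery dies within an hour.'      Sentiment:"""

# response = query_llm(few_shot_prompt)  # hypothetical; any completion API works
```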

Reasoning in Large Language Models

5 minute read

Published:


Let’s start this blog with a task. Suppose we have to train a model that concatenates the last letters of two input words. For example, if the input words are ‘Elon’ and ‘Musk’, the model should return ‘nk’. If we use supervised learning to train such a model, we will need many examples covering words with different final letters to get correct outputs. One might argue that we can use few-shot learning with LLMs like GPT-3 to solve this problem. However, the model still isn’t able to produce the right output.
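
A small sketch of the task and the kind of few-shot prompt one might try; `generate` is a hypothetical LLM call, and the exemplar spells out intermediate steps in the style of chain-of-thought prompting.

```python
def last_letter_concat(w1: str, w2: str) -> str:
    """Ground-truth program: 'Elon', 'Musk' -> 'nk'."""
    return w1[-1] + w2[-1]

few_shot_prompt = """\
Q: Take the last letters of 'Taylor' and 'Swift' and concatenate them.
A: The last letter of 'Taylor' is 'r'. The last letter of 'Swift' is 't'. The answer is 'rt'.
Q: Take the last letters of 'Elon' and 'Musk' and concatenate them.
A:"""

# answer = generate(few_shot_prompt)  # hypothetical LLM call
```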

An adversarial lens towards aligned large language models

7 minute read

Published:


Since the public release of LLM-based chat assistants like ChatGPT, there has been a large emphasis on aligning AI language models to prevent the production of undesirable or harmful content. One approach is reinforcement learning from human feedback, which optimizes a pre-trained language model against a reward function learned from human preferences [1]. Constitutional AI [2] further removes the need for human preference labels by training a reward model from AI feedback refined using safety instructions. The recently released Llama-2 model [3] also uses safety and helpfulness criteria to learn an RLHF-like model that improves alignment in open-source LLMs.
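
As a rough sketch of the reward-modeling step mentioned above: a scalar reward head is trained so that responses preferred by annotators score higher than rejected ones. The feature dimension and the linear head are placeholders; in practice the head sits on top of a pretrained LM.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RewardModel(nn.Module):
    def __init__(self, hidden_dim: int = 768):
        super().__init__()
        self.score = nn.Linear(hidden_dim, 1)  # stub for an LM-backed reward head

    def forward(self, response_features: torch.Tensor) -> torch.Tensor:
        return self.score(response_features).squeeze(-1)  # scalar reward per response

def preference_loss(reward_chosen: torch.Tensor, reward_rejected: torch.Tensor) -> torch.Tensor:
    # Bradley-Terry style objective: the preferred response should score higher.
    return -F.logsigmoid(reward_chosen - reward_rejected).mean()
```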

Projects

Seeing is not believing: Privacy preserving facial manipulation using adversarial mask generation and diffusion models

less than 1 minute read

Identified salient features in input images and generated adversarial masks using techniques such as saliency gradient maps, Grad-CAM, and random patch masking. Created non-private representations of the input images using latent diffusion models, so that private information is not transmitted to downstream tasks such as FaceNet’s recognition model.
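
A minimal sketch of one of the techniques named above, a saliency gradient map; the classifier, image tensor, and target class are placeholders, not artifacts from the project.

```python
import torch

def saliency_map(model: torch.nn.Module, image: torch.Tensor, target_class: int) -> torch.Tensor:
    """Gradient of the target-class score w.r.t. the input pixels, image shaped (1, 3, H, W)."""
    model.eval()
    image = image.clone().requires_grad_(True)
    score = model(image)[0, target_class]
    score.backward()
    # Max over channels gives a per-pixel importance map, usable for mask placement.
    return image.grad.detach().abs().max(dim=1).values  # (1, H, W)
```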

Distributed Heterogeneous Training for Large Language Models using Ray and DeepSpeed

less than 1 minute read

Conducted ablation studies exploring efficient heterogeneous (CPU + GPU) distributed training for language models such as BERT and RoBERTa, across factors such as batch size and the number of CPU/GPU parallel workers. Ray was used for parallelizing CPU processes, and DeepSpeed’s ZeRO optimization was used for data parallelism along with mixed precision training for sentiment analysis.
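
A sketch of the two pieces described above: Ray remote tasks for CPU-parallel preprocessing and a DeepSpeed config enabling ZeRO data parallelism with fp16 mixed precision. The batch size, ZeRO stage, and toy tokenizer are illustrative, not the project’s actual settings.

```python
import ray

ray.init()

@ray.remote
def tokenize_shard(texts):
    # CPU-parallel preprocessing; real tokenizer construction omitted for brevity.
    return [t.lower().split() for t in texts]

shards = [["example sentence"], ["another one"]]
tokenized = ray.get([tokenize_shard.remote(s) for s in shards])

# DeepSpeed config: ZeRO stage 2 data parallelism plus fp16 mixed precision.
ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
}
# model_engine, _, _, _ = deepspeed.initialize(model=model, config=ds_config)
```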

Talks

Teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.