Harvard Business School → Faculty & Research → Publications

Publications

Filter Results: (81)

Show Results For

  • All HBS Web  (119,737)
    • Faculty Publications  (81)

      Filtered by author: Lakkaraju, Himabindu

      Page 1 of 81 Results
      • 2024
      • Conference Paper

      Fair Machine Unlearning: Data Removal while Mitigating Disparities

      By: Himabindu Lakkaraju, Flavio Calmon, Jiaqi Ma and Alex Oesterling
      Lakkaraju, Himabindu, Flavio Calmon, Jiaqi Ma, and Alex Oesterling. "Fair Machine Unlearning: Data Removal while Mitigating Disparities." 2024.
      • 2024
      • Conference Paper

      Quantifying Uncertainty in Natural Language Explanations of Large Language Models

      By: Himabindu Lakkaraju, Sree Harsha Tanneru and Chirag Agarwal
      Large Language Models (LLMs) are increasingly used as powerful tools for several high-stakes natural language processing (NLP) applications. Recent prompting works claim to elicit intermediate reasoning steps and key tokens that serve as proxy explanations for LLM...
      Keywords: Large Language Model; AI and Machine Learning
      Lakkaraju, Himabindu, Sree Harsha Tanneru, and Chirag Agarwal. "Quantifying Uncertainty in Natural Language Explanations of Large Language Models." Paper presented at the Society for Artificial Intelligence and Statistics, 2024.
      • 2023
      • Article

      M4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities, and Models

      By: Himabindu Lakkaraju, Xuhong Li, Mengnan Du, Jiamin Chen, Yekun Chai and Haoyi Xiong
      While Explainable Artificial Intelligence (XAI) techniques have been widely studied to explain predictions made by deep neural networks, the way to evaluate the faithfulness of explanation results remains challenging, due to the heterogeneity of explanations for...
      Keywords: AI and Machine Learning
      Lakkaraju, Himabindu, Xuhong Li, Mengnan Du, Jiamin Chen, Yekun Chai, and Haoyi Xiong. "M4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities, and Models." Advances in Neural Information Processing Systems (NeurIPS) (2023).
      • 2023
      • Article

      Post Hoc Explanations of Language Models Can Improve Language Models

      By: Satyapriya Krishna, Jiaqi Ma, Dylan Slack, Asma Ghandeharioun, Sameer Singh and Himabindu Lakkaraju
      Large Language Models (LLMs) have demonstrated remarkable capabilities in performing complex tasks. Moreover, recent research has shown that incorporating human-annotated rationales (e.g., Chain-of-Thought prompting) during in-context learning can significantly enhance...
      Keywords: AI and Machine Learning; Performance Effectiveness
      Krishna, Satyapriya, Jiaqi Ma, Dylan Slack, Asma Ghandeharioun, Sameer Singh, and Himabindu Lakkaraju. "Post Hoc Explanations of Language Models Can Improve Language Models." Advances in Neural Information Processing Systems (NeurIPS) (2023).
      • 2023
      • Article

      Verifiable Feature Attributions: A Bridge between Post Hoc Explainability and Inherent Interpretability

      By: Usha Bhalla, Suraj Srinivas and Himabindu Lakkaraju
      With the increased deployment of machine learning models in various real-world applications, researchers and practitioners alike have emphasized the need for explanations of model behaviour. To this end, two broad strategies have been outlined in prior literature to...
      Keywords: AI and Machine Learning; Mathematical Methods
      Bhalla, Usha, Suraj Srinivas, and Himabindu Lakkaraju. "Verifiable Feature Attributions: A Bridge between Post Hoc Explainability and Inherent Interpretability." Advances in Neural Information Processing Systems (NeurIPS) (2023).
      • 2023
      • Article

      Which Models Have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness

      By: Suraj Srinivas, Sebastian Bordt and Himabindu Lakkaraju
      One of the remarkable properties of robust computer vision models is that their input-gradients are often aligned with human perception, referred to in the literature as perceptually-aligned gradients (PAGs). Despite only being trained for classification, PAGs cause...
      Keywords: AI and Machine Learning; Mathematical Methods
      Srinivas, Suraj, Sebastian Bordt, and Himabindu Lakkaraju. "Which Models Have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness." Advances in Neural Information Processing Systems (NeurIPS) (2023).
      • October 2023 (Revised January 2025)
      • Case

      Sydney Loves Kevin

      By: Ryan W. Buell and Himabindu Lakkaraju
      Kevin Roose was a columnist and podcast host for the New York Times, who focused on technology and its effects on society. When Microsoft launched the latest version of its search engine Bing in February 2023, the company invited Roose to its Redmond campus to...
      Keywords: Newspapers; AI and Machine Learning; Technology Adoption; Technological Innovation; Perspective; Media and Broadcasting Industry; Technology Industry
      Buell, Ryan W., and Himabindu Lakkaraju. "Sydney Loves Kevin." Harvard Business School Case 624-039, October 2023. (Revised January 2025.)
      • 2023
      • Working Paper

      In-Context Unlearning: Language Models as Few Shot Unlearners

      By: Martin Pawelczyk, Seth Neel and Himabindu Lakkaraju
      Machine unlearning, the study of efficiently removing the impact of specific training points on the trained model, has garnered increased attention of late, driven by the need to comply with privacy regulations like the Right to be Forgotten. Although unlearning is...
      Keywords: AI and Machine Learning; Copyright; Information
      Pawelczyk, Martin, Seth Neel, and Himabindu Lakkaraju. "In-Context Unlearning: Language Models as Few Shot Unlearners." Working Paper, October 2023.
      • 2023
      • Article

      On Minimizing the Impact of Dataset Shifts on Actionable Explanations

      By: Anna P. Meyer, Dan Ley, Suraj Srinivas and Himabindu Lakkaraju
      The Right to Explanation is an important regulatory principle that allows individuals to request actionable explanations for algorithmic decisions. However, several technical challenges arise when providing such actionable explanations in practice. For instance, models...
      Keywords: Mathematical Methods; Analytics and Data Science
      Meyer, Anna P., Dan Ley, Suraj Srinivas, and Himabindu Lakkaraju. "On Minimizing the Impact of Dataset Shifts on Actionable Explanations." Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI) 39th (2023): 1434–1444.
      • 2023
      • Article

      On the Impact of Actionable Explanations on Social Segregation

      By: Ruijiang Gao and Himabindu Lakkaraju
      As predictive models seep into several real-world applications, it has become critical to ensure that individuals who are negatively impacted by the outcomes of these models are provided with a means for recourse. To this end, there has been a growing body of research...
      Keywords: Forecasting and Prediction; AI and Machine Learning; Outcome or Result
      Gao, Ruijiang, and Himabindu Lakkaraju. "On the Impact of Actionable Explanations on Social Segregation." Proceedings of the International Conference on Machine Learning (ICML) 40th (2023): 10727–10743.
      • August 2023
      • Article

      Explaining Machine Learning Models with Interactive Natural Language Conversations Using TalkToModel

      By: Dylan Slack, Satyapriya Krishna, Himabindu Lakkaraju and Sameer Singh
      Practitioners increasingly use machine learning (ML) models, yet models have become more complex and harder to understand. To understand complex models, researchers have proposed techniques to explain model predictions. However, practitioners struggle to use...
      Keywords: AI and Machine Learning; Technological Innovation; Technology Adoption
      Slack, Dylan, Satyapriya Krishna, Himabindu Lakkaraju, and Sameer Singh. "Explaining Machine Learning Models with Interactive Natural Language Conversations Using TalkToModel." Nature Machine Intelligence 5, no. 8 (August 2023): 873–883.
      • 2023
      • Article

      Towards Bridging the Gaps between the Right to Explanation and the Right to Be Forgotten

      By: Himabindu Lakkaraju, Satyapriya Krishna and Jiaqi Ma
      The Right to Explanation and the Right to be Forgotten are two important principles outlined to regulate algorithmic decision making and data usage in real-world applications. While the right to explanation allows individuals to request an actionable explanation for an...
      Keywords: Analytics and Data Science; AI and Machine Learning; Decision Making; Governing Rules, Regulations, and Reforms
      Lakkaraju, Himabindu, Satyapriya Krishna, and Jiaqi Ma. "Towards Bridging the Gaps between the Right to Explanation and the Right to Be Forgotten." Proceedings of the International Conference on Machine Learning (ICML) 40th (2023): 17808–17826.
      • June 2023
      • Article

      When Does Uncertainty Matter? Understanding the Impact of Predictive Uncertainty in ML Assisted Decision Making

      By: Sean McGrath, Parth Mehta, Alexandra Zytek, Isaac Lage and Himabindu Lakkaraju
      As machine learning (ML) models are increasingly being employed to assist human decision makers, it becomes critical to provide these decision makers with relevant inputs which can help them decide if and how to incorporate model predictions into their decision...
      Keywords: AI and Machine Learning; Decision Making
      McGrath, Sean, Parth Mehta, Alexandra Zytek, Isaac Lage, and Himabindu Lakkaraju. "When Does Uncertainty Matter? Understanding the Impact of Predictive Uncertainty in ML Assisted Decision Making." Transactions on Machine Learning Research (TMLR) (June 2023).
      • 2023
      • Article

      Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse

      By: Martin Pawelczyk, Teresa Datta, Johannes van-den-Heuvel, Gjergji Kasneci and Himabindu Lakkaraju
      As machine learning models are increasingly being employed to make consequential decisions in real-world settings, it becomes critical to ensure that individuals who are adversely impacted (e.g., loan denied) by the predictions of these models are provided with a means...
      Keywords: AI and Machine Learning; Decision Choices and Conditions; Mathematical Methods
      Pawelczyk, Martin, Teresa Datta, Johannes van-den-Heuvel, Gjergji Kasneci, and Himabindu Lakkaraju. "Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse." Proceedings of the International Conference on Learning Representations (ICLR) (2023).
      • April 2023
      • Article

      On the Privacy Risks of Algorithmic Recourse

      By: Martin Pawelczyk, Himabindu Lakkaraju and Seth Neel
      As predictive models are increasingly being employed to make consequential decisions, there is a growing emphasis on developing techniques that can provide algorithmic recourse to affected individuals. While such recourses can be immensely beneficial to affected...
      Keywords: Recourse; Privacy Threats; AI and Machine Learning; Information
      Pawelczyk, Martin, Himabindu Lakkaraju, and Seth Neel. "On the Privacy Risks of Algorithmic Recourse." Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS) 206 (April 2023).
      • 2023
      • Article

      Evaluating Explainability for Graph Neural Networks

      By: Chirag Agarwal, Owen Queen, Himabindu Lakkaraju and Marinka Zitnik
      As explanations are increasingly used to understand the behavior of graph neural networks (GNNs), evaluating the quality and reliability of GNN explanations is crucial. However, assessing the quality of GNN explanations is challenging as existing graph datasets have no...
      Keywords: Analytics and Data Science
      Agarwal, Chirag, Owen Queen, Himabindu Lakkaraju, and Marinka Zitnik. "Evaluating Explainability for Graph Neural Networks." Art. 114. Scientific Data 10 (2023).
      • 2023
      • Working Paper

      When Algorithms Explain Themselves: AI Adoption and Accuracy of Experts' Decisions

      By: Himabindu Lakkaraju and Chiara Farronato
      Lakkaraju, Himabindu, and Chiara Farronato. "When Algorithms Explain Themselves: AI Adoption and Accuracy of Experts' Decisions." Working Paper, 2023.
      • 2022
      • Article

      Efficiently Training Low-Curvature Neural Networks

      By: Suraj Srinivas, Kyle Matoba, Himabindu Lakkaraju and Francois Fleuret
      Standard deep neural networks often have excess non-linearity, making them susceptible to issues such as low adversarial robustness and gradient instability. Common methods to address these downstream issues, such as adversarial training, are expensive and often...
      Keywords: AI and Machine Learning
      Srinivas, Suraj, Kyle Matoba, Himabindu Lakkaraju, and Francois Fleuret. "Efficiently Training Low-Curvature Neural Networks." Advances in Neural Information Processing Systems (NeurIPS) (2022).
      • 2022
      • Article

      OpenXAI: Towards a Transparent Evaluation of Model Explanations

      By: Chirag Agarwal, Satyapriya Krishna, Eshika Saxena, Martin Pawelczyk, Nari Johnson, Isha Puri, Marinka Zitnik and Himabindu Lakkaraju
      While several types of post hoc explanation methods have been proposed in recent literature, there is very little work on systematically benchmarking these methods. Here, we introduce OpenXAI, a comprehensive and extensible open-source framework for evaluating and...
      Keywords: Measurement and Metrics; Analytics and Data Science
      Agarwal, Chirag, Satyapriya Krishna, Eshika Saxena, Martin Pawelczyk, Nari Johnson, Isha Puri, Marinka Zitnik, and Himabindu Lakkaraju. "OpenXAI: Towards a Transparent Evaluation of Model Explanations." Advances in Neural Information Processing Systems (NeurIPS) (2022).
      • 2022
      • Article

      Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post hoc Explanations

      By: Tessa Han, Suraj Srinivas and Himabindu Lakkaraju
      A critical problem in the field of post hoc explainability is the lack of a common foundational goal among methods. For example, some methods are motivated by function approximation, some by game theoretic notions, and some by obtaining clean visualizations. This...
      Keywords: Mathematical Methods; Decision Choices and Conditions; Analytics and Data Science
      Han, Tessa, Suraj Srinivas, and Himabindu Lakkaraju. "Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post hoc Explanations." Advances in Neural Information Processing Systems (NeurIPS) (2022). (Best Paper Award, International Conference on Machine Learning (ICML) Workshop on Interpretable ML in Healthcare.)