Jon Kleinberg

Cornell University

H-index: 122

North America-United States

About Jon Kleinberg

Jon Kleinberg is a distinguished researcher at Cornell University, with an h-index of 122 overall and 76 since 2020. He specializes in algorithms, data mining, information networks, social networks, and Web mining.

His recent articles reflect a diverse array of research interests and contributions to the field:

From Graphs to Hypergraphs: Hypergraph Projection and its Remediation

Modeling reputation-based behavioral biases in school choice

Hypergraph patterns and collaboration structure

Replicating Electoral Success

Language Generation in the Limit

Equilibria, Efficiency, and Inequality in Network Formation for Hiring and Opportunity

The Moderating Effect of Instant Runoff Voting

Microstructures and Accuracy of Graph Recall by Large Language Models

Jon Kleinberg Information

University	Cornell University
Position	Professor of Computer Science
Citations(all)	123425
Citations(since 2020)	40183
Cited By	99840
hIndex(all)	122
hIndex(since 2020)	76
i10Index(all)	283
i10Index(since 2020)	214
University Profile Page	Cornell University

Jon Kleinberg Skills & Research Interests

algorithms

data mining

information networks

social networks

Web mining

Top articles of Jon Kleinberg

A Feder Cooper,Katherine Lee,Madiha Zahrah Choksi,Solon Barocas,Christopher De Sa,James Grimmelmann,Jon Kleinberg,Siddhartha Sen,Baobao Zhang

Published Date

2024

Variance in predictions across different trained models is a significant, under-explored source of error in fair binary classification. In practice, the variance on some data examples is so large that decisions can be effectively arbitrary. To investigate this problem, we take an experimental approach and make four overarching contributions. We: 1) Define a metric called self-consistency, derived from variance, which we use as a proxy for measuring and reducing arbitrariness; 2) Develop an ensembling algorithm that abstains from classification when a prediction would be arbitrary; 3) Conduct the largest to-date empirical study of the role of variance (vis-a-vis self-consistency and arbitrariness) in fair binary classification; and, 4) Release a toolkit that makes the US Home Mortgage Disclosure Act (HMDA) datasets easily usable for future research. Altogether, our experiments reveal shocking insights about the reliability of conclusions on benchmark datasets. Most fair binary classification benchmarks are close-to-fair when taking into account the amount of arbitrariness present in predictions -- before we even try to apply any fairness interventions. This finding calls into question the practical utility of common algorithmic fairness methods, and in turn suggests that we should reconsider how we choose to measure fairness in binary classification.

On the Relationship Between Relevance and Conflict in Online Social Link Recommendations

Authors

Yanbang Wang,Jon Kleinberg

Published Date

2023

In an online social network, link recommendations are a way for users to discover relevant links to people they may know, thereby potentially increasing their engagement on the platform. However, the addition of links to a social network can also have an effect on the level of conflict in the network---expressed in terms of polarization and disagreement. To date, however, we have very little understanding of how these two implications of link formation relate to each other: are the goals of high relevance and conflict reduction aligned, or are the links that users are most likely to accept fundamentally different from the ones with the greatest potential for reducing conflict? Here we provide the first analysis of this question, using the recently popular Friedkin-Johnsen model of opinion dynamics. We first present a surprising result on how link additions shift the level of opinion conflict, followed by explanation work that relates the amount of shift to structural features of the added links. We then characterize the gap in conflict reduction between the set of links achieving the largest reduction and the set of links achieving the highest relevance. The gap is measured on real-world data, based on instantiations of relevance defined by 13 link recommendation algorithms. We find that some, but not all, of the more accurate algorithms actually lead to better reduction of conflict. Our work suggests that social links recommended for increasing user engagement may not be as conflict-provoking as people might have thought.

On the actionability of outcome prediction

Authors

Lydia T Liu,Solon Barocas,Jon Kleinberg,Karen Levy

Published Date

2024

Predicting future outcomes is a prevalent application of machine learning in social impact domains. Examples range from predicting student success in education to predicting disease risk in healthcare. Practitioners often recognize that the ultimate goal is not just to predict but to act effectively, and increasing empirical evidence suggests that relying on outcome predictions for downstream interventions may not lead to desired results. In most domains there exists a multitude of possible interventions for each individual, making the challenge of taking effective action more acute. Even when the causal mechanism connecting the individual’s latent states to outcomes is well understood, in any given instance (a specific student, or patient), practitioners still need to infer—from budgeted measurements of latent states—which of many possible interventions will be most effective for this individual. With this in mind, we ask: when are accurate predictors of outcomes helpful for identifying the most suitable intervention? Through a simple model encompassing actions, latent states, and measurements, we demonstrate that pure outcome prediction rarely results in the most effective policy for taking actions, even when combined with other measurements. We find that except in cases where there is a single decisive action for improving the outcome, outcome prediction never maximizes “action value”, the utility of taking actions. Making measurements of actionable latent states, where specific actions lead to desired outcomes, considerably enhances the action value compared to outcome prediction, and the degree of improvement depends on action costs and the …

Node-based generalized friendship paradox fails

Authors

Anna Evtushenko,Jon Kleinberg

Journal

Scientific Reports

Published Date

2023/2/6

The Friendship Paradox—the principle that “your friends have more friends than you do”—is a combinatorial fact about degrees in a graph; but given that many web-based social activities are correlated with a user’s degree, this fact has been taken more broadly to suggest the empirical principle that “your friends are also more active than you are.” This Generalized Friendship Paradox, the notion that any attribute positively correlated with degree obeys the Friendship Paradox, has been established mathematically in a network-level version that essentially aggregates uniformly over all the edges of a network. Here we show, however, that the natural node-based version of the Generalized Friendship Paradox—which aggregates over nodes, not edges—may fail, even for degree-attribute correlations approaching 1. Whether this version holds depends not only on degree-attribute correlations, but also on the …
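The difference between aggregating over edges and aggregating over nodes can be made concrete for the simplest attribute, degree itself. The following is a minimal sketch, not code from the paper: the function name `friendship_paradox_stats` and the star-graph example are illustrative assumptions, and the graph is assumed to be simple and undirected with no isolated nodes.

```python
from collections import defaultdict


def friendship_paradox_stats(edges):
    """Return (mean degree, edge-based friend mean, node-based friend mean)."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    deg = {u: len(nbrs) for u, nbrs in adj.items()}
    n = len(deg)
    total_degree = sum(deg.values())

    mean_degree = total_degree / n
    # Edge-based: pick a random edge endpoint and look at the friend's degree.
    # A node of degree d plays the role of "friend" d times: sum(d^2) / sum(d).
    edge_based = sum(d * d for d in deg.values()) / total_degree
    # Node-based: each node first averages over its own friends,
    # then every node counts equally in the outer average.
    node_based = sum(sum(deg[v] for v in adj[u]) / deg[u] for u in adj) / n
    return mean_degree, edge_based, node_based


# Hypothetical example: a star with one hub (node 0) and four leaves.
star = [(0, leaf) for leaf in range(1, 5)]
mean_degree, edge_based, node_based = friendship_paradox_stats(star)
print(mean_degree, edge_based, node_based)
```

On this star the mean degree is 1.6, while the edge-based and node-based friend averages are 2.5 and 3.4. Both versions hold when the attribute is degree; the abstract's point is that the node-based version can fail for other attributes, even ones highly correlated with degree.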

Dynamic Interventions for Networked Contagions

Authors

Marios Papachristou,Siddhartha Banerjee,Jon Kleinberg

Published Date

2023/4/30

We study the problem of designing dynamic intervention policies for minimizing cascading failures in online financial networks, as well as more general demand-supply networks. Formally, we consider a dynamic version of the celebrated Eisenberg-Noe model of financial network liabilities, and use this to study the design of external intervention policies. Our controller has a fixed resource budget in each round, and can use this to minimize the effect of demand/supply shocks in the network. We formulate the optimal intervention problem as a Markov Decision Process, and show how we can leverage the problem structure to efficiently compute optimal intervention policies with continuous interventions, and give approximation algorithms in the case of discrete interventions. Going beyond financial networks, we argue that our model captures dynamic network intervention in a much broader class of dynamic demand …

Foundations of data science

Authors

Avrim Blum,John Hopcroft,Ravindran Kannan

Published Date

2020/1/23

Authors

Karim Hamade,Reid McIlroy-Young,Siddhartha Sen,Jon Kleinberg,Ashton Anderson

Published Date

2023/10/13

Powerful artificial intelligence systems are often used in settings where they must interact with agents that are computationally much weaker, for example when they work alongside humans or operate in complex environments where some tasks are handled by algorithms, heuristics, or other entities of varying computational power. For AI agents to successfully interact in these settings, however, achieving superhuman performance alone is not sufficient; they also need to account for suboptimal actions or idiosyncratic style from their less-skilled counterparts. We propose a formal evaluation framework for assessing the compatibility of near-optimal AI with interaction partners who may have much lower levels of skill; we use popular collaborative chess variants as model systems to study and develop AI agents that can successfully interact with lower-skill entities. Traditional chess engines designed to output near-optimal moves prove to be inadequate partners when paired with engines of various lower skill levels in this domain, as they are not designed to consider the presence of other agents. We contribute three methodologies to explicitly create skill-compatible AI agents in complex decision-making settings, and two chess game frameworks designed to foster collaboration between powerful AI agents and less-skilled partners. On these frameworks, our agents outperform state-of-the-art chess AI (based on AlphaZero) despite being weaker in conventional chess, demonstrating that skill-compatibility is a tangible trait that is qualitatively and measurably distinct from raw performance. Our evaluations further explore and clarify the mechanisms by …

Calibrated recommendations for users with decaying attention

Authors

Jon Kleinberg,Emily Ryu,Éva Tardos

Journal

arXiv preprint arXiv:2302.03239

Published Date

2023/2/7

Recommendation systems capable of providing diverse sets of results are a focus of increasing importance, with motivations ranging from fairness to novelty and other aspects of optimizing user experience. One form of diversity of recent interest is calibration, the notion that personalized recommendations should reflect the full distribution of a user's interests, rather than a single predominant category -- for instance, a user who mainly reads entertainment news but also wants to keep up with news on the environment and the economy would prefer to see a mixture of these genres, not solely entertainment news. Existing work has formulated calibration as a subset selection problem; this line of work observes that the formulation requires the unrealistic assumption that all recommended items receive equal consideration from the user, but leaves as an open question the more realistic setting in which user attention decays as they move down the list of results. In this paper, we consider calibration with decaying user attention under two different models. In both models, there is a set of underlying genres that items can belong to. In the first setting, where items are represented by fine-grained mixtures of genre percentages, we provide a (1-1/e)-approximation algorithm by extending techniques for constrained submodular optimization. In the second setting, where items are coarsely binned into a single genre each, we surpass the (1-1/e) barrier imposed by submodular maximization and give a 2/3-approximate greedy algorithm. Our work thus addresses the problem of capturing ordering effects due to decaying attention, allowing for the extension of …

Human bias in algorithm design

Authors

Carey K Morewedge,Sendhil Mullainathan,Haaya F Naushan,Cass R Sunstein,Jon Kleinberg,Manish Raghavan,Jens O Ludwig

Journal

Nature Human Behaviour

Published Date

2023/11

Algorithms are designed to learn user preferences by observing user behaviour. This causes algorithms to fail to reflect user preferences when psychological biases affect user decision making. For algorithms to enhance social welfare, algorithm design needs to be psychologically informed.

Fairness in model-sharing games

Authors

Kate Donahue,Jon Kleinberg

Published Date

2023/4/30

In many real-world situations, data is distributed across multiple self-interested agents. These agents can collaborate to build a machine learning model based on data from multiple agents, potentially reducing the error each experiences. However, sharing models in this way raises questions of fairness: to what extent can the error experienced by one agent be significantly lower than the error experienced by another agent in the same coalition? In this work, we consider two notions of fairness that each may be appropriate in different circumstances: egalitarian fairness (which aims to bound how dissimilar error rates can be) and proportional fairness (which aims to reward players for contributing more data). We similarly consider two common methods of model aggregation, one where a single model is created for all agents (uniform), and one where an individualized model is created for each agent. For egalitarian …

The inversion problem: Why algorithms should infer mental state and not just predict behavior

Authors

Jon Kleinberg,Jens Ludwig,Sendhil Mullainathan,Manish Raghavan

Journal

Perspectives on Psychological Science

Published Date

2023/10/6

Authors

Rediet Abebe,Nicole Immorlica,Jon Kleinberg,Brendan Lucier,Ali Shirali

Published Date

2022/7/12

The tendency for individuals to form social ties with others who are similar to themselves, known as homophily, is one of the most robust sociological principles. Since this phenomenon can lead to patterns of interactions that segregate people along different demographic dimensions, it can also lead to inequalities in access to information, resources, and opportunities. As we consider potential interventions that might alleviate the effects of segregation, we face the challenge that homophily constitutes a pervasive and organic force that is difficult to push back against. Designing effective interventions can therefore benefit from identifying counterbalancing social processes that might be harnessed to work in opposition to segregation. In this work, we show that triadic closure---another common phenomenon that posits that individuals with a mutual connection are more likely to be connected to one another---can be one …

Containing the spread of a contagion on a tree

Authors

Michela Meister,Jon Kleinberg

Journal

arXiv preprint arXiv:2210.13247

Published Date

2022/10/24

Contact tracing can be thought of as a race between two processes: an infection process and a tracing process. In this paper, we study a simple model of infection spreading on a tree, and a tracer who stabilizes one node at a time. We focus on the question, how should the tracer choose nodes to stabilize so as to prevent the infection from spreading further? We study simple policies, which prioritize nodes based on time, infectiousness, or probability of generating new contacts.

Four years of FAccT: A reflexive, mixed-methods analysis of research contributions, shortcomings, and future prospects

Authors

Benjamin Laufer,Sameer Jain,A Feder Cooper,Jon Kleinberg,Hoda Heidari

Published Date

2022/6/21

Fairness, Accountability, and Transparency (FAccT) for socio-technical systems has been a thriving area of research in recent years. An ACM conference bearing the same name has been the central venue for scholars in this area to come together, provide peer feedback to one another, and publish their work. This reflexive study aims to shed light on FAccT’s activities to date and identify major gaps and opportunities for translating contributions into broader positive impact. To this end, we utilize a mixed-methods research design. On the qualitative front, we develop a protocol for reviewing and coding prior FAccT papers, tracing their distribution of topics, methods, datasets, and disciplinary roots. We also design and administer a questionnaire to reflect the voices of FAccT community members and affiliates on a wide range of topics. On the quantitative front, we use the full text and citation network associated with …

Exporting Geography Into A Virtual Landscape: A Global Pandemic Locally Discussed

Authors

Katherine Van Koevering,Yiquan Hong,Jon Kleinberg

Journal

arXiv preprint arXiv:2210.07187

Published Date

2022/10/13

The COVID-19 pandemic has been a global health crisis playing out in the age of social media. Even though the virtual environment makes interaction possible regardless of physical location, many of the most pressing issues during the pandemic -- case counts, lockdown policies, vaccine availability -- have played out in an intensely local fashion. Reflecting this locality, many of the online COVID communities that formed have been closely tied to physical location, at different spatial scales ranging from cities to countries to entire global platforms. This provides an opportunity to study how the real-world geography of the pandemic translates into a virtual landscape. By analyzing almost 300 geographically-linked COVID discussion communities on Reddit, we show how these discussions were organized geographically and temporally in three aspects: what were people talking about, who were they talking about it with, and how did they self-organize these conversations?

Allocating stimulus checks in times of crisis

Authors

Marios Papachristou,Jon Kleinberg

Published Date

2022/4/25

We study the problem of financial assistance (bailouts, stimulus payments, or subsidy allocations) in a network where individuals experience income shocks. These questions are pervasive both in policy domains and in the design of new Web-enabled forms of financial interaction. We build on the financial clearing framework of Eisenberg and Noe that allows the incorporation of a bailout policy that is based on discrete bailouts motivated by stimulus programs in both off-line and on-line settings. We show that optimally allocating such bailouts on a financial network in order to maximize a variety of social welfare objectives of this form is a computationally intractable problem. We develop approximation algorithms to optimize these objectives and establish guarantees for their approximation ratios. Then, we incorporate multiple fairness constraints in the optimization problems and study their boundedness. Finally, we …
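To illustrate the clearing framework of Eisenberg and Noe that this abstract builds on, the sketch below computes clearing payments by fixed-point iteration: each node pays the smaller of what it owes and what it has (external assets plus payments received), and debts are repaid pro rata. This is a simplified static version under stated assumptions, not the paper's bailout algorithm; the function name `clearing_payments`, the three-node network, and the bailout amount are all hypothetical.

```python
def clearing_payments(liabilities, external_assets, tol=1e-12, max_iter=10_000):
    """Fixed-point iteration for Eisenberg-Noe style clearing payments.

    liabilities[i][j] is the amount node i owes node j.
    external_assets[i] is node i's cash from outside the network.
    """
    n = len(liabilities)
    total_owed = [sum(row) for row in liabilities]
    payments = total_owed[:]  # start by assuming everyone pays in full
    for _ in range(max_iter):
        new_payments = []
        for i in range(n):
            # Each debtor j splits its payment pro rata among its creditors.
            inflow = sum(
                liabilities[j][i] / total_owed[j] * payments[j]
                for j in range(n)
                if total_owed[j] > 0
            )
            available = max(0.0, external_assets[i] + inflow)
            new_payments.append(min(total_owed[i], available))
        if max(abs(a - b) for a, b in zip(new_payments, payments)) < tol:
            return new_payments
        payments = new_payments
    return payments


# Hypothetical chain: node 0 owes node 1 ten units, node 1 owes node 2 ten units.
L = [[0, 10, 0],
     [0, 0, 10],
     [0, 0, 0]]
print(clearing_payments(L, [4, 2, 0]))   # node 0 defaults, which drags node 1 down
print(clearing_payments(L, [10, 2, 0]))  # same network after a bailout of 6 to node 0
```

In this toy chain a single bailout to node 0 removes both defaults, which hints at why the choice of who receives a discrete bailout matters for welfare objectives defined on the whole network.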

Learning models of individual behavior in chess

Authors

Reid McIlroy-Young,Russell Wang,Siddhartha Sen,Jon Kleinberg,Ashton Anderson

Published Date

2022/8/14

AI systems that can capture human-like behavior are becoming increasingly useful in situations where humans may want to learn from these systems, collaborate with them, or engage with them as partners for an extended duration. In order to develop human-oriented AI systems, the problem of predicting human actions---as opposed to predicting optimal actions---has received considerable attention. Existing work has focused on capturing human behavior in an aggregate sense, which potentially limits the benefit any particular individual could gain from interaction with these systems. We extend this line of work by developing highly accurate predictive models of individual human behavior in chess. Chess is a rich domain for exploring human-AI interaction because it combines a unique set of properties: AI systems achieved superhuman performance many years ago, and yet humans still interact with them closely …

Measuring the completeness of economic models

Authors

Drew Fudenberg,Jon Kleinberg,Annie Liang,Sendhil Mullainathan

Journal

Journal of Political Economy

Published Date

2021/1/18

Authors

Rediet Abebe,Jon Kleinberg,Andrew Wang

Published Date

2021

Poverty and economic hardship are multi-faceted and dynamic phenomena impacting over 50 million people in the United States and billions of people world-wide. Despite the prevalence of poverty, there remains much to be understood about what makes families susceptible to experiencing economic distress and what interventions may be effective and for which families. An important set of questions is related to the role of income shocks. Shocks may constitute unexpected expenses such as a medical bill or a parking ticket or interruptions to one’s income flow, such as a delayed paycheck or loss of public benefits. Recently these phenomena have garnered increased attention, with a growing body of empirical and computational work showing their impact on various measures of socioeconomic welfare. We present a computational study of a large survey-based longitudinal data-set to understand the role of …

Algorithmic monoculture and social welfare

Authors

Jon Kleinberg,Manish Raghavan

Journal

Proceedings of the National Academy of Sciences

Published Date

2021/6/1

As algorithms are increasingly applied to screen applicants for high-stakes decisions in employment, lending, and other domains, concerns have been raised about the effects of algorithmic monoculture, in which many decision-makers all rely on the same algorithm. This concern invokes analogies to agriculture, where a monocultural system runs the risk of severe harm from unexpected shocks. Here, we show that the dangers of algorithmic monoculture run much deeper, in that monocultural convergence on a single algorithm by a group of decision-making agents, even when the algorithm is more accurate for any one agent in isolation, can reduce the overall quality of the decisions being made by the full collection of agents. Unexpected shocks are therefore not needed to expose the risks of monoculture; it can hurt accuracy even under “normal” operations and even for algorithms that are more accurate when …

Polarization in geometric opinion dynamics

Authors

Jason Gaitonde,Jon Kleinberg,Éva Tardos

Published Date

2021/7/18

In light of increasing recent attention to political polarization, understanding how polarization can arise poses an important theoretical question. While more classical models of opinion dynamics seem poorly equipped to study this phenomenon, a recent novel approach by Hązła, Jin, Mossel, and Ramnarayan (HJMR) proposes a simple geometric model of opinion evolution that provably exhibits strong polarization in specialized cases. Moreover, polarization arises quite organically in their model: in each time step, each agent updates opinions according to their correlation/response with an issue drawn at random. However, their techniques do not seem to extend beyond a set of special cases they identify, which benefit from fragile symmetry or contractiveness assumptions, leaving open how general this phenomenon really is. In this paper, we further the study of polarization in related geometric models. We show …

Stochastic model for sunk cost bias

Authors

Jon Kleinberg,Sigal Oren,Manish Raghavan,Nadav Sklar

Published Date

2021/12/1

We present a novel model for capturing the behavior of an agent exhibiting sunk-cost bias in a stochastic environment. Agents exhibiting sunk-cost bias take into account the effort they have already spent on an endeavor when they evaluate whether to continue or abandon it. We model planning tasks in which an agent with this type of bias tries to reach a designated goal. Our model structures this problem as a type of Markov decision process: loosely speaking, the agent traverses a directed acyclic graph with probabilistic transitions, paying costs for its actions as it tries to reach a target node containing a specified reward. The agent’s sunk cost bias is modeled by a cost that it incurs for abandoning the traversal: if the agent decides to stop traversing the graph, it incurs a cost of λ · C_sunk, where λ ≥ 0 is a parameter that captures the extent of the bias and C_sunk is the sum of costs already invested. We analyze the behavior of two types of agents: naive agents that are unaware of their bias, and sophisticated agents that are aware of it. Since optimal (bias-free) behavior in this problem can involve abandoning the traversal before reaching the goal, the bias exhibited by these types of agents can result in sub-optimal behavior by shifting their decisions about abandonment. We show that in contrast to optimal agents, it is computationally hard to compute the optimal policy for a sophisticated agent. Our main results quantify the loss exhibited by these two types of agents with respect to an optimal agent. We present both general and topology-specific bounds.

Model-sharing games: Analyzing federated learning under voluntary participation

Authors

Kate Donahue,Jon Kleinberg

Journal

AAAI 2021

Published Date

2020/10/2

Federated learning is a setting where agents, each with access to their own data source, combine models learned from local data to create a global model. If agents are drawing their data from different distributions, though, federated learning might produce a biased global model that is not optimal for each agent. This means that agents face a fundamental question: should they join the global model or stay with their local model? In this work, we show how this situation can be naturally analyzed through the framework of coalitional game theory. Motivated by these considerations, we propose the following game: there are heterogeneous players with different model parameters governing their data distribution and different amounts of data they have noisily drawn from their own distribution. Each player's goal is to obtain a model with minimal expected mean squared error (MSE) on their own distribution. They have a choice of fitting a model based solely on their own data, or combining their learned parameters with those of some subset of the other players. Combining models reduces the variance component of their error through access to more data, but increases the bias because of the heterogeneity of distributions. In this work, we derive exact expected MSE values for problems in linear regression and mean estimation. We use these values to analyze the resulting game in the framework of hedonic game theory; we study how players might divide into coalitions, where each set of players within a coalition jointly constructs a single model. In a case with arbitrarily many players that each have either a "small" or "large" amount of data, we …

The generalized mean densest subgraph problem

Authors

Nate Veldt,Austin R Benson,Jon Kleinberg

Published Date

2021/8/14

Finding dense subgraphs of a large graph is a standard problem in graph mining that has been studied extensively both for its theoretical richness and its many practical applications. In this paper we introduce a new family of dense subgraph objectives, parameterized by a single parameter p, based on computing generalized means of degree sequences of a subgraph. Our objective captures both the standard densest subgraph problem and the maximum k-core as special cases, and provides a way to interpolate between and extrapolate beyond these two objectives when searching for other notions of dense subgraphs. In terms of algorithmic contributions, we first show that our objective can be minimized in polynomial time for all p ≥ 1 using repeated submodular minimization. A major contribution of our work is analyzing the performance of different types of peeling algorithms for dense subgraphs both in theory …
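The objective described here, a generalized mean of the degree sequence of a subgraph, can be written down in a few lines. The sketch below is an illustration under stated assumptions: it evaluates the p-mean of induced degrees for p ≥ 1 and pairs it with the standard min-degree peeling heuristic as a simple baseline. It is not the peeling variant analyzed in the paper, and the names `p_mean_density` and `greedy_peel` and the example graph are hypothetical.

```python
def p_mean_density(adj, nodes, p):
    """Generalized p-mean of degrees in the subgraph induced by `nodes` (p >= 1)."""
    nodes = set(nodes)
    degs = [len(adj[u] & nodes) for u in nodes]
    return (sum(d ** p for d in degs) / len(degs)) ** (1.0 / p)


def greedy_peel(adj, p=1.0):
    """Repeatedly remove a minimum-degree node; keep the best subgraph seen."""
    remaining = set(adj)
    best_set = set(remaining)
    best_val = p_mean_density(adj, remaining, p)
    while len(remaining) > 1:
        # Ties are broken by node id so the run is deterministic.
        u = min(remaining, key=lambda x: (len(adj[x] & remaining), x))
        remaining.remove(u)
        val = p_mean_density(adj, remaining, p)
        if val > best_val:
            best_set, best_val = set(remaining), val
    return best_set, best_val


# Hypothetical graph: a 4-clique {0, 1, 2, 3} with a path 3 - 4 - 5 attached.
adj = {
    0: {1, 2, 3},
    1: {0, 2, 3},
    2: {0, 1, 3},
    3: {0, 1, 2, 4},
    4: {3, 5},
    5: {4},
}
print(greedy_peel(adj, p=1.0))
```

With p = 1 the objective is the average induced degree, which is twice the usual edges-per-node density, so its maximizer is a standard densest subgraph. On this example the peeling baseline returns the 4-clique with value 3.0.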

Hypergraph ego-networks and their temporal evolution

Authors

Cazamere Comrie,Jon Kleinberg

Published Date

2021/12/7

Interactions involving multiple objects simultaneously are ubiquitous across many domains. The systems these interactions inhabit can be modelled using hypergraphs, a generalization of traditional graphs in which each edge can connect any number of nodes. Analyzing the global and static properties of these hypergraphs has led to a plethora of novel findings regarding how these modelled systems are structured. However, less is known about the localized structure of these systems and how they evolve over time. In this paper, we propose the study of hypergraph ego-networks, a structure that can be used to model higher-order interactions involving a single node. We also propose the temporal reconstruction of hypergraph ego-networks as a benchmark problem for models that aim to predict the local temporal structure of hypergraphs. By combining a deep learning binary classifier with a hill-climbing algorithm …

Optimal stopping with behaviorally biased agents: The role of loss aversion and changing reference points

Authors

Jon Kleinberg,Robert Kleinberg,Sigal Oren

Published Date

2021/7/18

One of the central human biases studied in behavioral economics is reference dependence - people's tendency to evaluate an outcome not in absolute terms but instead relative to a reference point that reflects some notion of the status quo [4]. Reference dependence interacts closely with a related behavioral bias, loss aversion, in which people weigh losses more strongly than gains of comparable absolute values. Taken together, these two effects produce a fundamental behavioral regularity in human choices: once a reference point has been established, people tend to avoid outcomes in which they experience a loss relative to the reference point. A well-known instance of the effect is the empirical evidence that individual investors will tend to avoid selling a stock unless it has exceeded the price at which they purchased it. In more complex examples, the reference may shift while an agent is making a decision …

Planted hitting set recovery in hypergraphs

Authors

Ilya Amburg,Jon Kleinberg,Austin R Benson

Authors

Rediet Abebe,T-H HUBERT Chan,Jon Kleinberg,Zhibin Liang,David Parkes,Mauro Sozio,Charalampos E Tsourakakis

Journal

ACM Transactions on Knowledge Discovery from Data (TKDD)

Published Date

2021/7/21

A long line of work in social psychology has studied variations in people’s susceptibility to persuasion—the extent to which they are willing to modify their opinions on a topic. This body of literature suggests an interesting perspective on theoretical models of opinion formation by interacting parties in a network: in addition to considering interventions that directly modify people’s intrinsic opinions, it is also natural to consider interventions that modify people’s susceptibility to persuasion. In this work, motivated by this fact, we propose an influence optimization problem. Specifically, we adopt a popular model for social opinion dynamics, where each agent has some fixed innate opinion, and a resistance that measures the importance it places on its innate opinion; agents influence one another’s opinions through an iterative process. Under certain conditions, this iterative process converges to some equilibrium opinion …
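The opinion dynamics described in this abstract can be sketched as a simple iteration. The version below assumes one common form of such models: each agent's next opinion is a weighted combination of its innate opinion (weighted by its resistance) and the plain average of its neighbors' current opinions. The exact update rule in the paper may differ, and the function name `equilibrium_opinions` and the two-agent example are hypothetical.

```python
def equilibrium_opinions(adj, innate, resistance, tol=1e-12, max_iter=100_000):
    """Iterate the opinion update until the opinions stop changing.

    adj[i] is the set of neighbors of agent i (assumed non-empty),
    innate[i] is the fixed innate opinion of agent i,
    resistance[i] in (0, 1] is the weight agent i puts on its innate opinion.
    """
    z = dict(innate)
    for _ in range(max_iter):
        new_z = {}
        for i in adj:
            nbr_avg = sum(z[j] for j in adj[i]) / len(adj[i])
            new_z[i] = resistance[i] * innate[i] + (1 - resistance[i]) * nbr_avg
        if max(abs(new_z[i] - z[i]) for i in adj) < tol:
            return new_z
        z = new_z
    return z


# Hypothetical example: two connected agents with innate opinions 0 and 1.
adj = {"a": {"b"}, "b": {"a"}}
innate = {"a": 0.0, "b": 1.0}

baseline = equilibrium_opinions(adj, innate, {"a": 0.5, "b": 0.5})
# Intervention: make agent "a" more susceptible (lower resistance).
shifted = equilibrium_opinions(adj, innate, {"a": 0.1, "b": 0.5})
print(baseline, shifted)
```

In the baseline the equilibrium opinions are 1/3 and 2/3. Lowering the resistance of agent "a" pulls both equilibrium opinions toward agent "b", which is the kind of lever the influence optimization problem in the abstract considers: changing susceptibility rather than innate opinions.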

Optimality and stability in federated learning: A game-theoretic approach

Authors

Kate Donahue,Jon Kleinberg

Journal

NeurIPS 2021

Published Date

2021/6/17

Federated learning is a distributed learning paradigm where multiple agents, each only with access to local data, jointly learn a global model. There has recently been an explosion of research aiming not only to improve the accuracy rates of federated learning, but also provide certain guarantees around social good properties such as total error. One branch of this research has taken a game-theoretic approach, and in particular, prior work has viewed federated learning as a hedonic game, where error-minimizing players arrange themselves into federating coalitions. This past work proves the existence of stable coalition partitions, but leaves open a wide range of questions, including how far from optimal these stable solutions are. In this work, we motivate and define a notion of optimality given by the average error rates among federating agents (players). First, we provide and prove the correctness of an efficient algorithm to calculate an optimal (error minimizing) arrangement of players. Next, we analyze the relationship between the stability and optimality of an arrangement. First, we show that for some regions of parameter space, all stable arrangements are optimal (Price of Anarchy equal to 1). However, we show this is not true for all settings: there exist examples of stable arrangements with higher cost than optimal (Price of Anarchy greater than 1). Finally, we give the first constant-factor bound on the performance gap between stability and optimality, proving that the total error of the worst stable solution can be no higher than 9 times the total error of an optimal solution (Price of Anarchy bound of 9).

Roles for computing in social change

Authors

Rediet Abebe,Solon Barocas,Jon Kleinberg,Karen Levy,Manish Raghavan,David G Robinson

Published Date

2020/1/27

A recent normative turn in computer science has brought concerns about fairness, bias, and accountability to the core of the field. Yet recent scholarship has warned that much of this technical work treats problematic features of the status quo as fixed, and fails to address deeper patterns of injustice and inequality. While acknowledging these critiques, we posit that computational research has valuable roles to play in addressing social problems --- roles whose value can be recognized even from a perspective that aspires toward fundamental social change. In this paper, we articulate four such roles, through an analysis that considers the opportunities as well as the significant risks inherent in such work. Computing research can serve as a diagnostic, helping us to understand and measure social problems with precision and clarity. As a formalizer, computing shapes how social problems are explicitly defined …

Minimizing localized ratio cut objectives in hypergraphs

Authors

Nate Veldt,Austin R Benson,Jon Kleinberg

Published Date

2020/8/23

Hypergraphs are a useful abstraction for modeling multiway relationships in data, and hypergraph clustering is the task of detecting groups of closely related nodes in such data. Graph clustering has been studied extensively, and there are numerous methods for detecting small, localized clusters without having to explore an entire input graph. However, there are only a few specialized approaches for localized clustering in hypergraphs. Here we present a framework for local hypergraph clustering based on minimizing localized ratio cut objectives. Our framework takes an input set of reference nodes in a hypergraph and solves a sequence of hypergraph minimum s-t cut problems in order to identify a nearby well-connected cluster of nodes that overlaps substantially with the input set. Our methods extend graph-based techniques but are significantly more general and have new output quality guarantees. First, our …

An economic perspective on algorithmic fairness

Authors

Ashesh Rambachan,Jon Kleinberg,Jens Ludwig,Sendhil Mullainathan

Journal

AEA Papers and Proceedings

Published Date

2020/5/1

There are widespread concerns that the growing use of machine learning algorithms in important decisions may reproduce and reinforce existing discrimination against legally protected groups. Most of the attention to date on issues of “algorithmic bias” or “algorithmic fairness” has come from computer scientists and machine learning researchers. We argue that concerns about algorithmic fairness are at least as much about questions of how discrimination manifests itself in data, decision-making under uncertainty, and optimal regulation. To fully answer these questions, an economic framework is necessary--and as a result, economists have much to contribute.

Fairness and utilization in allocating resources with uncertain demand

Authors

Kate Donahue,Jon Kleinberg

Published Date

2020/1/27

Resource allocation problems are a fundamental domain in which to evaluate the fairness properties of algorithms. The trade-offs between fairness and utilization have a long history in this domain. A recent line of work has considered fairness questions for resource allocation when the demands for the resource are distributed across multiple groups and drawn from probability distributions. In such cases, a natural fairness requirement is that individuals from different groups should have (approximately) equal probabilities of receiving the resource. A largely open question in this area has been to bound the gap between the maximum possible utilization of the resource and the maximum possible utilization subject to this fairness condition. Here, we obtain some of the first provable upper bounds on this gap. We obtain an upper bound for arbitrary distributions, as well as much stronger upper bounds for specific …

Aligning superhuman AI with human behavior: Chess as a model system

Authors

Reid McIlroy-Young,Siddhartha Sen,Jon Kleinberg,Ashton Anderson

Published Date

2020/8/23

As artificial intelligence becomes increasingly intelligent---in some cases, achieving superhuman performance---there is growing potential for humans to learn from and collaborate with algorithms. However, the ways in which AI systems approach problems are often different from the ways people do, and thus may be uninterpretable and hard to learn from. A crucial step in bridging this gap between human and artificial intelligence is modeling the granular actions that constitute human behavior, rather than simply matching aggregate human performance. We pursue this goal in a model system with a long history in artificial intelligence: chess. The aggregate performance of a chess player unfolds as they make decisions over the course of a game. The hundreds of millions of games played online by players at every skill level form a rich source of data in which these decisions, and their exact context, are recorded in …

Frozen binomials on the web: Word ordering and language conventions in online text

Authors

Katherine Van Koevering,Austin R Benson,Jon Kleinberg

Published Date

2020/4/20

There is inherent information captured in the order in which we write words in a list. The orderings of binomials (lists of two words separated by ‘and’ or ‘or’) have been studied for more than a century. These binomials are common across many areas of speech, in both formal and informal text. In the last century, numerous explanations have been given to describe what order people use for these binomials, from differences in semantics to differences in phonology. These rules describe primarily ‘frozen’ binomials that exist in exactly one ordering and have lacked large-scale trials to determine efficacy. Text in online social media such as Reddit provides a unique opportunity to study these lists in the context of informal text at a very large scale. In this work, we expand the view of binomials to include a large-scale analysis of both frozen and non-frozen binomials in a quantitative way. Using this data, we then …

Algorithmic classification and strategic effort

Authors

Jon Kleinberg,Manish Raghavan

Journal

ACM SIGecom Exchanges

Authors

Cristian Danescu-Niculescu-Mizil,Lillian Lee,Bo Pang,Jon Kleinberg

Published Date

2020

The text that follows will seem unusual to readers familiar with Réseaux, both because it was written by computer science researchers and because it contains mathematical equations, which are uncommon in this journal. Nevertheless, we chose to translate this article because we believe it makes an important contribution to social science inquiry based on textual traces from the web. It is emblematic of the growing interest that some computer science researchers, particularly those who specialize in network theory, are taking in objects commonly studied by the social sciences. The article deals with an unmistakably sociological object: power relations in social interactions. Adopting a sociolinguistic perspective, the authors set out to show that participants in an interaction emit linguistic signals that express the power relationship established between themselves and their interlocutors. Put briefly, an individual talking with someone of higher social status will tend to systematically reuse some of the terms that the other person uses. The authors argue that such signals can be detected whatever the topic of discussion (discussions among editors on Wikipedia, or exchanges between lawyers and justices of the United States Supreme Court), and that they can be quantified at large scale.

Subsidy allocations in the presence of income shocks

Authors

Rediet Abebe,Jon Kleinberg,S Matthew Weinberg

Journal

Proceedings of the AAAI Conference on Artificial Intelligence

Published Date

2020/4/3

Poverty and economic hardship are understood to be highly complex and dynamic phenomena. Due to the multi-faceted nature of welfare, assistance programs targeted at alleviating hardship can face challenges, as they often rely on simpler welfare measurements, such as income or wealth, that fail to capture the full complexity of each family's state. Here, we explore one important dimension: susceptibility to income shocks. We introduce a model of welfare that incorporates income, wealth, and income shocks and analyze this model to show that it can vary, at times substantially, from measures of welfare that only use income or wealth. We then study the algorithmic problem of optimally allocating subsidies in the presence of income shocks. We consider two well-studied objectives: the first aims to minimize the expected number of agents that fall below a given welfare threshold (a min-sum objective) and the second aims to minimize the likelihood that the most vulnerable agent falls below this threshold (a min-max objective). We present optimal and near-optimal algorithms for various general settings. We close with a discussion on future directions on allocating societal resources and ethical implications of related approaches.

Opinion dynamics with varying susceptibility to persuasion via non-convex local search

Authors

Rediet Abebe,Jon Kleinberg,David Parkes,Charalampos E Tsourakakis

Published Date

2018/7/19

A long line of work in social psychology has studied variations in people's susceptibility to persuasion -- the extent to which they are willing to modify their opinions on a topic. This body of literature suggests an interesting perspective on theoretical models of opinion formation on social networks: in addition to considering interventions that directly modify people's intrinsic opinions, it is also natural to consider those that modify people's susceptibility to persuasion. Here, we adopt a popular model for social opinion dynamics, and formalize the opinion maximization and minimization problems where interventions happen at the level of susceptibility. We show that modeling interventions at the level of susceptibility leads to an interesting family of new questions in network opinion dynamics. We find that the questions are quite different depending on whether there is an overall budget constraining the number of agents we …

Adversarial perturbations of opinion dynamics in networks

Authors

Jason Gaitonde,Jon Kleinberg,Eva Tardos

Published Date

2020/7/13

In this paper, we study the connections between network structure, opinion dynamics, and an adversary's power to artificially induce disagreements. We approach these questions by extending models of opinion formation in the mathematical social sciences to represent scenarios, familiar from recent events, in which external actors have sought to destabilize communities through sophisticated information warfare tactics via fake news and bots. In many instances, the intrinsic goals of these efforts are not necessarily to shift the overall sentiment of the network towards a particular policy, but rather to induce discord. These perturbations will diffuse via opinion dynamics on the underlying network, through mechanisms that have been analyzed and abstracted through work in computer science and the social sciences. Here we investigate the properties of such attacks, considering optimal strategies both for the adversary …

Mitigating bias in algorithmic hiring: Evaluating claims and practices

Authors

Manish Raghavan,Solon Barocas,Jon Kleinberg,Karen Levy

Published Date

2020/1/27

There has been rapidly growing interest in the use of algorithms in hiring, especially as a means to address or mitigate bias. Yet, to date, little is known about how these methods are used in practice. How are algorithmic assessments built, validated, and examined for bias? In this work, we document and analyze the claims and practices of companies offering algorithms for employment assessment. In particular, we identify vendors of algorithmic pre-employment assessments (i.e., algorithms to screen candidates), document what they have disclosed about their development and validation procedures, and evaluate their practices, focusing particularly on efforts to detect and mitigate bias. Our analysis considers both technical and legal perspectives. Technically, we consider the various choices vendors make regarding data collection and prediction targets, and explore the risks and trade-offs that these choices pose …

How do classifiers induce agents to invest effort strategically?

Authors

Jon Kleinberg,Manish Raghavan

Journal

ACM Transactions on Economics and Computation (TEAC)

Published Date

2020/10/16

Algorithms are often used to produce decision-making rules that classify or evaluate individuals. When these individuals have incentives to be classified a certain way, they may behave strategically to influence their outcomes. We develop a model for how strategic agents can invest effort in order to change the outcomes they receive, and we give a tight characterization of when such agents can be incentivized to invest specified forms of effort into improving their outcomes as opposed to “gaming” the classifier. We show that whenever any “reasonable” mechanism can do so, a simple linear mechanism suffices.

An economic approach to regulating algorithms

Authors

Ashesh Rambachan,Jon Kleinberg,Sendhil Mullainathan,Jens Ludwig

Published Date

2020/5/11

There is growing concern about "algorithmic bias": that predictive algorithms used in decision-making might bake in or exacerbate discrimination in society. When will these "biases" arise? What should be done about them? We argue that such questions are naturally answered using the tools of welfare economics: a social welfare function for the policymaker, a private objective function for the algorithm designer and a model of their information sets and interaction. We build such a model that allows the training data to exhibit a wide range of "biases." Prevailing wisdom is that biased data change how the algorithm is trained and whether an algorithm should be used at all. In contrast, we find two striking irrelevance results. First, when the social planner builds the algorithm, her equity preference has no effect on the training procedure. So long as the data, however biased, contain signal, they will be used and the algorithm built on top will be the same. Any characteristic that is predictive of the outcome of interest, including group membership, will be used. Second, we study how the social planner regulates private (possibly discriminatory) actors building algorithms. Optimal regulation depends crucially on the disclosure regime. Absent disclosure, algorithms are regulated much like human decision-makers: disparate impact and disparate treatment rules dictate what is allowed. In contrast, under stringent disclosure of all underlying algorithmic inputs (data, training procedure and decision rule), once again we find an irrelevance result: private actors can use any predictive characteristic. Additionally, now algorithms strictly reduce the extent of …
