You can edit almost every page by Creating an account and confirming your email.

Epistemic artificial intelligence

From EverybodyWiki Bios & Wiki







Uncertainty quantification in artificial intelligence

There is growing recognition in artificial intelligence of the need for being able to quantify how certain AI models are when they make a prediction, and estimate or a decision.[1][2].

Limitations of current AI

Typically, AI/machine learning models (for instance, neural networks) are trained to produce an output (prediction) in response to an input data point - however, there is no indication of the likelihood of this output to be correct. This issue is known in machine learning as calibration. This is a serious stumbling block towards the deployment of AI systems in mission-critical situations[3], such as, e.g., clinical data analysis[4], autonomous driving[5], drug discovery[6] or engineering[7].

Recent advances in large language models (LLMs) and image and video generation (e.g., diffusion models), in particular, have exposed fundamental limitations of even cutting-edge AI models[8]. How much are we supposed to believe the response to a query from an LLM?[9] Hallucinations[10], i.e., situations in which language models simply make up answers which are false or inaccurate, is possibly the central weakness of this class of models and or chatbots using them as an engine[11]. Related to this, generative models (including, for instance, diffusion models[12]) are not very creative, in the sense that the variety of their outputs is typically much lower than that of the training set.

More in general, modern machine learning systems often make inaccurate and/or overconfident predictions when fed data which does not fit their original training distribution, including out-of-domain samples[13] or adversarial attacks[14].

Aleatoric vs epistemic uncertainty

Researchers in this field often make a distinction between aleatoric uncertainty, which arises from the intrinsic, irreducible variability of the data, and epistemic uncertainty, induced by limited knowledge of the data-generating process[15]. This distinction reflects a similar taxonomy in field such as economics, for instance, where (epistemic, or Knightian) uncertainty is considered distinct from risk (aleatoric probability)[16][17].

A simple example can help. Consider a lottery in which balls (red or white) are extracted from a bag. If one knows the exact percentage of red and white balls in the bag, the lottery can be described by a simple probability distribution (e.g., Prob(Red) = 0.7, Prob(White) = 0.3). In this case, the uncertainty involved is purely aleatoric. However, if one does not know the distribution of balls inside the bag, the uncertainty is on the true probability distribution driving the lottery itself: this uncertainty is called "epistemic" or "reducible",[18] for it stems from lack of knowledge or data about the process at hand. A classical example of epistemic uncertainty is associated with Ellberg's paradox[19].

The discussion around the difference between aleatoric and epistemic uncertainty in artificial intelligence, or even whether such a distinction makes sense at all, is currently attracting significant interest in the field[20], but also in other disciplines such as structural safety[21], but also management science[22].

Importance of uncertainty quantification

As we have seen, uncertainty quantification is key to several aspects of artificial intelligence: its reliability, creativity, truthfulness, dependability in critical real-world problems[23]. These can be classed into four sub-areas: robustness, adaptation, calibration and decision making.

Adversarial robustness

As mentioned, traditional neural networks (and machine learning models more in general) often suffer from overconfidence, i.e., they are prone to output wrong predictions with very high confidence scores[24], especially when test inputs are outside their training distribution. For instance, a neural network may claim an input image portraits a flower whereas it contains an animal, with a confidence of 99% (this is quite a common occurrence, in fact[25]).

This can be exploited by adversaries to manipulate ML models, in particular by feeding them subtly altered data to cause unintended, incorrect behaviour[26]. These alterations can be very subtle: for instance, it may suffice to add to an image another image representing low-intensity noise of a suitable kind, to throw the network/model entirely off course. Lack of robustness to adversarial attacked can render an AI model all but unusable[27].

Domain adaptation

Possibly the most important open challenge in machine learning is giving models the ability to adapt to test data outside the domain distribution of their training set (domain adaptation)[28], a subject of study closely related to statistical learning theory.

A classical situation is one in which a model (for instance, a computer vision object/person detector) is trained on a corpus of data collected in a given situation (e.g., surveillance videos from a specific bank in a specific city), to then being deployed in an entire different context (e.g., a supermarket in a different city or country). Collecting data (and annotating it, if one is an a supervised learning setting) is expensive and can take a very long time, making it difficult if not impossible to be able to train a model for every possible occurrence.

Whereas a variety of different mathematical techniques have been proposed to tackle the domain adaptation problem[29] (from minimax learning[30] to the use of custom loss functions[31], to adversarial strategies, to kernel methods[32]), models still falter under significant training-testing shifts. Uncertainty quantification can provide a principled way to addressing this issue[33][34].

Calibration

As we pointed out above, neural networks are typically uncalibrated[35], i.e., their predicted confidence rarely matches their observed empirical accuracy. The ideal situation, instead, is one in which the network/model is aware of the uncertainty of its prediction, so that it can transparently admit it does not know enough about it. Among the general public, too many users take AI predictions at face value, without realizing how dangerous this can be[36].

Techniques exist for dealing with calibration problems: for instance, Bayesian binning[37] (assigning predicted scores to a number of "bins"), Platt scaling (a different way of transforming the outputs of a classification model into a probability distribution over classes), temperature scaling, implicit regularization[38], data augmentation can improve this[39]. The most common metric for calibration[40] is the Expected Calibration Error (ECE), which quantifies the discrepancy between predicted probabilities and observed frequencies by dividing the prediction space into several bins based on the predicted probabilities. Other metrics exist, such as the Brier score.

Sequential decision-making

Uncertainty also affects sequential decision making settings (i.e., situations in which a machine needs to make a series of decisions to achieve its long-term goal), for instance reinforcement learning (RL, where the purpose is to maximize the expected future cumulative reward an environment generates in response to the agent's actions). Indeed, especially at the beginning of the process, the environment surrounding the RL agent, the reward the agent receives in response to its actions/decisions, and most importantly the expected cumulative reward of its sequence of decisions are subject to high uncertainty, both epistemic and aleatoric[41].

Generally speaking, epistemic uncertainty is significant when having to make predictions or decisions in the very long terms, for instance for climate change prediction[42] and adaptation[43].

Techniques have been proposed to deal with uncertainty quantification in reinforcement learning[44], for instance based on Bayesian Bellman formulations[45] or ensembles[46]. Still, a proper mathematical formulation of epistemic, second-order uncertainty in RL[47], starting from classical partially-observable Markov diffusion processes (POMDPs), is wanting.

Approaches to uncertainty quantification in AI

In response to the fundamental challenge of how uncertainty about AI models and their predictions should be mathematically modelled, a variety of different approaches have been proposed. The matter has been surveyed in a number of very interesting recent papers. The following classes of methods can be highlighted: traditional models, deep deterministic uncertainty approaches, ensemble methods, Bayesian deep learning, conformal prediction, interval and evidential techniques.

Traditional models, including classical neural networks and deep networks, do not quantify uncertainty at all, but make deterministic predictions assuming exact input-output relations[48].

Deep Deterministic Uncertainty (DDU)[49] and similar approaches[50] attempt to do so without changing the way a network is trained, but by estimating epistemic uncertainty via latent representation analysis or distance-sensitive functions rather than soft-max probabilities.

Bayesian Deep Learning (BDL)[51], instead, models network parameters as probability distributions[52]. Rather than learning a deterministic weight for each network connection, as in traditional models, the training of a Bayesian deep network results in a (joint) probability distribution over the weight. This does two things: boosts the expressive power of the network, and acknowledges the epistemic uncertainty associated with the learning process. BDL requires a number of steps, such as defining a suitable prior for the distribution of the weights[53] (which is often not obvious), designing a likelihood model[54], and generating from the trained posterior a predictive distribution over the output space of the network[55] (often by sampling from an approximated posterior). These design choices introduce arbitrary elements into the process. Also, Bayesian deep networks can be computationally expensive to train[56].

Bayesian models are sensitive to prior mis-specification[57] (which risks biasing the entire estimation process), and incur heavy computational overhead, whereas Bayesian Model Averaging (BMA, i.e., the idea of generating a prediction by sampling from the learnt posterior and then taking the mean)[58] runs the risk of diluting useful predictive information.

Ensemble methods such as Deep Ensembles (DE)[59] and Epistemic Neural Networks (ENN)[60], on the other hand, estimate uncertainty by aggregating predictions from multiple independently trained models/networks (often trained using a different set of hyperparameter values, rather than different slices of the data). Ensemble methods are also computationally demanding, and their theoretical justification remains weak, despite research advances[61].

Conformal prediction[62] is an elegant technique which outputs, for a given prediction problem, prediction sets in the output space, without an attached probability distribution. Thus, the outputs of conformal prediction are sets, rather than probabilities. This happens by computing empirical cumulative distributions of "nonconformity" scores (which measure how unusual or atypical a data point is compared to other examples) and applying a form of hypothesis testing. Unfortunately, conformal prediction[63] primarily captures aleatoric uncertainty, as it relies on applying frequentist hypothesis testing and on the assumption data exchangeability[64] (i.e., predictions need to be invariant to the ordering of the data). Nevertheless, extensions of conformal predictions to epistemic uncertainty[65][66] or which relax the exchangeability assumption[67] have recently been proposed.

Finally, evidential methods[68] predict second-order Dirichlet distribution parameters in the output space of the network, rather than point probabilities. A Dirichlet distribution is a family of probability distributions which, by virtue of its shape, is suitable to model "second-order" probability, and therefore uncertainty on the predicted probability vector. They have been applied to neural networks, decision trees (classifiers which output a class by traversing a suitable tree learned from the data), K-nearest neighbors frameworks[69], and evidential deep learning classifiers[70] in which feature vectors are converted into belief functions and aggregated using Dempster’s rule in a Dempster-Shafer (DS) layer. Evidential approaches intended in the narrow sense (i.e., those predicting Dirichlet second-order distributions), however, appear to violate asymptotic conditions and struggle with out-of-distribution data[71].

The figure below illustrates the major approaches to uncertainty in AI. The diagram on the right illustrates the fact that traditional, ensemble, Bayesian, evidential and epistemic predictions can all be seen as predicting a credal set (i.e., a convex set of probability distributions) in the output space of the model. This opens the way to comparing their performance homogenously as generators of credal sets, and to the concept of epistemic AI.

Major uncertainty quantification approaches in AI.

Epistemic artificial intelligence

A common limitation of most of the above approaches is that epistemic uncertainty is either approximated heuristically or collapsed into point-valued predictions, rather than represented explicitly as uncertainty about the model itself. Indeed, epistemic uncertainty can be defined either as uncertainty about the "true" probability distribution generating the data the model is trained on[72], or as uncertainty in the hypothesis space (the class of models, e.g., the class of convolutional neural networks with a set number of layers and neurons for each layer) chosen[73]. As a result, existing methods struggle to faithfully encode ignorance under data scarcity, distribution shift, and open-world conditions.

Formal definition

Epistemic AI maintains that an AI system must be designed not only to learn from the data it observes, but also to be prepared for data it has not yet encountered, in a Socratic stance ("know that you do not know"). This is possibly the paramount problem in machine learning, where a mapping from inputs to outputs needs to be learn from data that is forcibly scarce[74], or not representative of the entirety of the problem[75].

The problem can be formalized as one of learning a mapping (epistemic model) from input data points to predictions in the form of a second-order uncertainty measure, either on the target space (e.g., in classification, the collection of possible classes) or on the parameter space of the model itself (e.g., for neural networks, the set of connection weights and biases): see Figure below. Later, this prediction may be updated in the light of new data, in a continual learning setting.

When the epistemic prediction is a credal set, as in most cases[76] (as shown in the above Section), a probability (‘pignistic’) estimate can be computed as its center of mass to provide a "precise" probabilistic prediction, which can then be evaluated in the traditional way. It is well known that second-order uncertainty measures subsume classical probabilistic approaches as special cases[77] (in particular, a standard probability distribution is a trivial credal set containing a single point; it is also a special case of finite random sets or belief functions[78]; more general classes of uncertainty measures, such as capacities[79] or lower previsions[80], also exist).

Epistemic AI can therefore be understood as a generalization of existing Bayesian and ensemble methods to second-order uncertainty.

Principle of epistemic learning.

Formally, one can define Epistemic AI as a class of learning algorithms that: (i) employ second-order uncertainty measures (e.g., credal sets, probability intervals, random sets or belief functions), rather than single probability distributions, as a mathematical formalization of the uncertainty of the learning process, and (ii) preserve and propagate ignorance under data scarcity, distribution shift, or open-world conditions, instead of collapsing it into point estimates.

Formally, one can define Epistemic AI as a class of learning algorithms that: (i) employ second-order uncertainty measures (e.g., credal sets, probability intervals[81][82], random sets[83][84] or belief functions), rather than single probability distributions, as a mathematical formalization of the uncertainty of the learning process, and (ii) preserve and propagate ignorance under data scarcity, distribution shift, or open-world conditions, instead of collapsing it into point estimates. As the field is still evolving, the best way of formalizing this concept is still open for discussion.

Limitations of first-order probabilistic representations

Existing models fundamentally struggle to capture epistemic uncertainty. Some authors have argued this is primarily because a single probability distribution cannot fully express ignorance about the data-generating process[77].

Bayesian methods, in particular, though widely used, particularly falter in data-sparse or ambiguous settings because they must assign fixed belief mass even when knowledge is lacking. Uninformative priors such as Jeffreys' (1998), which attempt to assume the least about the prior distribution of the data, for instance by assigning equal probability to all eventualities, are unfortunately not invariant under a reparameterization of the sample space[85] and can be improper[86], violating objectivity and the strong likelihood principle[87]. Moreover, priors must be specified even for systems without past data, leading to arbitrary modeling choices that can bias the learning process for a long time (Bernstein-von Mises theorem). Bayesian posteriors may appear similar whether we have no knowledge or weak evidence, conflating ignorance with imprecise belief and potentially causing misleading overconfidence.

Model selection and prior choice lack objective criteria[88], and prior sensitivity worsens with scarce data[89]. Further, Bayesian models cannot naturally represent set-valued or propositional evidence[90], because the additivity of probability forces allocation to individual outcomes, even when evidence supports sets of hypotheses. Random-sets, in opposition, can naturally model missing data[77] without the need for imputation techniques such as missing at random. For a simple introduction to the topic, the reader can consult. Bayes’ rule also assumes that new evidence is sharp and definitive (although this can be overcome in non-standard frameworks, such as Popper's probability[91]), which is unrealistic in many real-world cases. Hierarchical Bayesian models, which place priors over priors, can model epistemic uncertainty and potentially address some of these issues, but are very computationally expensive in high-dimensional or open-world settings.

Moreover, Bayesian inference tends to smooth out epistemic uncertainty by averaging over models, collapsing diverse possibilities into a single estimate and failing to distinguish knowns from unknowns[92]. Computationally, Bayesian models also often suffer from slow convergence and large inference time[93], limiting their suitability for real-time safety-critical systems.

Target vs parameter representations

Epistemic models have recently demonstrated the ability to outperform competitor methods in areas such as classification, out-of-distribution detection[94], robustness to adversarial attacks, by employing either credal[95] or random-set[96] representations.

Uncertainty, however, can be modelled at two levels: (i) a target level, where the network/model outputs an uncertainty measure on the target space, while its parameters (weights) remain deterministic; (ii) a parameter level, where uncertainty is modelled on the parameter space (i.e., the neural network's weights). Epistemic AI models developed so far have only considered a target-space approach[97].

Results and ongoing work

Numerous methods based on credal set representations have been brought forward, especially in 2024-26.

For instance, Credal-Set Interval Neural Networks[94], based on Interval Neural Networks[98], predict probability intervals for classes (e.g., intervals of the kind P_(c)P(c)P(c)for each class c). As any probability interval determines a credal set, the result model can be thought of as outputting a credal set.

In a slightly different approach, Credal Deep Ensembles[99][100] use ensembles of credal networks (each outputting a separate credal set) to provide upper and lower probability bounds; trained with a distributionally-robust optimization (DRO)-inspired loss[101][102], credal deep ensembles do outperform traditional ones in terms of out-of-distribution detection and general quality of uncertainty quantification. Credal Wrapper[103] also improves uncertainty estimation by "wrapping" both Bayesian and ensemble predictions as credal sets with upper/lower bounds per class, using the intersection probability transformation[104], devised within the framework of the geometric approach to uncertainty, to map a credal set to a single distribution. Credal deep evidential classification[105] leverages a credal set and an interval of evidential predictive distributions to avoid overfitting to the training data and to systematically assess both epistemic (reducible) and aleatoric (irreducible) uncertainties. Credal prediction based on relative likelihood has also been recently proposed in[106]. A framework for credal ensembling in multi-class classification can be found in[107], where probabilistic ensemble members are aggregated into either a representative classifier or a credal classifier, and various decision tasks based on this uncertainty quantification are performed. A weakly supervised framework for credal learning is proposed in[108], where credal learning is studied from the learning-theoretic and complexity-theoretic perspectives. Credal graph neural networks have been formulated[109], where the output layer of a GNN is modified to produce a credal set for classification purposes. A generalization of statistical learning theory under the assumption that a series of training sets are generated each by a different probability distribution within a finitely-generated credal set is the focus of[110], where generalization bounds which generalize classical ones are derived.

Random-set representations have also been advanced. Random-Set Neural Networks[111], in particular, efficiently predict belief values for sets of classes, addressing ambiguity and incomplete data, using a budgeting method levering the unsupervised clustering of the training set to reduce complexity. RS-NN demonstrate superior performance over its competitors, in terms of accuracy, uncertainty estimation and out-of-distribution (OoD) detection on multiple benchmark datasets, including CIFAR-10, MNIST, ImageNet. Extensions of the random-set approach to graph neural networks[112] and large language models[113] have recently been proposed. A possible random-set treatment of statistical learning is explored in[114][115].

A key future research direction is thus extending Epistemic AI from target-level to parameter-level uncertainty representations, with the aim to fully generalize Bayesian (deep) learning, e.g., by transforming Bayesian Neural Network posteriors into random-set posteriors using an epistemic wrapper[116] without retraining via Shafer's likelihood transform[117]. Credal Bayesian Deep Learning[118], in opposition, introduces sets of posteriors over parameters, deriving predictive distributions at inference time that distinguish and quantify aleatoric and epistemic uncertainty, yielding either a set of outputs with guarantees or a single best prediction. Another promising direction is to employ Smets’ Generalized Bayes Theorem[119] to generate belief functions, i.e., finite random-sets, over a model's parameters from a generalized likelihood and observations, under conditional cognitive independence (a generalization of the i.i.d. assumption), to directly learn random-set parametric representations from a training set.

Natural extensions to regression can also be envisaged, as done for instance in CREDO[120], which first builds an interpretable credal envelope that widens when local evidence is weak, then applies split conformal calibration on top of this envelope to guarantee marginal coverage without further assumptions. Two-sample credal tests of epistemic uncertainty have been proposed[121].

Generative AI

The extension to generative AI (GenAI), the part of AI which concerns itself with the generation of new data, is also a paramount open direction of research.

Large Language Models (LLMs) perform very strongly in natural language processing (NLP) tasks such as answering questions, reasoning[122], mathematical problem-solving[123], and code generation[124]. Pre-trained on large text corpora via next-token prediction, LLMs are fine-tuned for specific applications. Despite their success, they face challenges like hallucinations[125]. Mechanisms to enhance their truthfulness (calibration) and quantify uncertainty may improve their reliability. Bayesian approaches such as Laplace-LORA[126], BLoB[127] and Monte-Carlo Dropout[128] have been applied to LLMs for uncertainty quantification. ENN-LLM[129] uses Epistemic Neural Network-inspired ensembles, while other approaches leverage hidden states, softmax entropy[130] or semantic entropy[131]. These methods, however, often trade performance for inference efficiency. The use of knowledge distillation techniques has also been proposed[132]. The epistemic uncertainty of visual tokens for object hallucinations in Large Vision-Language Models is considered in[133].

A good recent survey on uncertainty quantification and confidence calibration in large language models is[134]

An epistemic AI take on generative AI (both in the context of LLMs and beyond) requires us to address the all-important challenge of how to elicit second-order representations from "traditional" ground truth datasets, such as question-answer pairs. How do we teach a model that the examples it sees are only samples from an incredibly rich set of possibilities? Developing appropriate evaluation methods for uncertainty-aware LLMs (building on initial efforts in a classification context[135]) is another challenge that needs to be addressed before such models can be effectively trained and deployed. In the context of GenAI, Epistemic AI can teach generative models the range of possible outputs they could produce from a limited training set, capturing the epistemic uncertainty of the generative process itself[136]. Some results are coming in: credal sets have been shown to be able to expose calibration gaps in language models[137], while attempts to design credal transformers[138] have also been made.

Challenges

A number of open challenges can be identified, which are likely to drive further work in this area.

Complexity of second-order representations

Challenges that are shared across UQ may be exacerbated when using second-order uncertainty measures[139], owing to their higher expressiveness and complexity. Working with sets of distributions (e.g., credal or random-sets) may involve costly sampling and inference procedures[140], particularly for decision-making[141]. Recent work, however, has seemingly addressed this by employing set budgeting techniques to efficiently constrain the complexity of using random-sets[111]. However, further research is needed to expand this to other second-order representations, potentially more general, such as imprecise probability, lower previsions or monotone capacities.

Scaling up

Most evidential approaches[142] struggle with scalability beyond medium-sized datasets. The clustering approach proposed within the random-set approach[96] has unlocked the potential of random-set representations to large datasets such as ImageNet and architectures including vision transformers, with future extensions possibly incorporating Dirichlet mixture models[143] and dynamic clustering for continual learning. A key challenge remains: can epistemic representations scale to foundation model-size and massive datasets? While efficient belief function/random-set representations have been explored[144], further work is needed. Quantum approaches show some promise, with recent work on belief representation[145], combination, and integration into quantum circuits[146]

Continual learning

Continual learning is a more faithful representation of life-long real-world learning processes, especially in contexts in which models are continually updated in the light of streaming data whose distribution, however, may vary over time in unknown ways. Despite recent efforts[147][148], a unified framework linking uncertainty modeling and continual learning remains an entirely open challenge, not just for Epistemic AI but for uncertainty quantification in artificial intelligence more in general.

Statistical guarantees

Most current Epistemic AI methods do not provide statistical coverage guarantees on their predictions[149], albeit they can do so in combination with classical conformal learning. Already mentioned efforts to generalize conformal learning certainly go in this direction. Recent studies have been looking at extending the notion of confidence interval to belief functions, under the name of confidence structures[150], which generalize standard confidence distributions and generate "frequency-calibrated" belief functions. Also in the random-set setting, Inferential Models (IMs) can produce belief functions with well-defined frequentist properties[151]. An alternative approach relies on the notion of predictive belief function[152], which, under repeated sampling, is less committed than the true probability distribution of interest with some prescribed probability.

References

  1. Chakraborti, Tapabrata; Banerji, Christopher R. S.; Marandon, Ariane; Hellon, Vicky; Mitra, Robin; Lehmann, Brieuc; Bräuninger, Leandra; McGough, Sarah; Turkay, Cagatay; Frangi, Alejandro F.; Bianconi, Ginestra; Li, Weizi; Rackham, Owen; Parashar, Deepak; Harbron, Chris (April 2025). "Personalized uncertainty quantification in artificial intelligence". Nature Machine Intelligence. 7 (4): 522–530. doi:10.1038/s42256-025-01024-8. ISSN 2522-5839.
  2. He, Wenchong; Jiang, Zhe; Xiao, Tingsong; Xu, Zelin; Li, Yukun (2026-02-03). "A Survey on Uncertainty Quantification Methods for Deep Learning". ACM Comput. Surv. 58 (7): 179:1–179:35. doi:10.1145/3786319. ISSN 0360-0300.
  3. Thames, Chris; Sun, Yifan (2024-04-23). "A Survey of Artificial Intelligence Approaches to Safety and Mission-Critical Systems". 2024 Integrated Communications, Navigation and Surveillance Conference (ICNS). IEEE. pp. 1–12. doi:10.1109/ICNS60906.2024.10550712. ISBN 979-8-3503-9309-5. Search this book on
  4. Abdar, Moloud; Khosravi, Abbas; Islam, Sheikh Mohammed Shariful; Acharya, U. Rajendra; Vasilakos, Athanasios V. (July 2022). "The need for quantification of uncertainty in artificial intelligence for clinical data analysis: increasing the level of trust in the decision-making process". IEEE Systems, Man, and Cybernetics Magazine. 8 (3): 28–40. doi:10.1109/MSMC.2022.3150144. ISSN 2333-942X.
  5. Michelmore, Rhiannon; Wicker, Matthew; Laurenti, Luca; Cardelli, Luca; Gal, Yarin; Kwiatkowska, Marta (May 2020). "Uncertainty Quantification with Statistical Guarantees in End-to-End Autonomous Driving Control". 2020 IEEE International Conference on Robotics and Automation (ICRA). IEEE. pp. 7344–7350. arXiv:1909.09884. doi:10.1109/ICRA40945.2020.9196844. ISBN 978-1-7281-7395-5. Search this book on
  6. Yu, Jie; Wang, Dingyan; Zheng, Mingyue (August 2022). "Uncertainty quantification: Can we trust artificial intelligence in drug discovery?". iScience. 25 (8): 104814. Bibcode:2022iSci...25j4814Y. doi:10.1016/j.isci.2022.104814. PMC 9391523 Check |pmc= value (help). PMID 35996575 Check |pmid= value (help).
  7. Shi, Yan; Wei, Pengfei; Feng, Ke; Feng, De-Cheng; Beer, Michael (2025-01-30). "A survey on machine learning approaches for uncertainty quantification of engineering systems". Machine Learning for Computational Science and Engineering. 1 (1): 11. doi:10.1007/s44379-024-00011-x. ISSN 3005-1436.
  8. Sah, Chandan Kumar (2025-11-16). "Understanding Uncertainty in LLMS". 2025 40th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE. pp. 4128–4130. doi:10.1109/ASE63991.2025.00392. ISBN 979-8-3503-5733-2. Search this book on
  9. György, András; Kuzborskij, Ilja; Szepesvári, Csaba; Yadkori, Yasin (2024). To Believe or Not to Believe Your LLM: Iterative Prompting for Estimating Epistemic Uncertainty. Neural Information Processing Systems Foundation, Inc. (NeurIPS). pp. 58077–58117. doi:10.52202/079017-1852. ISBN 979-8-3313-1438-5. Search this book on
  10. Perković, Gabrijela; Drobnjak, Antun; Botički, Ivica (2024-05-20). "Hallucinations in LLMS: Understanding and Addressing Challenges". 2024 47th MIPRO ICT and Electronics Convention (MIPRO). IEEE. pp. 2084–2088. doi:10.1109/MIPRO60963.2024.10569238. ISBN 979-8-3503-8250-1. Search this book on
  11. Semnani, Sina; Yao, Violet; Zhang, Heidi; Lam, Monica (2023). "WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia". Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics. pp. 2387–2413. doi:10.18653/v1/2023.findings-emnlp.157. Search this book on
  12. Rondini, Silvia; Alvarez-Martin, Claudia; Angermair-Barkai, Paula; Penacchio, Olivier; Paz, Marc; Pelowski, Matthew; Dediu, Dan; Rodriguez-Fornells, Antoni; Cerda-Company, Xim (May 2026). "Stable Diffusion Models Reveal a Persisting Human–AI Gap in Visual Creativity". Advanced Science. 13 (27). doi:10.1002/advs.202524142. ISSN 2198-3844. PMC 13170252 Check |pmc= value (help). PMID 41873462 Check |pmid= value (help). Unknown parameter |article-number= ignored (help)
  13. Zheng, Yinhe; Chen, Guanyi; Huang, Minlie (2020). "Out-of-Domain Detection for Natural Language Understanding in Dialog Systems". IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28: 1198–1209. arXiv:1909.03862. Bibcode:2020ITASL..28.1198Z. doi:10.1109/TASLP.2020.2983593. ISSN 2329-9290.
  14. Finlayson, Samuel G.; Bowers, John D.; Ito, Joichi; Zittrain, Jonathan L.; Beam, Andrew L.; Kohane, Isaac S. (2019-03-22). "Adversarial attacks on medical machine learning". Science. 363 (6433): 1287–1289. Bibcode:2019Sci...363.1287F. doi:10.1126/science.aaw4399. ISSN 0036-8075. PMC 7657648 Check |pmc= value (help). PMID 30898923.
  15. Hüllermeier, Eyke; Waegeman, Willem (2021-03-01). "Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods". Machine Learning. 110 (3): 457–506. doi:10.1007/s10994-021-05946-3. ISSN 1573-0565.
  16. LeRoy, Stephen F.; Singell, Larry D. (1987). "Knight on Risk and Uncertainty". Journal of Political Economy. 95 (2): 394–406. doi:10.1086/261461. JSTOR 1832078.
  17. Pritchard, Duncan (2016). "Epistemic Risk". The Journal of Philosophy. 113 (11): 550–571. doi:10.5840/jphil20161131137. hdl:20.500.11820/534f4b25-5283-4ad5-a697-39ec7082f566. ISSN 0022-362X. JSTOR 48568279.
  18. Klingebiel, Ronald; Zhu, Feibai (2023-12-01). "Ambiguity aversion and the degree of ambiguity". Journal of Risk and Uncertainty. 67 (3): 299–324. doi:10.1007/s11166-023-09410-6. ISSN 1573-0476.
  19. Segal, Uzi (1987). "The Ellsberg Paradox and Risk Aversion: An Anticipated Utility Approach". International Economic Review. 28 (1): 175–202. doi:10.2307/2526866. ISSN 0020-6598. JSTOR 2526866.
  20. Smith, Freddie Bickford; Kossen, Jannik; Trollope, Eleanor; Wilk, Mark van der; Foster, Adam; Rainforth, Tom (2025-08-18), Rethinking Aleatoric and Epistemic Uncertainty, arXiv:2412.20892
  21. Kiureghian, Armen Der; Ditlevsen, Ove (2009-03-01). "Aleatory or epistemic? Does it matter?". Structural Safety. Risk Acceptance and Risk Communication. 31 (2): 105–112. doi:10.1016/j.strusafe.2008.06.020. ISSN 0167-4730.
  22. Walters, Daniel J.; Ülkümen, Gülden; Tannenbaum, David; Erner, Carsten; Fox, Craig R. (May 2023). "Investor Behavior Under Epistemic vs. Aleatory Uncertainty". Management Science. 69 (5): 2761–2777. doi:10.1287/mnsc.2022.4489. ISSN 0025-1909.
  23. Roy, Pritam (2024-05-27). "Enhancing Real-World Robustness in AI: Challenges and Solutions". Journal of Recent Trends in Computer Science and Engineering. 12 (1): 34–49. doi:10.70589/JRTCSE.2024.1.6. ISSN 2322-0872.
  24. Chuan Guo, Geoff Pleiss, Yu Sun, Kilian Q. Weinberger (2017). "On Calibration of Modern Neural Networks". Proceedings of the 34th International Conference on Machine Learning. PMLR 70: 1321–1330.CS1 maint: Multiple names: authors list (link)
  25. Haffar, Rami; Jebreel, Najeeb Moharram; Domingo-Ferrer, Josep; Sánchez, David (2021). "Explaining Image Misclassification in Deep Learning via Adversarial Examples". In Torra, Vicenç; Narukawa, Yasuo. Modeling Decisions for Artificial Intelligence. Lecture Notes in Computer Science. 12898. Cham: Springer International Publishing. pp. 323–334. doi:10.1007/978-3-030-85529-1_26. ISBN 978-3-030-85529-1. Search this book on
  26. Qiu, Shilin; Liu, Qihe; Zhou, Shijie; Wu, Chunjiang (2019-03-04). "Review of Artificial Intelligence Adversarial Attack and Defense Technologies". Applied Sciences. 9 (5): 909. Bibcode:2019ApSci...9..909Q. doi:10.3390/app9050909. ISSN 2076-3417.
  27. Jiang, Wei; Zhang, Tianyuan; Liu, Shuangcheng; Ji, Weiyu; Zhang, Zichao; Xiao, Gang (2023-09-18). "Exploring the Physical-World Adversarial Robustness of Vehicle Detection". Electronics. 12 (18): 3921. doi:10.3390/electronics12183921. ISSN 2079-9292.
  28. Farahani, Abolfazl; Voghoei, Sahar; Rasheed, Khaled; Arabnia, Hamid R. (2021). "A Brief Review of Domain Adaptation". In Stahlbock, Robert; Weiss, Gary M.; Abou-Nasr, Mahmoud; Yang, Cheng-Ying; Arabnia, Hamid R.; Deligiannidis, Leonidas. Advances in Data Science and Information Engineering. Transactions on Computational Science and Computational Intelligence. Cham: Springer International Publishing. pp. 877–894. doi:10.1007/978-3-030-71704-9_65. ISBN 978-3-030-71704-9. Search this book on
  29. Singhal, Peeyush; Walambe, Rahee; Ramanna, Sheela; Kotecha, Ketan (2023). "Domain Adaptation: Challenges, Methods, Datasets, and Applications". IEEE Access. 11: 6973–7020. Bibcode:2023IEEEA..11.6973S. doi:10.1109/ACCESS.2023.3237025. ISSN 2169-3536.
  30. Azar, Mohammad Gheshlaghi; Osband, Ian; Munos, Rémi (2017-07-01), Minimax Regret Bounds for Reinforcement Learning, arXiv:1703.05449
  31. Awais, Muhammad; Zhou, Fengwei; Xu, Hang; Hong, Lanqing; Luo, Ping; Bae, Sung-Ho; Li, Zhenguo (2021-09-02), Adversarial Robustness for Unsupervised Domain Adaptation, arXiv:2109.00946
  32. Deshmukh, Aniket Anand; Lei, Yunwen; Sharma, Srinagesh; Dogan, Urun; Cutler, James W.; Scott, Clayton (2019-05-24). "A Generalization Error Bound for Multi-class Domain Generalization". arXiv:1905.10392 [stat.ML].
  33. Roy, Subhankar; Trapp, Martin; Pilzer, Andrea; Kannala, Juho; Sebe, Nicu; Ricci, Elisa; Solin, Arno (2022). "Uncertainty-Guided Source-Free Domain Adaptation". In Avidan, Shai; Brostow, Gabriel; Cissé, Moustapha; Farinella, Giovanni Maria; Hassner, Tal. Computer Vision – ECCV 2022. Lecture Notes in Computer Science. 13685. Cham: Springer Nature Switzerland. pp. 537–555. doi:10.1007/978-3-031-19806-9_31. ISBN 978-3-031-19806-9. Search this book on
  34. Peng, Xi; Qiao, Fengchun; Zhao, Long (March 2024). "Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach". IEEE Transactions on Pattern Analysis and Machine Intelligence. 46 (3): 1775–1787. Bibcode:2024ITPAM..46.1775P. doi:10.1109/TPAMI.2022.3184598. ISSN 0162-8828. PMID 35724300 Check |pmid= value (help).
  35. Wang, Cheng (2025-09-14), Calibration in Deep Learning: A Survey of the State-of-the-Art, arXiv:2308.01222
  36. Cugurullo, Federico; Xu, Ying (2025-01-04). "When AIs become oracles: generative artificial intelligence, anticipatory urban governance, and the future of cities". Policy and Society. 44 (1): 98–115. doi:10.1093/polsoc/puae025. ISSN 1449-4035.
  37. Dong, Jinzong; Jiang, Zhaohui; Pan, Dong; Yu, Haoyang (2025-04-11). "Combining Priors with Experience: Confidence Calibration Based on Binomial Process Modeling". Proceedings of the AAAI Conference on Artificial Intelligence. 39 (15): 16317–16326. doi:10.1609/aaai.v39i15.33792. ISSN 2374-3468.
  38. Li, Zhibin; Li, Shuai; Bamasag, Omaimah Omar; Alhothali, Areej; Luo, Xin (November 2023). "Diversified Regularization Enhanced Training for Effective Manipulator Calibration". IEEE Transactions on Neural Networks and Learning Systems. 34 (11): 8778–8790. Bibcode:2023ITNNL..34.8778L. doi:10.1109/TNNLS.2022.3153039. ISSN 2162-237X. PMID 35263261 Check |pmid= value (help).
  39. Wang, Cheng (2023-08-02). "Calibration in Deep Learning: A Survey of the State-of-the-Art". arXiv:2308.01222 [cs.LG].
  40. Nixon, Jeremy; Dusenberry, Mike; Jerfel, Ghassen; Nguyen, Timothy; Liu, Jeremiah; Zhang, Linchuan; Tran, Dustin (2020-08-07), Measuring Calibration in Deep Learning, arXiv:1904.01685
  41. Lockwood, Owen; Si, Mei (2022-10-11). "A Review of Uncertainty for Deep Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment. 18 (1): 155–162. doi:10.1609/aiide.v18i1.21959. ISSN 2334-0924.
  42. Hillerbrand, Rafaela (2009-10-29). "Epistemic Uncertainties in Climate Predictions: A Challenge for Practical Decision Making". Intergenerational Justice Review. 3 (3). doi:10.24357/igjr.3.3.492. ISSN 2190-6335.
  43. Li, Yaohan; Liu, Min; Faber, Michael Havbro (2024). "Epistemic uncertainties in climate adaptation decision ranking". Civil Engineering and Environmental Systems. 41 (3–4): 88–107. Bibcode:2024CEES...41...88L. doi:10.1080/10286608.2025.2489714. Retrieved 2026-05-21.
  44. Lutjens, Bjorn; Everett, Michael; How, Jonathan P. (May 2019). "Safe Reinforcement Learning with Model Uncertainty Estimates". 2019 International Conference on Robotics and Automation (ICRA). IEEE. pp. 8662–8668. arXiv:1810.08700. doi:10.1109/ICRA.2019.8793611. ISBN 978-1-5386-6027-0. Search this book on
  45. Derman, Esther; Mankowitz, Daniel; Mann, Timothy; Mannor, Shie (2019-05-20). "A Bayesian Approach to Robust Reinforcement Learning". arXiv:1905.08188 [cs.LG].
  46. Gaon An, Seungyong Moon, Jang-Hyun Kim, Hyun Oh Song (2021). "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble". Advances in Neural Information Processing Systems. 34.CS1 maint: Multiple names: authors list (link)
  47. Dibya Ghosh, Jad Rahme, Aviral Kumar, Amy Zhang, Ryan P. Adams, Sergey Levine (2021). "Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability". Advances in Neural Information Processing Systems. 34.CS1 maint: Multiple names: authors list (link)
  48. Jansson, Peter A. (1991-03-15). "Neural Networks: An Overview". Analytical Chemistry. 63 (6): 357A–362A. Bibcode:1991AnaCh..63..357J. doi:10.1021/ac00006a739. ISSN 0003-2700.
  49. Mukhoti, Jishnu; Kirsch, Andreas; Amersfoort, Joost van; Torr, Philip H. S.; Gal, Yarin (2022-01-28). "Deep Deterministic Uncertainty: A Simple Baseline". arXiv:2102.11582 [cs.LG].
  50. Joost Van Amersfoort, Lewis Smith, Yee Whye Teh, Yarin Gal (2020). "Uncertainty Estimation Using a Single Deep Deterministic Neural Network". Proceedings of the 37th International Conference on Machine Learning. PMLR 119: 9690–9700.CS1 maint: Multiple names: authors list (link)
  51. Buntine, Wray L.; Weigend, Andreas S. (1991). "Bayesian back-propagation" (PDF). Complex Systems. 5 (6).
  52. Blundell, Charles; Cornebise, Julien; Kavukcuoglu, Koray; Wierstra, Daan (2015-05-21). "Weight Uncertainty in Neural Networks". arXiv:1505.05424 [stat.ML].
  53. Fortuin, Vincent (December 2022). "Priors in Bayesian Deep Learning: A Review". International Statistical Review. 90 (3): 563–591. doi:10.1111/insr.12502. ISSN 0306-7734.
  54. Coccaro, Andrea; Pierini, Maurizio; Silvestrini, Luca; Torre, Riccardo (2020-07-23). "The DNNLikelihood: enhancing likelihood distribution with Deep Learning". The European Physical Journal C. 80 (7): 664. arXiv:1911.03305. Bibcode:2020EPJC...80..664C. doi:10.1140/epjc/s10052-020-8230-1. ISSN 1434-6052.
  55. Pavel Izmailov, Wesley J. Maddox, Polina Kirichenko, Timur Garipov, Dmitry Vetrov, Andrew Gordon Wilson (2020). "Subspace Inference for Bayesian Deep Learning". Proceedings of the 35th Uncertainty in Artificial Intelligence Conference. PMLR 115: 1169–1179.CS1 maint: Multiple names: authors list (link)
  56. Mobiny, Aryan; Yuan, Pengyu; Moulik, Supratik K.; Garg, Naveen; Wu, Carol C.; Van Nguyen, Hien (2021-03-09). "DropConnect is effective in modeling uncertainty of Bayesian deep networks". Scientific Reports. 11 (1): 5458. Bibcode:2021NatSR..11.5458M. doi:10.1038/s41598-021-84854-x. ISSN 2045-2322. PMC 7943811 Check |pmc= value (help). PMID 33750847 Check |pmid= value (help).
  57. Fortuin, Vincent (2022-03-18). "Priors in Bayesian Deep Learning: A Review". arXiv:2105.06868 [stat.ML].
  58. Fragoso, Tiago M.; Neto, Francisco Louzada (April 2018). "Bayesian model averaging: A systematic review and conceptual classification". International Statistical Review. 86 (1): 1–28. arXiv:1509.08864. doi:10.1111/insr.12243.
  59. Lakshminarayanan, Balaji; Pritzel, Alexander; Blundell, Charles (2017-12-04). "Simple and scalable predictive uncertainty estimation using deep ensembles". In Ulrike von Luxburg; Isabelle Guyon; Samy Bengio; Hanna Wallach; Rob Fergus; Roman Garnett; S.V.N. Vishwanathan. 31st Annual Conference on Neural Information Processing Systems 2017 ; Long Beach, California, 4-9 December 2017. NIPS'17. Red Hook, NY, USA: Curran Associates Inc. pp. 6405–6416. ISBN 978-1-5108-6096-4. Search this book on
  60. Osband, Ian; Wen, Zheng; Asghari, Seyed Mohammad; Dwaracherla, Vikranth; Ibrahimi, Morteza; Lu, Xiuyuan; Roy, Benjamin Van (2023-05-17). "Epistemic Neural Networks". arXiv:2107.08924 [cs.LG].
  61. Wood, Danny; Mu, Tingting; Webb, Andrew; Reeve, Henry; Luján, Mikel; Brown, Gavin (2024-02-07), A Unified Theory of Diversity in Ensemble Learning, arXiv:2301.03962
  62. Vovk, V.; Shafer, G.; Nouretdinov, I. (2003). "Self-calibrating Probability Forecasting" (PDF). Advances in Neural Information Processing Systems. 16. pp. 1133–1140. Search this book on
  63. Bates, Stephen; Candès, Emmanuel; Lei, Lihua; Romano, Yaniv; Sesia, Matteo (2023-02-01). "Testing for Outliers with Conformal p-values". The Annals of Statistics. 51 (1). arXiv:2104.08279. doi:10.1214/22-AOS2244.
  64. Vovk, Vladimir (2025-06-08), Randomness, exchangeability, and conformal prediction, arXiv:2501.11689
  65. Chau, Siu Lun; Zargarbashi, Soroush H.; Sale, Yusuf; Caprio, Michele (2026-02-02), Quantifying Epistemic Predictive Uncertainty in Conformal Prediction, arXiv:2602.01667
  66. Javanmardi, Alireza; Zargarbashi, Soroush H.; Thies, Santo M. A. R.; Waegeman, Willem; Bojchevski, Aleksandar; Hüllermeier, Eyke (2025-05-25), Optimal Conformal Prediction under Epistemic Uncertainty, arXiv:2505.19033
  67. Barber, Rina Foygel; Candès, Emmanuel J.; Ramdas, Aaditya; Tibshirani, Ryan J. (2023-04-01). "Conformal prediction beyond exchangeability". The Annals of Statistics. 51 (2). doi:10.1214/23-AOS2276. ISSN 0090-5364.
  68. Xu, L.; Krzyzak, A.; Suen, C.Y. (May–June 1992). "Methods of combining multiple classifiers and their applications to handwriting recognition". IEEE Transactions on Systems, Man, and Cybernetics. 22 (3): 418–435. Bibcode:1992ITSMC..22..418X. doi:10.1109/21.155943.
  69. Denoeux, T. (May 1995). "A k-nearest neighbor classification rule based on Dempster-Shafer theory". IEEE Transactions on Systems, Man, and Cybernetics. 25 (5): 804–813. Bibcode:1995ITSMC..25..804D. doi:10.1109/21.376493.
  70. Tong, Zheng; Xu, Philippe; Denœux, Thierry (2021-08-25). "An evidential classifier based on Dempster-Shafer theory and deep learning". Neurocomputing. 450: 275–293. arXiv:2103.13549. doi:10.1016/j.neucom.2021.03.066. ISSN 0925-2312.
  71. Maohao Shen, J. Jon Ryu, Soumya Ghosh, Yuheng Bu, Prasanna Sattigeri, Subhro Das, Gregory W. Wornell (2024). "Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?". Advances in Neural Information Processing Systems. 37.CS1 maint: Multiple names: authors list (link)
  72. Nannapaneni, Saideep; Mahadevan, Sankaran (November 2016). "Reliability analysis under epistemic uncertainty". Reliability Engineering & System Safety. 155: 9–20. Bibcode:2016RESS..155....9N. doi:10.1016/j.ress.2016.06.005.
  73. Lahlou, Salem; Jain, Moksh; Nekoei, Hadi; Butoi, Victor Ion; Bertin, Paul; Rector-Brooks, Jarrid; Korablyov, Maksym; Bengio, Yoshua (2023-02-03), DEUP: Direct Epistemic Uncertainty Prediction, arXiv:2102.08501
  74. Nandy, Aditya; Duan, Chenru; Kulik, Heather J (2022-06-01). "Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery". Current Opinion in Chemical Engineering. 36. Bibcode:2022COCE...3600778N. doi:10.1016/j.coche.2021.100778. ISSN 2211-3398. OSTI 1976980. Unknown parameter |article-number= ignored (help)
  75. Chasalow, Kyla; Levy, Karen (2021-03-01). "Representativeness in Statistics, Politics, and Machine Learning". Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. FAccT '21. New York, NY, USA: Association for Computing Machinery. pp. 77–89. doi:10.1145/3442188.3445872. ISBN 978-1-4503-8309-7. Search this book on
  76. Sale, Yusuf; Caprio, Michele; Hüllermeier, Eyke (2023-06-16), Is the Volume of a Credal Set a Good Measure for Epistemic Uncertainty?, arXiv:2306.09586
  77. 77.0 77.1 77.2 Cuzzolin, Fabio (2021). The Geometry of Uncertainty: The Geometry of Imprecise Probabilities. Artificial Intelligence: Foundations, Theory, and Algorithms (1st ed. 2021 ed.). Cham: Springer International Publishing. ISBN 978-3-030-63152-9. Search this book on
  78. Shafer, Glenn (September 1990). "Perspectives on the theory and practice of belief functions". International Journal of Approximate Reasoning. 4 (5–6): 323–362. doi:10.1016/0888-613X(90)90012-Q.
  79. Chateauneuf, Alain (January 1991). "On the use of capacities in modeling uncertainty aversion and risk aversion". Journal of Mathematical Economics. 20 (4): 343–369. doi:10.1016/0304-4068(91)90036-S.
  80. Miranda, Enrique (June 2008). "A survey of the theory of coherent lower previsions". International Journal of Approximate Reasoning. 48 (2): 628–658. doi:10.1016/j.ijar.2007.12.001.
  81. De Campos, Luis M.; Huete, Juan F.; Moral, Serafin (June 1994). "Probability intervals: a tool for uncertain reasoning". International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems. 02 (2): 167–196. doi:10.1142/S0218488594000146. ISSN 0218-4885.
  82. Yager, Ronald R.; Kreinovich, Vladik (December 1999). "Decision making under interval probabilities". International Journal of Approximate Reasoning. 22 (3): 195–215. doi:10.1016/S0888-613X(99)00028-6.
  83. Molchanov, Ilya, ed. (2005), "Random Closed Sets and Capacity Functionals", Theory of Random Sets, Probability and Its Applications, London: Springer, pp. 1–144, doi:10.1007/1-84628-150-4_1, ISBN 978-1-84628-150-1, retrieved 2026-05-21
  84. Nguyen, Hung T. (2006). An introduction to random sets. Taylor and Francis. ISBN 978-1-58488-519-1. Search this book on
  85. Datta, Gauri Sankar; Ghosh, Malay (1996-02-01). "On the invariance of noninformative priors". The Annals of Statistics. 24 (1). doi:10.1214/aos/1033066203. ISSN 0090-5364.
  86. Taraldsen, Gunnar; Tufto, Jarle; Lindqvist, Bo H. (September 2022). "Improper priors and improper posteriors". Scandinavian Journal of Statistics. 49 (3): 969–991. doi:10.1111/sjos.12550. ISSN 0303-6898.
  87. Shafer, Glenn (23/1986). "The combination of evidence". International Journal of Intelligent Systems. 1 (3): 155–179. doi:10.1002/int.4550010302. Check date values in: |date= (help)
  88. Banner, Katharine M.; Irvine, Kathryn M.; Rodhouse, Thomas J. (August 2020). O'Hara, Robert B., ed. "The use of Bayesian priors in Ecology: The good, the bad and the not great". Methods in Ecology and Evolution. 11 (8): 882–889. Bibcode:2020MEcEv..11..882B. doi:10.1111/2041-210X.13407. ISSN 2041-210X.
  89. Kass, Robert E.; Wasserman, Larry (1996). "The Selection of Prior Distributions by Formal Rules". Journal of the American Statistical Association. 91 (435): 1343–1370. Bibcode:1996JASA...91.1343K. doi:10.1080/01621459.1996.10477003. Retrieved 2026-05-05.
  90. Shafer, Glenn (1982). "Belief Functions and Parametric Models". Journal of the Royal Statistical Society. Series B (Methodological). 44 (3): 322–352. ISSN 0035-9246. JSTOR 2345490.
  91. Popper, Karl (1959). "The Propensity Interpretation of Probability" (PDF). The British Journal for the Philosophy of Science. 10 (37): 25–42. doi:10.1093/bjps/X.37.25.
  92. Hinne, M., Gronau, Q., Wagenmakers, E. and van der Bargh, D. (June 2020). "A Conceptual Introduction to Bayesian Model Averaging". Advances in Methods and Practices in Psychological Science. 3 (2): 200–215. doi:10.1177/2515245919898657.CS1 maint: Multiple names: authors list (link)
  93. Chen, Xinghua; Zhang, Ding; Beer, Michael; Aadil, Mohammad (2025-10-01). "Improved efficiency and accuracy in Bayesian structural model updating via DREAMCMAZS sampling and Kriging surrogates". Structures. 80. doi:10.1016/j.istruc.2025.109856. ISSN 2352-0124. Unknown parameter |article-number= ignored (help)
  94. 94.0 94.1 Wang, Kaizheng; Shariatmadar, Keivan; Manchingal, Shireen Kudukkil; Cuzzolin, Fabio; Moens, David; Hallez, Hans (2025-01-25), "CreINNs: Credal-Set Interval Neural Networks for Uncertainty Estimation in Classification Tasks", Neural Networks : The Official Journal of the International Neural Network Society, 185, arXiv:2401.05043, doi:10.1016/j.neunet.2025.107198, PMID 39903958 Check |pmid= value (help) Unknown parameter |article-number= ignored (help)
  95. Kaizheng Wang, Fabio Cuzzolin, Keivan Shariatmadar, David Moens, Hans Hallez (2025). "Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification". International Conference on Learning Representations: 70290–70318.CS1 maint: Multiple names: authors list (link)
  96. 96.0 96.1 Shireen Kudukkil Manchingal, Muhammad Mubashar, Kaizheng Wang, Keivan Shariatmadar, Fabio Cuzzolin (2025). "Random-Set Neural Networks". International Conference on Learning Representations 2025 (ICLR 2025): 14567–14610.CS1 maint: Multiple names: authors list (link)
  97. Wang, Kaizheng; Cuzzolin, Fabio; Moens, David; Hallez, Hans (2026-03-14). "Credal Ensemble Distillation for Uncertainty Quantification". Proceedings of the AAAI Conference on Artificial Intelligence. 40 (31): 26319–26327. doi:10.1609/aaai.v40i31.39837. ISSN 2374-3468.
  98. Oala, Luis; Heiß, Cosmas; Macdonald, Jan; März, Maximilian; Kutyniok, Gitta; Samek, Wojciech (2021-12-01). "Detecting failure modes in image reconstructions with interval neural network uncertainty". International Journal of Computer Assisted Radiology and Surgery. 16 (12): 2089–2097. doi:10.1007/s11548-021-02482-2. ISSN 1861-6429. PMC 8616888 Check |pmc= value (help). PMID 34480723 Check |pmid= value (help).
  99. Wang, Kaizheng; Cuzzolin, Fabio; Moens, David; Hallez, Hans (2025-11-14), Credal Ensemble Distillation for Uncertainty Quantification, arXiv:2511.13766
  100. Kaizheng Wang, Fabio Cuzzolin, Shireen Kudukkil Manchingal, Keivan Shariatmadar, David Moens, Hans Hallez (2024). "Credal Deep Ensembles for Uncertainty Quantification". Advances in Neural Information Processing Systems. 37.CS1 maint: Multiple names: authors list (link)
  101. Kuhn, Daniel; Shafiee, Soroosh; Wiesemann, Wolfram (July 2025). "Distributionally robust optimization". Acta Numerica. 34: 579–804. doi:10.1017/S0962492924000084. ISSN 0962-4929.
  102. Rahimian, Hamed; Mehrotra, Sanjay (2022-07-27). "Frameworks and Results in Distributionally Robust Optimization". Open Journal of Mathematical Optimization. 3: 1–85. doi:10.5802/ojmo.15. ISSN 2777-5860. Unknown parameter |article-number= ignored (help)
  103. Wang, Kaizheng; Cuzzolin, Fabio; Shariatmadar, Keivan; Moens, David; Hallez, Hans (2025-05-09), Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification, arXiv:2405.15047
  104. Cuzzolin, F. (April 2010). "Credal Semantics of Bayesian Transformations in Terms of Probability Intervals". IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics). 40 (2): 421–432. Bibcode:2010ITSMB..40..421C. doi:10.1109/TSMCB.2009.2025968. ISSN 1083-4419. PMID 19846376.
  105. Caprio, Michele; Manchingal, Shireen K.; Cuzzolin, Fabio (2025-12-05), Credal and Interval Deep Evidential Classifications, arXiv:2512.05526
  106. Timo Löhr, Paul Hofman, Felix Mohr, Eyke Hüllermeier (2025). "Credal Prediction based on Relative Likelihood". Advances in Neural Information Processing Systems. 38.CS1 maint: Multiple names: authors list (link)
  107. Nguyen, Vu-Linh; Zhang, Haifei; Destercke, Sébastien (2025-01-16). "Credal ensembling in multi-class classification". Machine Learning. 114 (1): 19. doi:10.1007/s10994-024-06703-y. ISSN 1573-0565.
  108. Campagner, Andrea (2023), "Credal Learning: Weakly Supervised Learning from Credal Sets", ECAI 2023, Frontiers in Artificial Intelligence and Applications, IOS Press, pp. 327–334, doi:10.3233/FAIA230287, ISBN 978-1-64368-436-9, retrieved 2026-05-21
  109. Tolloso, Matteo; Bacciu, Davide (2025-12-02), Credal Graph Neural Networks, arXiv:2512.02722
  110. Michele Caprio, Maryam Sultana, Eleni G. Elia, Fabio Cuzzolin (2024). "Credal Learning Theory". Advances in Neural Information Processing Systems. 37.CS1 maint: Multiple names: authors list (link)
  111. 111.0 111.1 Manchingal, Shireen Kudukkil; Mubashar, Muhammad; Wang, Kaizheng; Shariatmadar, Keivan; Cuzzolin, Fabio (2025-02-14), Random-Set Neural Networks (RS-NN), arXiv:2307.05772
  112. Woodley, Tommy; Manchingal, Shireen Kudukkil; Tolloso, Matteo; Bacciu, Davide; Cuzzolin, Fabio (2026-05-12), Random-Set Graph Neural Networks, arXiv:2605.11987
  113. Mubashar, Muhammad; Manchingal, Shireen Kudukkil; Cuzzolin, Fabio (2025-04-25), Random-Set Large Language Models, arXiv:2504.18085
  114. Cuzzolin, Fabio (2024-02-22). "Generalising realisability in statistical learning theory under epistemic uncertainty". arXiv:2402.14759 [cs.LG].
  115. Benjamin Dupuis, Paul Viallard, George Deligiannidis, Umut Simsekli (2024). "Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets". Journal of Machine Learning Research (JMLR). 25 (409): 1−55.CS1 maint: Multiple names: authors list (link)
  116. Sultana, Maryam; Yorke-Smith, Neil; Wang, Kaizheng; Manchingal, Shireen Kudukkil; Mubashar, Muhammad; Cuzzolin, Fabio (2025-08-18), Epistemic Wrapping for Uncertainty Quantification, arXiv:2505.02277
  117. Shenoy, Prakash P. (September 2023). "Making inferences in incomplete Bayesian networks: A Dempster-Shafer belief function approach". International Journal of Approximate Reasoning. 160. doi:10.1016/j.ijar.2023.108967. Unknown parameter |article-number= ignored (help)
  118. Caprio, Michele; Dutta, Souradeep; Jang, Kuk Jin; Lin, Vivian; Ivanov, Radoslav; Sokolsky, Oleg; Lee, Insup (2024-10-22), Credal Bayesian Deep Learning, arXiv:2302.09656
  119. Smets, Philippe (1993-08-01). "Belief functions: The disjunctive rule of combination and the generalized Bayesian theorem". International Journal of Approximate Reasoning. 9 (1): 1–35. doi:10.1016/0888-613X(93)90005-X. ISSN 0888-613X.
  120. Cabezas, Luben M. C.; Sloman, Sabina J.; Resende, Bruno M.; Wu, Fanyi; Caprio, Michele; Izbicki, Rafael (2026-03-06). "CREDO: Epistemic-Aware Conformalized Credal Envelopes for Regression". arXiv:2603.06826 [stat.ML].
  121. Chau, Siu Lun; Schrab, Antonin; Gretton, Arthur; Sejdinovic, Dino; Muandet, Krikamol (2024-10-16). "Credal Two-Sample Tests of Epistemic Uncertainty". arXiv:2410.12921 [stat.ML].
  122. Wei, Jason; Wang, Xuezhi; Schuurmans, Dale; Bosma, Maarten; Ichter, Brian; Xia, Fei; Chi, Ed; Le, Quoc; Zhou, Denny (2023-01-10), Chain-of-Thought Prompting Elicits Reasoning in Large Language Models, arXiv:2201.11903
  123. Lewkowycz, Aitor; Andreassen, Anders; Dohan, David; Dyer, Ethan; Michalewski, Henryk; Ramasesh, Vinay; Slone, Ambrose; Anil, Cem; Schlag, Imanol (2022-07-01), Solving Quantitative Reasoning Problems with Language Models, arXiv:2206.14858
  124. Rozière, Baptiste; Gehring, Jonas; Gloeckle, Fabian; Sootla, Sten; Gat, Itai; Tan, Xiaoqing Ellen; Adi, Yossi; Liu, Jingyu; Sauvestre, Romain (2024-01-31), Code Llama: Open Foundation Models for Code, arXiv:2308.12950
  125. Maynez, Joshua; Narayan, Shashi; Bohnet, Bernd; McDonald, Ryan (2020-05-02), On Faithfulness and Factuality in Abstractive Summarization, arXiv:2005.00661
  126. Adam Yang, Maxime Robeyns, Xi Wang, Laurence Aitchison (2024). "Bayesian Low-rank Adaptation for Large Language Models". International Conference on Learning Representations: 1812–1842.CS1 maint: Multiple names: authors list (link)
  127. Han, Ligong; Metaxas, Dimitris; Shi, Haizhou; Wang, Hao; Wang, Yibin (2024). BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models. Neural Information Processing Systems Foundation, Inc. (NeurIPS). pp. 67758–67794. doi:10.52202/079017-2164. ISBN 979-8-3313-1438-5. Search this book on
  128. Purvi Goel, Li Chen (2021). "On the Robustness of Monte Carlo Dropout Trained With Noisy Labels". Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops: 2219–2228.
  129. Osband, Ian; Asghari, Seyed Mohammad; Roy, Benjamin Van; McAleese, Nat; Aslanides, John; Irving, Geoffrey (2023-05-10), Fine-Tuning Language Models via Epistemic Neural Networks, arXiv:2211.01568
  130. Plaut, Benjamin; Khanh, Nguyen X.; Trinh, Tu (2025-08-07), Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A, arXiv:2402.13213
  131. Farquhar, Sebastian; Kossen, Jannik; Kuhn, Lorenz; Gal, Yarin (June 2024). "Detecting hallucinations in large language models using semantic entropy". Nature. 630 (8017): 625–630. Bibcode:2024Natur.630..625F. doi:10.1038/s41586-024-07421-0. ISSN 1476-4687. PMC 11186750 Check |pmc= value (help). PMID 38898292 Check |pmid= value (help).
  132. Park, Seonghyeon; Yeom, Jewon; Sok, Jaewon; Park, Jeongjae; Kim, Heejun; Kim, Taesup (2026-02-02). "Efficient Epistemic Uncertainty Estimation for Large Language Models via Knowledge Distillation". arXiv:2602.01956 [cs.LG].
  133. Hoigi Seo, Dong Un Kang, Hyunjin Cho, Joohoon Lee, Se Young Chun (2025). "On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models". Advances in Neural Information Processing Systems. 38.CS1 maint: Multiple names: authors list (link)
  134. Liu, Xiaoou; Chen, Tiejin; Da, Longchao; Chen, Chacha; Lin, Zhen; Wei, Hua (2025). "Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey". Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2. pp. 6107–6117. doi:10.1145/3711896.3736569. ISBN 979-8-4007-1454-2. Search this book on
  135. Shireen Kudukkil Manchingal, Muhammad Mubashar, Kaizheng Wang, Fabio Cuzzolin (2025). "A Unified Evaluation Framework for Epistemic Predictions". Proceedings of the 28th International Conference on Artificial Intelligence and Statistics. PMLR 258: 2017–2025.CS1 maint: Multiple names: authors list (link)
  136. Mubashar, Muhammad; Cuzzolin, Fabio (2026-03-18). "Epistemic Generative Adversarial Networks". arXiv:2603.18348 [cs.LG].
  137. Garces Arias, Esteban; Rodemann, Julian; Heumann, Christian (2025). "The Geometry of Creative Variability: How Credal Sets Expose Calibration Gaps in Language Models". Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025). Association for Computational Linguistics. pp. 36–50. doi:10.18653/v1/2025.uncertainlp-main.5. Search this book on
  138. Ji, Shihao; Song, Zihui; Huang, Jiajie (2025-10-14). "Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models". arXiv:2510.12137 [cs.CL].
  139. Sale, Yusuf; Bengs, Viktor; Caprio, Michele; Hüllermeier, Eyke (2023-12-02). "Second-Order Uncertainty Quantification: A Distance-Based Approach". arXiv:2312.00995 [cs.LG].
  140. Wilson, Nic (1991-01-01), "A Monte-Carlo Algorithm for Dempster-Shafer Belief", Uncertainty in Artificial Intelligence, Morgan Kaufmann, pp. 414–417, doi:10.1016/B978-1-55860-203-8.50058-9, ISBN 978-1-55860-203-8, retrieved 2026-05-21
  141. Augustin, Thomas (2022), Aslett, Louis J. M.; Coolen, Frank P. A.; De Bock, Jasper, eds., "Statistics with Imprecise Probabilities—A Short Survey", Uncertainty in Engineering: Introduction to Methods and Applications, SpringerBriefs in Statistics, Cham: Springer International Publishing, pp. 67–79, doi:10.1007/978-3-030-83640-5_5, ISBN 978-3-030-83640-5, retrieved 2026-05-05
  142. Sensoy, Murat; Kaplan, Lance; Kandemir, Melih (2018-10-31), Evidential Deep Learning to Quantify Classification Uncertainty, arXiv:1806.01768
  143. Walker, Stephen G. (2007). "Sampling the Dirichlet Mixture Model with Slices". Communications in Statistics - Simulation and Computation. 36 (1): 45–54. Bibcode:2007CoSSC..36...45W. doi:10.1080/03610910601096262. Retrieved 2026-05-21.
  144. Reineking, Thomas (2014). Belief Functions: Theory and Algorithms. Universität Bremen. Search this book on
  145. Zhou, Qianli; Tian, Guojing; Deng, Yong (2022-10-12), BF-QC: Belief Functions on Quantum Circuits, arXiv:2107.03930
  146. Wu, Keming; Xiao, Fuyuan (2024-01-01). "A novel quantum belief entropy for uncertainty measure in complex evidence theory". Information Sciences. 652. doi:10.1016/j.ins.2023.119744. ISSN 0020-0255. Unknown parameter |article-number= ignored (help)
  147. Zheng, Ervine; Yu, Qi; Li, Rui; Shi, Pengcheng; Haake, Anne (2021-05-18). "A Continual Learning Framework for Uncertainty-Aware Interactive Image Segmentation". Proceedings of the AAAI Conference on Artificial Intelligence. 35 (7): 6030–6038. doi:10.1609/aaai.v35i7.16752. ISSN 2374-3468.
  148. Jha, Saurav; Gong, Dong; Zhao, He; Yao, Lina (2023-10-30), NPCL: Neural Processes for Uncertainty-Aware Continual Learning, arXiv:2310.19272
  149. Zou, Xin; Liu, Weiwei (2024-03-24). "Coverage-Guaranteed Prediction Sets for Out-of-Distribution Data". Proceedings of the AAAI Conference on Artificial Intelligence. 38 (15): 17263–17270. doi:10.1609/aaai.v38i15.29673. ISSN 2374-3468.
  150. Denœux, Thierry; Li, Shoumei (2018-01-01). "Frequency-calibrated belief functions: Review and new insights". International Journal of Approximate Reasoning. 92: 232–254. doi:10.1016/j.ijar.2017.10.013. ISSN 0888-613X.
  151. Martin, Ryan; Liu, Chuanhai (2013). "Inferential Models: A Framework for Prior-Free Posterior Probabilistic Inference". Journal of the American Statistical Association. 108 (501): 301–313. arXiv:1206.4091. doi:10.1080/01621459.2012.747960. Retrieved 2026-05-05.
  152. Denœux, Thierry (2006-08-01). "Constructing belief functions from sample data using multinomial confidence regions". International Journal of Approximate Reasoning. 42 (3): 228–252. doi:10.1016/j.ijar.2006.01.001. ISSN 0888-613X.

Template:CC notice


This article "Epistemic artificial intelligence" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:Epistemic artificial intelligence. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.