TY - JOUR AB - Statistical inference is central to many scientific endeavors, yet how it works remains unresolved. Answering this requires a quantitative understanding of the intrinsic interplay between statistical models, inference methods, and the structure in the data. To this end, we characterize the efficacy of direct coupling analysis (DCA)—a highly successful method for analyzing amino acid sequence data—in inferring pairwise interactions from samples of ferromagnetic Ising models on random graphs. Our approach allows for physically motivated exploration of qualitatively distinct data regimes separated by phase transitions. We show that inference quality depends strongly on the nature of data-generating distributions: optimal accuracy occurs at an intermediate temperature where the detrimental effects from macroscopic order and thermal noise are minimal. Importantly our results indicate that DCA does not always outperform its local-statistics-based predecessors; while DCA excels at low temperatures, it becomes inferior to simple correlation thresholding at virtually all temperatures when data are limited. Our findings offer insights into the regime in which DCA operates so successfully, and more broadly, how inference interacts with the structure in the data. AU - Ngampruetikorn, Vudtiwat AU - Sachdeva, Vedant AU - Torrence, Johanna AU - Humplik, Jan AU - Schwab, David J. AU - Palmer, Stephanie E. ID - 11638 IS - 2 JF - Physical Review Research SN - 2643-1564 TI - Inferring couplings in networks across order-disorder phase transitions VL - 4 ER - TY - JOUR AB - Advances in multi-unit recordings pave the way for statistical modeling of activity patterns in large neural populations. Recent studies have shown that the summed activity of all neurons strongly shapes the population response. A separate recent finding has been that neural populations also exhibit criticality, an anomalously large dynamic range for the probabilities of different population activity patterns. Motivated by these two observations, we introduce a class of probabilistic models which takes into account the prior knowledge that the neural population could be globally coupled and close to critical. These models consist of an energy function which parametrizes interactions between small groups of neurons, and an arbitrary positive, strictly increasing, and twice differentiable function which maps the energy of a population pattern to its probability. We show that: 1) augmenting a pairwise Ising model with a nonlinearity yields an accurate description of the activity of retinal ganglion cells which outperforms previous models based on the summed activity of neurons; 2) prior knowledge that the population is critical translates to prior expectations about the shape of the nonlinearity; 3) the nonlinearity admits an interpretation in terms of a continuous latent variable globally coupling the system whose distribution we can infer from data. Our method is independent of the underlying system’s state space; hence, it can be applied to other systems such as natural scenes or amino acid sequences of proteins which are also known to exhibit criticality. AU - Humplik, Jan AU - Tkacik, Gasper ID - 720 IS - 9 JF - PLoS Computational Biology SN - 1553734X TI - Probabilistic models for neural populations that naturally capture global coupling and criticality VL - 13 ER - TY - JOUR AB - In infectious disease epidemiology the basic reproductive ratio, R0, is defined as the average number of new infections caused by a single infected individual in a fully susceptible population. Many models describing competition for hosts between non-interacting pathogen strains in an infinite population lead to the conclusion that selection favors invasion of new strains if and only if they have higher R0 values than the resident. Here we demonstrate that this picture fails in finite populations. Using a simple stochastic SIS model, we show that in general there is no analogous optimization principle. We find that successive invasions may in some cases lead to strains that infect a smaller fraction of the host population, and that mutually invasible pathogen strains exist. In the limit of weak selection we demonstrate that an optimization principle does exist, although it differs from R0 maximization. For strains with very large R0, we derive an expression for this local fitness function and use it to establish a lower bound for the error caused by neglecting stochastic effects. Furthermore, we apply this weak selection limit to investigate the selection dynamics in the presence of a trade-off between the virulence and the transmission rate of a pathogen. AU - Humplik, Jan AU - Hill, Alison AU - Nowak, Martin ID - 1928 JF - Journal of Theoretical Biology TI - Evolutionary dynamics of infectious diseases in finite populations VL - 360 ER -