[
    {
        "id": "authors:47sqx-33w78",
        "collection": "authors",
        "collection_id": "47sqx-33w78",
        "cite_using_url": "https://authors.library.caltech.edu/records/47sqx-33w78",
        "type": "monograph",
        "title": "A Foundation Model for Cell Segmentation",
        "author": [
            {
                "family_name": "Israel",
                "given_name": "Uriah",
                "clpid": "Israel-Uriah"
            },
            {
                "family_name": "Marks",
                "given_name": "Markus",
                "orcid": "0000-0001-8016-1637",
                "clpid": "Marks-Markus"
            },
            {
                "family_name": "Dilip",
                "given_name": "Rohit",
                "orcid": "0000-0002-1820-0321",
                "clpid": "Dilip-Rohit"
            },
            {
                "family_name": "Li",
                "given_name": "Qilin",
                "orcid": "0000-0002-7191-8965",
                "clpid": "Li-Qilin"
            },
            {
                "family_name": "Yu",
                "given_name": "Changhua",
                "orcid": "0000-0003-4799-4535",
                "clpid": "Yu-Changhua"
            },
            {
                "family_name": "Laubscher",
                "given_name": "Emily",
                "orcid": "0009-0008-0242-0507",
                "clpid": "Laubscher-Emily"
            },
            {
                "family_name": "Li",
                "given_name": "Shenyi",
                "clpid": "Li-Shenyi"
            },
            {
                "family_name": "Schwartz",
                "given_name": "Morgan",
                "orcid": "0000-0001-8131-9125",
                "clpid": "Schwartz-Morgan"
            },
            {
                "family_name": "Pradhan",
                "given_name": "Elora",
                "orcid": "0000-0002-0011-2104",
                "clpid": "Pradhan-Elora"
            },
            {
                "family_name": "Ates",
                "given_name": "Ada",
                "clpid": "Ates-Ada"
            },
            {
                "family_name": "Abt",
                "given_name": "Martin",
                "orcid": "0009-0001-6203-0702",
                "clpid": "Abt-Martin"
            },
            {
                "family_name": "Brown",
                "given_name": "Caitlin",
                "clpid": "Brown-Caitlin"
            },
            {
                "family_name": "Pao",
                "given_name": "Edward",
                "clpid": "Pao-Edward"
            },
            {
                "family_name": "Pearson-Goulart",
                "given_name": "Alexander",
                "clpid": "Pearson-Goulart-Alexander"
            },
            {
                "family_name": "Perona",
                "given_name": "Pietro",
                "orcid": "0000-0002-7583-5809",
                "clpid": "Perona-P"
            },
            {
                "family_name": "Gkioxari",
                "given_name": "Georgia",
                "clpid": "Gkioxari-Georgia"
            },
            {
                "family_name": "Barnowski",
                "given_name": "Ross",
                "orcid": "0009-0004-4184-8566",
                "clpid": "Barnowski-Ross"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Van Valen",
                "given_name": "David",
                "orcid": "0000-0001-7534-7621",
                "clpid": "Van-Valen-D"
            }
        ],
        "abstract": "<p>Cells are a fundamental unit of biological organization, and identifying them in imaging data &ndash; cell segmentation &ndash; is a critical task for various cellular imaging experiments. While deep learning methods have led to substantial progress on this problem, most models in use are specialist models that work well for specific domains. Methods that have learned the general notion of &ldquo;what is a cell&rdquo; and can identify them across different domains of cellular imaging data have proven elusive. In this work, we present CellSAM, a foundation model for cell segmentation that generalizes across diverse cellular imaging data. CellSAM builds on top of the Segment Anything Model (SAM) by developing a prompt engineering approach for mask generation. We train an object detector, CellFinder, to automatically detect cells and prompt SAM to generate segmentations. We show that this approach allows a single model to achieve human-level performance for segmenting images of mammalian cells (in tissues and cell culture), yeast, and bacteria collected across various imaging modalities. We show that CellSAM has strong zero-shot performance and can be improved with a few examples via few-shot learning. We also show that CellSAM can unify bioimaging analysis workflows such as spatial transcriptomics and cell tracking. A deployed version of CellSAM is available at&nbsp;<a href=\"https://cellsam.deepcell.org/\">https://cellsam.deepcell.org/</a>.</p>",
        "doi": "10.1101/2023.11.17.567630",
        "pmcid": "PMC10690226",
        "issn": "2692-8205",
        "publisher": "Cold Spring Harbor Laboratory Press",
        "publication": "bioRvix",
        "publication_date": "2024-03-07",
        "pages": "2023.11.17.567630"
    },
    {
        "id": "authors:stvvz-was45",
        "collection": "authors",
        "collection_id": "stvvz-was45",
        "cite_using_url": "https://authors.library.caltech.edu/records/stvvz-was45",
        "type": "monograph",
        "title": "TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis",
        "author": [
            {
                "family_name": "Talukder",
                "given_name": "Sabera",
                "clpid": "Talukder-Sabera"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Gkioxari",
                "given_name": "Georgia",
                "clpid": "Gkioxari-Georgia"
            }
        ],
        "abstract": "<p>The field of general time series analysis has recently begun to explore unified modeling, where a common architectural backbone can be retrained on a specific task for a specific dataset. In this work, we approach unification from a complementary vantage point: unification across tasks and domains. To this end, we explore the impact of discrete, learnt, time series data representations that enable generalist, cross-domain training. Our method, TOTEM, or TOkenized Time Series EMbeddings, proposes a simple tokenizer architecture that embeds time series data from varying domains using a discrete vectorized representation learned in a self-supervised manner. TOTEM works across multiple tasks and domains with minimal to no tuning. We study the efficacy of TOTEM with an extensive evaluation on 17 real world time series datasets across 3 tasks. We evaluate both the specialist (i.e., training a model on each domain) and generalist (i.e., training a single model on many domains) settings, and show that TOTEM matches or outperforms previous best methods on several popular benchmarks. The code can be found at: <a href=\"https://github.com/SaberaTalukder/TOTEM\">https://github.com/SaberaTalukder/TOTEM</a>.</p>",
        "doi": "10.48550/arxiv.2402.16412",
        "publisher": "arXiv",
        "publication_date": "2024"
    },
    {
        "id": "authors:60m6h-a7p32",
        "collection": "authors",
        "collection_id": "60m6h-a7p32",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20230316-204049328",
        "type": "monograph",
        "title": "Eventual Discounting Temporal Logic Counterfactual Experience Replay",
        "author": [
            {
                "family_name": "Voloshin",
                "given_name": "Cameron",
                "clpid": "Voloshin-Cameron"
            },
            {
                "family_name": "Verma",
                "given_name": "Abhinav",
                "orcid": "0000-0002-9820-8285",
                "clpid": "Verma-Abhinav"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Linear temporal logic (LTL) offers a simplified way of specifying tasks for policy optimization that may otherwise be difficult to describe with scalar reward functions. However, the standard RL framework can be too myopic to find maximally LTL satisfying policies. This paper makes two contributions. First, we develop a new value-function based proxy, using a technique we call eventual discounting, under which one can find policies that satisfy the LTL specification with highest achievable probability. Second, we develop a new experience replay method for generating off-policy data from on-policy rollouts via counterfactual reasoning on different ways of satisfying the LTL specification. Our experiments, conducted in both discrete and continuous state-action spaces, confirm the effectiveness of our counterfactual experience replay approach.",
        "publisher": "arXiv",
        "publication_date": "2023-03-03"
    },
    {
        "id": "authors:cwf24-dcd30",
        "collection": "authors",
        "collection_id": "cwf24-dcd30",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20221219-204745839",
        "type": "monograph",
        "title": "BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos",
        "author": [
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Karashchuk",
                "given_name": "Pierre",
                "orcid": "0000-0001-6244-8239",
                "clpid": "Karashchuk-Pierre"
            },
            {
                "family_name": "Dravid",
                "given_name": "Amil",
                "orcid": "0000-0001-6007-0690",
                "clpid": "Dravid-Amil"
            },
            {
                "family_name": "Ryou",
                "given_name": "Serim",
                "orcid": "0000-0003-1344-1158",
                "clpid": "Ryou-Serim"
            },
            {
                "family_name": "Fereidooni",
                "given_name": "Sonia",
                "clpid": "Fereidooni-Sonia"
            },
            {
                "family_name": "Tuthill",
                "given_name": "John C.",
                "orcid": "0000-0002-5689-5806",
                "clpid": "Tuthill-John-C"
            },
            {
                "family_name": "Katsaggelos",
                "given_name": "Aggelos",
                "orcid": "0000-0003-4554-0070",
                "clpid": "Katsaggelos-Aggelos"
            },
            {
                "family_name": "Brunton",
                "given_name": "Bingni W.",
                "orcid": "0000-0002-4831-3466",
                "clpid": "Brunton-Bingni-W"
            },
            {
                "family_name": "Gkioxari",
                "given_name": "Georgia",
                "clpid": "Gkioxari-Georgia"
            },
            {
                "family_name": "Kennedy",
                "given_name": "Ann",
                "orcid": "0000-0002-3782-0518",
                "clpid": "Kennedy-Ann"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Perona",
                "given_name": "Pietro",
                "orcid": "0000-0002-7583-5809",
                "clpid": "Perona-P"
            }
        ],
        "abstract": "Quantifying motion in 3D is important for studying the behavior of humans and other animals, but manual pose annotations are expensive and time-consuming to obtain. Self-supervised keypoint discovery is a promising strategy for estimating 3D poses without annotations. However, current keypoint discovery approaches commonly process single 2D views and do not operate in the 3D space. We propose a new method to perform self-supervised keypoint discovery in 3D from multi-view videos of behaving agents, without any keypoint or bounding box supervision in 2D or 3D. Our method uses an encoder-decoder architecture with a 3D volumetric heatmap, trained to reconstruct spatiotemporal differences across multiple views, in addition to joint length constraints on a learned 3D skeleton of the subject. In this way, we discover keypoints without requiring manual supervision in videos of humans and rats, demonstrating the potential of 3D keypoint discovery for studying behavior.",
        "doi": "10.48550/arXiv.2212.07401",
        "publisher": "arXiv",
        "publication_date": "2022-12-14"
    },
    {
        "id": "authors:0gsec-n8613",
        "collection": "authors",
        "collection_id": "0gsec-n8613",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20221219-234122405",
        "type": "monograph",
        "title": "FI-ODE: Certified and Robust Forward Invariance in Neural ODEs",
        "author": [
            {
                "family_name": "Huang",
                "given_name": "Yujia",
                "orcid": "0000-0001-7667-8342",
                "clpid": "Huang-Yujia"
            },
            {
                "family_name": "Jimenez Rodriguez",
                "given_name": "Ivan Dario",
                "orcid": "0000-0001-9065-5227",
                "clpid": "Jimenez-Rodriguez-Ivan-Dario"
            },
            {
                "family_name": "Zhang",
                "given_name": "Huan",
                "orcid": "0000-0002-1096-4255",
                "clpid": "Zhang-Huan"
            },
            {
                "family_name": "Shi",
                "given_name": "Yuanyuan",
                "orcid": "0000-0002-6182-7664",
                "clpid": "Shi-Yuanyuan"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We study how to certifiably enforce forward invariance properties in neural ODEs. Forward invariance implies that the hidden states of the ODE will stay in a \"good\" region, and a robust version would hold even under adversarial perturbations to the input. Such properties can be used to certify desirable behaviors such as adversarial robustness (the hidden states stay in the region that generates accurate classification even under input perturbations) and safety in continuous control (the system never leaves some safe set). We develop a general approach using tools from non-linear control theory and sampling-based verification. Our approach empirically produces the strongest adversarial robustness guarantees compared to prior work on certifiably robust ODE-based models (including implicit-depth models).",
        "doi": "10.48550/arXiv.2210.16940",
        "publisher": "arXiv",
        "publication_date": "2022-10-30"
    },
    {
        "id": "authors:63nae-74q84",
        "collection": "authors",
        "collection_id": "63nae-74q84",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20221219-234119032",
        "type": "monograph",
        "title": "Neurosymbolic Programming for Science",
        "author": [
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Tjandrasuwita",
                "given_name": "Megan",
                "clpid": "Tjandrasuwita-Megan"
            },
            {
                "family_name": "Sehgal",
                "given_name": "Atharva",
                "clpid": "Sehgal-Atharva"
            },
            {
                "family_name": "Solar-Lezama",
                "given_name": "Armando",
                "orcid": "0000-0001-7604-8252",
                "clpid": "Solar-Lezama-Armando"
            },
            {
                "family_name": "Chaudhuri",
                "given_name": "Swarat",
                "orcid": "0000-0002-6859-1391",
                "clpid": "Chaudhuri-Swarat"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Costilla-Reyes",
                "given_name": "Omar",
                "orcid": "0000-0001-8331-7262",
                "clpid": "Costilla-Reyes-Omar"
            }
        ],
        "abstract": "Neurosymbolic Programming (NP) techniques have the potential to accelerate scientific discovery. These models combine neural and symbolic components to learn complex patterns and representations from data, using high-level concepts or known constraints. NP techniques can interface with symbolic domain knowledge from scientists, such as prior knowledge and experimental context, to produce interpretable outputs. We identify opportunities and challenges between current NP models and scientific workflows, with real-world examples from behavior analysis in science: to enable the use of NP broadly for workflows across the natural and social sciences.",
        "doi": "10.48550/arXiv.2210.05050",
        "publisher": "arXiv",
        "publication_date": "2022-10-10"
    },
    {
        "id": "authors:z042w-bx383",
        "collection": "authors",
        "collection_id": "z042w-bx383",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20221219-234115665",
        "type": "monograph",
        "title": "POLAR: Preference Optimization and Learning Algorithms for Robotics",
        "author": [
            {
                "family_name": "Tucker",
                "given_name": "Maegan",
                "orcid": "0000-0001-7363-6809",
                "clpid": "Tucker-Maegan"
            },
            {
                "family_name": "Li",
                "given_name": "Kejun",
                "orcid": "0000-0002-0823-9839",
                "clpid": "Li-Kejun"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Ames",
                "given_name": "Aaron D.",
                "orcid": "0000-0003-0848-3177",
                "clpid": "Ames-A-D"
            }
        ],
        "abstract": "Parameter tuning for robotic systems is a time-consuming and challenging task that often relies on domain expertise of the human operator. Moreover, existing learning methods are not well suited for parameter tuning for many reasons including: the absence of a clear numerical metric for `good robotic behavior'; limited data due to the reliance on real-world experimental data; and the large search space of parameter combinations. In this work, we present an open-source MATLAB Preference Optimization and Learning Algorithms for Robotics toolbox (POLAR) for systematically exploring high-dimensional parameter spaces using human-in-the-loop preference-based learning. This aim of this toolbox is to systematically and efficiently accomplish one of two objectives: 1) to optimize robotic behaviors for human operator preference; 2) to learn the operator's underlying preference landscape to better understand the relationship between adjustable parameters and operator preference. The POLAR toolbox achieves these objectives using only subjective feedback mechanisms (pairwise preferences, coactive feedback, and ordinal labels) to infer a Bayesian posterior over the underlying reward function dictating the user's preferences. We demonstrate the performance of the toolbox in simulation and present various applications of human-in-the-loop preference-based learning.",
        "doi": "10.48550/arXiv.2208.04404",
        "publisher": "arXiv",
        "publication_date": "2022-08-08"
    },
    {
        "id": "authors:ya1d9-y2y64",
        "collection": "authors",
        "collection_id": "ya1d9-y2y64",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20221219-234042044",
        "type": "monograph",
        "title": "The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior",
        "author": [
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Ulmer",
                "given_name": "Andrew",
                "clpid": "Ulmer-Andrew"
            },
            {
                "family_name": "Chakraborty",
                "given_name": "Dipam",
                "clpid": "Chakraborty-Dipam"
            },
            {
                "family_name": "Geuther",
                "given_name": "Brian",
                "orcid": "0000-0002-7822-486X",
                "clpid": "Geuther-Brian"
            },
            {
                "family_name": "Hayes",
                "given_name": "Edward",
                "clpid": "Hayes-Edward"
            },
            {
                "family_name": "Jia",
                "given_name": "Heng",
                "clpid": "Jia-Heng"
            },
            {
                "family_name": "Kumar",
                "given_name": "Vivek",
                "clpid": "Kumar-Vivek"
            },
            {
                "family_name": "Partridge",
                "given_name": "Zachary",
                "clpid": "Partridge-Zachary"
            },
            {
                "family_name": "Robie",
                "given_name": "Alice",
                "orcid": "0000-0002-0784-2927",
                "clpid": "Robie-Alice-A"
            },
            {
                "family_name": "Schretter",
                "given_name": "Catherine",
                "orcid": "0000-0002-3957-6838",
                "clpid": "Schretter-Catherine-E"
            },
            {
                "family_name": "Sun",
                "given_name": "Chao",
                "clpid": "Sun-Chao"
            },
            {
                "family_name": "Sheppard",
                "given_name": "Keith",
                "orcid": "0000-0003-0842-9365",
                "clpid": "Sheppard-Keith"
            },
            {
                "family_name": "Uttarwar",
                "given_name": "Param",
                "clpid": "Uttarwar-Param"
            },
            {
                "family_name": "Perona",
                "given_name": "Pietro",
                "orcid": "0000-0002-7583-5809",
                "clpid": "Perona-P"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Branson",
                "given_name": "Kristin",
                "orcid": "0000-0002-5567-2512",
                "clpid": "Branson-Kristin"
            },
            {
                "family_name": "Kennedy",
                "given_name": "Ann",
                "orcid": "0000-0002-3782-0518",
                "clpid": "Kennedy-Ann"
            }
        ],
        "abstract": "Real-world behavior is often shaped by complex interactions between multiple agents. To scalably study multi-agent behavior, advances in unsupervised and self-supervised learning have enabled a variety of different behavioral representations to be learned from trajectory data. To date, there does not exist a unified set of benchmarks that can enable comparing methods quantitatively and systematically across a broad set of behavior analysis settings. We aim to address this by introducing a large-scale, multi-agent trajectory dataset from real-world behavioral neuroscience experiments that covers a range of behavior analysis tasks. Our dataset consists of trajectory data from common model organisms, with 9.6 million frames of mouse data and 4.4 million frames of fly data, in a variety of experimental settings, such as different strains, lengths of interaction, and optogenetic stimulation. A subset of the frames also consist of expert-annotated behavior labels. Improvements on our dataset corresponds to behavioral representations that work across multiple organisms and is able to capture differences for common behavior analysis tasks.",
        "doi": "10.48550/arXiv.2207.10553",
        "publisher": "arXiv",
        "publication_date": "2022-07-21"
    },
    {
        "id": "authors:m3zrt-p6h47",
        "collection": "authors",
        "collection_id": "m3zrt-p6h47",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220714-212414777",
        "type": "monograph",
        "title": "Compactly Restrictable Metric Policy Optimization Problems",
        "author": [
            {
                "family_name": "Dorobantu",
                "given_name": "Victor D.",
                "orcid": "0000-0002-2797-7802",
                "clpid": "Dorobantu-Victor-D"
            },
            {
                "family_name": "Azizzadenesheli",
                "given_name": "Kamyar",
                "orcid": "0000-0001-8507-1868",
                "clpid": "Azizzadenesheli-Kamyar"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We study policy optimization problems for deterministic Markov decision processes (MDPs) with metric state and action spaces, which we refer to as Metric Policy Optimization Problems (MPOPs). Our goal is to establish theoretical results on the well-posedness of MPOPs that can characterize practically relevant continuous control systems. To do so, we define a special class of MPOPs called Compactly Restrictable MPOPs (CR-MPOPs), which are flexible enough to capture the complex behavior of robotic systems but specific enough to admit solutions using dynamic programming methods such as value iteration. We show how to arrive at CR-MPOPs using forward-invariance. We further show that our theoretical results on CR-MPOPs can be used to characterize feedback linearizable control affine systems.",
        "doi": "10.48550/arXiv.arXiv.2207.05850",
        "publisher": "arXiv",
        "publication_date": "2022-07-12"
    },
    {
        "id": "authors:5d294-ehk32",
        "collection": "authors",
        "collection_id": "5d294-ehk32",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220714-212419626",
        "type": "monograph",
        "title": "Policy Optimization with Linear Temporal Logic Constraints",
        "author": [
            {
                "family_name": "Voloshin",
                "given_name": "Cameron",
                "clpid": "Voloshin-Cameron"
            },
            {
                "family_name": "Le",
                "given_name": "Hoang M.",
                "clpid": "Le-Hoang-M"
            },
            {
                "family_name": "Chaudhuri",
                "given_name": "Swarat",
                "clpid": "Chaudhuri-Swarat"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints. The language of LTL allows flexible description of tasks that may be unnatural to encode as a scalar cost function. We consider LTL-constrained PO as a systematic framework, decoupling task specification from policy selection, and an alternative to the standard of cost shaping. With access to a generative model, we develop a model-based approach that enjoys a sample complexity analysis for guaranteeing both task satisfaction and cost optimality (through a reduction to a reachability problem). Empirically, our algorithm can achieve strong performance even in low sample regimes.",
        "doi": "10.48550/arXiv.arXiv.2206.09546",
        "publisher": "arXiv",
        "publication_date": "2022-06-20"
    },
    {
        "id": "authors:bme38-gm639",
        "collection": "authors",
        "collection_id": "bme38-gm639",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220714-212423144",
        "type": "monograph",
        "title": "Deep Neural Imputation: A Framework for Recovering Incomplete Brain Recordings",
        "author": [
            {
                "family_name": "Talukder",
                "given_name": "Sabera",
                "clpid": "Talukder-Sabera"
            },
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Leonard",
                "given_name": "Matthew",
                "clpid": "Leonard-Matthew-K"
            },
            {
                "family_name": "Brunton",
                "given_name": "Bingni W.",
                "clpid": "Brunton-Bingni-W"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Neuroscientists and neuroengineers have long relied on multielectrode neural recordings to study the brain. However, in a typical experiment, many factors corrupt neural recordings from individual electrodes, including electrical noise, movement artifacts, and faulty manufacturing. Currently, common practice is to discard these corrupted recordings, reducing already limited data that is difficult to collect. To address this challenge, we propose Deep Neural Imputation (DNI), a framework to recover missing values from electrodes by learning from data collected across spatial locations, days, and participants. We explore our framework with a linear nearest-neighbor approach and two deep generative autoencoders, demonstrating DNI's flexibility. One deep autoencoder models participants individually, while the other extends this architecture to model many participants jointly. We evaluate our models across 12 human participants implanted with multielectrode intracranial electrocorticography arrays; participants had no explicit task and behaved naturally across hundreds of recording hours. We show that DNI recovers not only time series but also frequency content, and further establish DNI's practical value by recovering significant performance on a scientifically-relevant downstream neural decoding task.",
        "doi": "10.48550/arXiv.arXiv.2206.08094",
        "publisher": "arXiv",
        "publication_date": "2022-06-16"
    },
    {
        "id": "authors:v0rk2-qym53",
        "collection": "authors",
        "collection_id": "v0rk2-qym53",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220325-224027516",
        "type": "monograph",
        "title": "Safety of Sampled-Data Systems with Control Barrier Functions via Approximate Discrete Time Models",
        "author": [
            {
                "family_name": "Taylor",
                "given_name": "Andrew J.",
                "orcid": "0000-0002-5990-590X",
                "clpid": "Taylor-Andrew-J"
            },
            {
                "family_name": "Dorobantu",
                "given_name": "Victor D.",
                "orcid": "0000-0002-2797-7802",
                "clpid": "Dorobantu-Victor-D"
            },
            {
                "family_name": "Cosner",
                "given_name": "Ryan K.",
                "orcid": "0000-0002-4035-1425",
                "clpid": "Cosner-Ryan-K"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Ames",
                "given_name": "Aaron D.",
                "orcid": "0000-0003-0848-3177",
                "clpid": "Ames-A-D"
            }
        ],
        "abstract": "Control Barrier Functions (CBFs) have been demonstrated to be a powerful tool for safety-critical controller design for nonlinear systems. Existing design paradigms do not address the gap between theory (controller design with continuous time models) and practice (the discrete time sampled implementation of the resulting controllers); this can lead to poor performance and violations of safety for hardware instantiations. We propose an approach to close this gap by synthesizing sampled-data counterparts to these CBF-based controllers using approximate discrete time models and Sampled-Data Control Barrier Functions (SD-CBFs). Using properties of a system's continuous time model, we establish a relationship between SD-CBFs and a notion of practical safety for sampled-data systems. Furthermore, we construct convex optimization-based controllers that formally endow nonlinear systems with safety guarantees in practice. We demonstrate the efficacy of these controllers in simulation.",
        "doi": "10.48550/arXiv.2203.11470",
        "publisher": "arXiv",
        "publication_date": "2022-03-22"
    },
    {
        "id": "authors:atx9r-ja815",
        "collection": "authors",
        "collection_id": "atx9r-ja815",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220325-220806703",
        "type": "monograph",
        "title": "Self-Supervised Online Learning for Safety-Critical Control using Stereo Vision",
        "author": [
            {
                "family_name": "Cosner",
                "given_name": "Ryan K.",
                "orcid": "0000-0002-4035-1425",
                "clpid": "Cosner-Ryan-K"
            },
            {
                "family_name": "Jimenez Rodriguez",
                "given_name": "Ivan D.",
                "clpid": "Jimenez-Rodriguez-Ivan-D"
            },
            {
                "family_name": "Molnar",
                "given_name": "Tamas G.",
                "orcid": "0000-0002-9379-7121",
                "clpid": "Moln\u00e1r-Tam\u00e1s-G"
            },
            {
                "family_name": "Ubellacker",
                "given_name": "Wyatt",
                "orcid": "0000-0002-4732-6185",
                "clpid": "Ubellacker-Wyatt-L"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Ames",
                "given_name": "Aaron D.",
                "orcid": "0000-0003-0848-3177",
                "clpid": "Ames-A-D"
            },
            {
                "family_name": "Bouman",
                "given_name": "Katherine L.",
                "orcid": "0000-0003-0077-4367",
                "clpid": "Bouman-K-L"
            }
        ],
        "abstract": "With the increasing prevalence of complex vision-based sensing methods for use in obstacle identification and state estimation, characterizing environment-dependent measurement errors has become a difficult and essential part of modern robotics. This paper presents a self-supervised learning approach to safety-critical control. In particular, the uncertainty associated with stereo vision is estimated, and adapted online to new visual environments, wherein this estimate is leveraged in a safety-critical controller in a robust fashion. To this end, we propose an algorithm that exploits the structure of stereo-vision to learn an uncertainty estimate without the need for ground-truth data. We then robustify existing Control Barrier Function-based controllers to provide safety in the presence of this uncertainty estimate. We demonstrate the efficacy of our method on a quadrupedal robot in a variety of environments. When not using our method safety is violated. With offline training alone we observe the robot is safe, but overly-conservative. With our online method the quadruped remains safe and conservatism is reduced.",
        "doi": "10.48550/arXiv.2203.01404",
        "publisher": "arXiv",
        "publication_date": "2022-03-02"
    },
    {
        "id": "authors:ke5cf-9cj15",
        "collection": "authors",
        "collection_id": "ke5cf-9cj15",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220224-200943137",
        "type": "monograph",
        "title": "LyaNet: A Lyapunov Framework for Training Neural ODEs",
        "author": [
            {
                "family_name": "Jimenez Rodriguez",
                "given_name": "Ivan Dario",
                "orcid": "0000-0001-9065-5227",
                "clpid": "Jimenez-Rodriguez-Ivan-Dario"
            },
            {
                "family_name": "Ames",
                "given_name": "Aaron D.",
                "orcid": "0000-0003-0848-3177",
                "clpid": "Ames-A-D"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We propose a method for training ordinary differential equations by using a control-theoretic Lyapunov condition for stability. Our approach, called LyaNet, is based on a novel Lyapunov loss formulation that encourages the inference dynamics to converge quickly to the correct prediction. Theoretically, we show that minimizing Lyapunov loss guarantees exponential convergence to the correct solution and enables a novel robustness guarantee. We also provide practical algorithms, including one that avoids the cost of backpropagating through a solver or using the adjoint method. Relative to standard Neural ODE training, we empirically find that LyaNet can offer improved prediction performance, faster convergence of inference dynamics, and improved adversarial robustness. Our code available at https://github.com/ivandariojr/LyapunovLearning.",
        "doi": "10.48550/arXiv.2202.02526",
        "publisher": "arXiv",
        "publication_date": "2022-02-05"
    },
    {
        "id": "authors:2j5rs-cvh78",
        "collection": "authors",
        "collection_id": "2j5rs-cvh78",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220224-200843937",
        "type": "monograph",
        "title": "Safety-Aware Preference-Based Learning for Safety-Critical Control",
        "author": [
            {
                "family_name": "Cosner",
                "given_name": "Ryan K.",
                "orcid": "0000-0002-4035-1425",
                "clpid": "Cosner-Ryan-K"
            },
            {
                "family_name": "Tucker",
                "given_name": "Maegan",
                "orcid": "0000-0001-7363-6809",
                "clpid": "Tucker-Maegan"
            },
            {
                "family_name": "Taylor",
                "given_name": "Andrew J.",
                "orcid": "0000-0002-5990-590X",
                "clpid": "Taylor-Andrew-J"
            },
            {
                "family_name": "Li",
                "given_name": "Kejun",
                "clpid": "Li-Kejun"
            },
            {
                "family_name": "Moln\u00e1r",
                "given_name": "Tam\u00e1s G.",
                "orcid": "0000-0002-9379-7121",
                "clpid": "Moln\u00e1r-Tam\u00e1s-G"
            },
            {
                "family_name": "Ubellacker",
                "given_name": "Wyatt",
                "orcid": "0000-0002-4732-6185",
                "clpid": "Ubellacker-Wyatt-L"
            },
            {
                "family_name": "Alan",
                "given_name": "Anil",
                "orcid": "0000-0002-9778-8249",
                "clpid": "Alan-Anil"
            },
            {
                "family_name": "Orosz",
                "given_name": "G\u00e1bor",
                "orcid": "0000-0002-9000-3736",
                "clpid": "Orosz-G\u00e1bor"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Ames",
                "given_name": "Aaron D.",
                "orcid": "0000-0003-0848-3177",
                "clpid": "Ames-A-D"
            }
        ],
        "abstract": "Bringing dynamic robots into the wild requires a tenuous balance between performance and safety. Yet controllers designed to provide robust safety guarantees often result in conservative behavior, and tuning these controllers to find the ideal trade-off between performance and safety typically requires domain expertise or a carefully constructed reward function. This work presents a design paradigm for systematically achieving behaviors that balance performance and robust safety by integrating safety-aware Preference-Based Learning (PBL) with Control Barrier Functions (CBFs). Fusing these concepts -- safety-aware learning and safety-critical control -- gives a robust means to achieve safe behaviors on complex robotic systems in practice. We demonstrate the capability of this design paradigm to achieve safe and performant perception-based autonomous operation of a quadrupedal robot both in simulation and experimentally on hardware.",
        "doi": "10.48550/arXiv.2112.08516",
        "publisher": "arXiv",
        "publication_date": "2021-12-15"
    },
    {
        "id": "authors:j33yw-5g106",
        "collection": "authors",
        "collection_id": "j33yw-5g106",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220224-200833645",
        "type": "monograph",
        "title": "Self-Supervised Keypoint Discovery in Behavioral Videos",
        "author": [
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Ryou",
                "given_name": "Serim",
                "clpid": "Ryou-Serim"
            },
            {
                "family_name": "Goldshmid",
                "given_name": "Roni",
                "orcid": "0000-0001-9095-3259",
                "clpid": "Goldshmid-Roni-H"
            },
            {
                "family_name": "Weissbourd",
                "given_name": "Brandon",
                "orcid": "0000-0001-5422-3873",
                "clpid": "Weissbourd-Brandon"
            },
            {
                "family_name": "Dabiri",
                "given_name": "John",
                "orcid": "0000-0002-6722-9008",
                "clpid": "Dabiri-J-O"
            },
            {
                "family_name": "Anderson",
                "given_name": "David J.",
                "orcid": "0000-0001-6175-3872",
                "clpid": "Anderson-D-J"
            },
            {
                "family_name": "Kennedy",
                "given_name": "Ann",
                "orcid": "0000-0002-3782-0518",
                "clpid": "Kennedy-Ann"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Perona",
                "given_name": "Pietro",
                "orcid": "0000-0002-7583-5809",
                "clpid": "Perona-P"
            }
        ],
        "abstract": "We propose a method for learning the posture and structure of agents from unlabelled behavioral videos. Starting from the observation that behaving agents are generally the main sources of movement in behavioral videos, our method uses an encoder-decoder architecture with a geometric bottleneck to reconstruct the difference between video frames. By focusing only on regions of movement, our approach works directly on input videos without requiring manual annotations, such as keypoints or bounding boxes. Experiments on a variety of agent types (mouse, fly, human, jellyfish, and trees) demonstrate the generality of our approach and reveal that our discovered keypoints represent semantically meaningful body parts, which achieve state-of-the-art performance on keypoint regression among self-supervised methods. Additionally, our discovered keypoints achieve comparable performance to supervised keypoints on downstream tasks, such as behavior classification, suggesting that our method can dramatically reduce the cost of model training vis-a-vis supervised methods.",
        "doi": "10.48550/arXiv.2112.05121",
        "publisher": "arXiv",
        "publication_date": "2021-12-09"
    },
    {
        "id": "authors:mwf88-ytc90",
        "collection": "authors",
        "collection_id": "mwf88-ytc90",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220224-200830238",
        "type": "monograph",
        "title": "Automatic Synthesis of Diverse Weak Supervision Sources for Behavior Analysis",
        "author": [
            {
                "family_name": "Tseng",
                "given_name": "Albert",
                "clpid": "Tseng-Albert"
            },
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Obtaining annotations for large training sets is expensive, especially in behavior analysis settings where domain knowledge is required for accurate annotations. Weak supervision has been studied to reduce annotation costs by using weak labels from task-level labeling functions to augment ground truth labels. However, domain experts are still needed to hand-craft labeling functions for every studied task. To reduce expert effort, we present AutoSWAP: a framework for automatically synthesizing data-efficient task-level labeling functions. The key to our approach is to efficiently represent expert knowledge in a reusable domain specific language and domain-level labeling functions, with which we use state-of-the-art program synthesis techniques and a small labeled dataset to generate labeling functions. Additionally, we propose a novel structural diversity cost that allows for direct synthesis of diverse sets of labeling functions with minimal overhead, further improving labeling function data efficiency. We evaluate AutoSWAP in three behavior analysis domains and demonstrate that AutoSWAP outperforms existing approaches using only a fraction of the data. Our results suggest that AutoSWAP is an effective way to automatically generate labeling functions that can significantly reduce expert effort for behavior analysis.",
        "doi": "10.48550/arXiv.2111.15186",
        "publisher": "arXiv",
        "publication_date": "2021-11-30"
    },
    {
        "id": "authors:c5hxv-yxh35",
        "collection": "authors",
        "collection_id": "c5hxv-yxh35",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220224-200822492",
        "type": "monograph",
        "title": "Kernel Interpolation as a Bayes Point Machine",
        "author": [
            {
                "family_name": "Bernstein",
                "given_name": "Jeremy",
                "orcid": "0000-0001-9110-7476",
                "clpid": "Bernstein-Jeremy-D"
            },
            {
                "family_name": "Farhang",
                "given_name": "Alex",
                "clpid": "Farhang-Alex"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "A Bayes point machine is a single classifier that approximates the majority decision of an ensemble of classifiers. This paper observes that kernel interpolation is a Bayes point machine for Gaussian process classification. This observation facilitates the transfer of results from both ensemble theory as well as an area of convex geometry known as Brunn-Minkowski theory to derive PAC-Bayes risk bounds for kernel interpolation. Since large margin, infinite width neural networks are kernel interpolators, the paper's findings may help to explain generalisation in neural networks more broadly. Supporting this idea, the paper finds evidence that large margin, finite width neural networks behave like Bayes point machines too.",
        "doi": "10.48550/arXiv.2110.04274",
        "publisher": "arXiv",
        "publication_date": "2021-10-08"
    },
    {
        "id": "authors:72a5j-ate19",
        "collection": "authors",
        "collection_id": "72a5j-ate19",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220224-200805115",
        "type": "monograph",
        "title": "Unsupervised Learning of Neurosymbolic Encoders",
        "author": [
            {
                "family_name": "Zhan",
                "given_name": "Eric",
                "clpid": "Zhan-Eric"
            },
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Kennedy",
                "given_name": "Ann",
                "orcid": "0000-0002-3782-0518",
                "clpid": "Kennedy-Ann"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Chaudhuri",
                "given_name": "Swarat",
                "clpid": "Chaudhuri-Swarat"
            }
        ],
        "abstract": "We present a framework for the unsupervised learning of neurosymbolic encoders, i.e., encoders obtained by composing neural networks with symbolic programs from a domain-specific language. Such a framework can naturally incorporate symbolic expert knowledge into the learning process and lead to more interpretable and factorized latent representations than fully neural encoders. Also, models learned this way can have downstream impact, as many analysis workflows can benefit from having clean programmatic descriptions. We ground our learning algorithm in the variational autoencoding (VAE) framework, where we aim to learn a neurosymbolic encoder in conjunction with a standard decoder. Our algorithm integrates standard VAE-style training with modern program synthesis techniques. We evaluate our method on learning latent representations for real-world trajectory data from animal biology and sports analytics. We show that our approach offers significantly better separation than standard VAEs and leads to practical gains on downstream tasks.",
        "doi": "10.48550/arXiv.2107.13132",
        "publisher": "arXiv",
        "publication_date": "2021-07-28"
    },
    {
        "id": "authors:pc93q-chx28",
        "collection": "authors",
        "collection_id": "pc93q-chx28",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20220224-200758198",
        "type": "monograph",
        "title": "Interpreting Expert Annotation Differences in Animal Behavior",
        "author": [
            {
                "family_name": "Tjandrasuwita",
                "given_name": "Megan",
                "clpid": "Tjandrasuwita-Megan"
            },
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Kennedy",
                "given_name": "Ann",
                "orcid": "0000-0002-3782-0518",
                "clpid": "Kennedy-Ann"
            },
            {
                "family_name": "Chaudhuri",
                "given_name": "Swarat",
                "clpid": "Chaudhuri-Swarat"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Hand-annotated data can vary due to factors such as subjective differences, intra-rater variability, and differing annotator expertise. We study annotations from different experts who labelled the same behavior classes on a set of animal behavior videos, and observe a variation in annotation styles. We propose a new method using program synthesis to help interpret annotation differences for behavior analysis. Our model selects relevant trajectory features and learns a temporal filter as part of a program, which corresponds to estimated importance an annotator places on that feature at each timestamp. Our experiments on a dataset from behavioral neuroscience demonstrate that compared to baseline approaches, our method is more accurate at capturing annotator labels and learns interpretable temporal filters. We believe that our method can lead to greater reproducibility of behavior annotations used in scientific studies. We plan to release our code.",
        "doi": "10.48550/arXiv.2106.06114",
        "publisher": "arXiv",
        "publication_date": "2021-06-11"
    },
    {
        "id": "authors:enkxv-30g64",
        "collection": "authors",
        "collection_id": "enkxv-30g64",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20210719-210128990",
        "type": "monograph",
        "title": "Learning Pseudo-Backdoors for Mixed Integer Programs",
        "author": [
            {
                "family_name": "Ferber",
                "given_name": "Aaron",
                "clpid": "Ferber-Aaron"
            },
            {
                "family_name": "Song",
                "given_name": "Jialin",
                "clpid": "Song-Jialin"
            },
            {
                "family_name": "Dilkina",
                "given_name": "Bistra",
                "orcid": "0000-0002-6784-473X",
                "clpid": "Dilkina-Bistra"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We propose a machine learning approach for quickly solving Mixed Integer Programs (MIP) by learning to prioritize a set of decision variables, which we call pseudo-backdoors, for branching that results in faster solution times. Learning-based approaches have seen success in the area of solving combinatorial optimization problems by being able to flexibly leverage common structures in a given distribution of problems. Our approach takes inspiration from the concept of strong backdoors, which corresponds to a small set of variables such that only branching on these variables yields an optimal integral solution and a proof of optimality. Our notion of pseudo-backdoors corresponds to a small set of variables such that only branching on them leads to faster solve time (which can be solver dependent). A key advantage of pseudo-backdoors over strong backdoors is that they are much amenable to data-driven identification or prediction. Our proposed method learns to estimate the solver performance of a proposed pseudo-backdoor, using a labeled dataset collected on a set of training MIP instances. This model can then be used to identify high-quality pseudo-backdoors on new MIP instances from the same distribution. We evaluate our method on the generalized independent set problems and find that our approach can efficiently identify high-quality pseudo-backdoors. In addition, we compare our learned approach against Gurobi, a state-of-the-art MIP solver, demonstrating that our method can be used to improve solver performance.",
        "doi": "10.48550/arXiv.2106.05080",
        "publisher": "arXiv",
        "publication_date": "2021-06-09"
    },
    {
        "id": "authors:qcp8d-87r65",
        "collection": "authors",
        "collection_id": "qcp8d-87r65",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20210604-142545306",
        "type": "monograph",
        "title": "End-to-End Sequential Sampling and Reconstruction for MR Imaging",
        "author": [
            {
                "family_name": "Yin",
                "given_name": "Tianwei",
                "clpid": "Yin-Tianwei"
            },
            {
                "family_name": "Wu",
                "given_name": "Zihui",
                "clpid": "Wu-Zihui"
            },
            {
                "family_name": "Sun",
                "given_name": "He",
                "orcid": "0000-0003-1526-6787",
                "clpid": "Sun-He"
            },
            {
                "family_name": "Dalca",
                "given_name": "Adrian V.",
                "orcid": "0000-0002-8422-0136",
                "clpid": "Dalca-Adrian-V"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Bouman",
                "given_name": "Katherine L.",
                "orcid": "0000-0003-0077-4367",
                "clpid": "Bouman-K-L"
            }
        ],
        "abstract": "Accelerated MRI shortens acquisition time by subsampling in the measurement k-space. Recovering a high-fidelity anatomical image from subsampled measurements requires close cooperation between two components: (1) a sampler that chooses the subsampling pattern and (2) a reconstructor that recovers images from incomplete measurements. In this paper, we leverage the sequential nature of MRI measurements, and propose a fully differentiable framework that jointly learns a sequential sampling policy simultaneously with a reconstruction strategy. This co-designed framework is able to adapt during acquisition in order to capture the most informative measurements for a particular target (Figure 1). Experimental results on the fastMRI knee dataset demonstrate that the proposed approach successfully utilizes intermediate information during the sampling process to boost reconstruction performance. In particular, our proposed method outperforms the current state-of-the-art learned k-space sampling baseline on up to 96.96% of test samples. We also investigate the individual and collective benefits of the sequential sampling and co-design strategies. Code and more visualizations are available at this http URL [http://imaging.cms.caltech.edu/seq-mri]",
        "doi": "10.48550/arXiv.2105.06460",
        "publisher": "arXiv",
        "publication_date": "2021-05-13"
    },
    {
        "id": "authors:yej1d-d2j82",
        "collection": "authors",
        "collection_id": "yej1d-d2j82",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20210510-093610124",
        "type": "monograph",
        "title": "The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions",
        "author": [
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Karigo",
                "given_name": "Tomomi",
                "clpid": "Karigo-Tomomi"
            },
            {
                "family_name": "Chakraborty",
                "given_name": "Dipam",
                "clpid": "Chakraborty-Dipam"
            },
            {
                "family_name": "Mohanty",
                "given_name": "Sharada P.",
                "clpid": "Mohanty-Sharada-P"
            },
            {
                "family_name": "Anderson",
                "given_name": "David J.",
                "orcid": "0000-0001-6175-3872",
                "clpid": "Anderson-D-J"
            },
            {
                "family_name": "Perona",
                "given_name": "Pietro",
                "orcid": "0000-0002-7583-5809",
                "clpid": "Perona-P"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Kennedy",
                "given_name": "Ann",
                "orcid": "0000-0002-3782-0518",
                "clpid": "Kennedy-A"
            }
        ],
        "abstract": "Multi-agent behavior modeling aims to understand the interactions that occur between agents. We present a multi-agent dataset from behavioral neuroscience, the Caltech Mouse Social Interactions (CalMS21) Dataset. Our dataset consists of trajectory data of social interactions, recorded from videos of freely behaving mice in a standard resident-intruder assay. The CalMS21 dataset is part of the Multi-Agent Behavior Challenge 2021 and for our next step, our goal is to incorporate datasets from other domains studying multi-agent behavior. \n\nTo help accelerate behavioral studies, the CalMS21 dataset provides a benchmark to evaluate the performance of automated behavior classification methods in three settings: (1) for training on large behavioral datasets all annotated by a single annotator, (2) for style transfer to learn inter-annotator differences in behavior definitions, and (3) for learning of new behaviors of interest given limited training data. The dataset consists of 6 million frames of unlabelled tracked poses of interacting mice, as well as over 1 million frames with tracked poses and corresponding frame-level behavior annotations. The challenge of our dataset is to be able to classify behaviors accurately using both labelled and unlabelled tracking data, as well as being able to generalize to new annotators and behaviors.",
        "doi": "10.48550/arXiv.2104.02710",
        "publisher": "arXiv",
        "publication_date": "2021-04-06"
    },
    {
        "id": "authors:4g66e-dhs76",
        "collection": "authors",
        "collection_id": "4g66e-dhs76",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20210304-095340677",
        "type": "monograph",
        "title": "Computing the Information Content of Trained Neural Networks",
        "author": [
            {
                "family_name": "Bernstein",
                "given_name": "Jeremy",
                "orcid": "0000-0001-9110-7476",
                "clpid": "Bernstein-Jeremy-D"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "How much information does a learning algorithm extract from the training data and store in a neural network's weights? Too much, and the network would overfit to the training data. Too little, and the network would not fit to anything at all. Na\u00efvely, the amount of information the network stores should scale in proportion to the number of trainable weights. This raises the question: how can neural networks with vastly more weights than training data still generalise? A simple resolution to this conundrum is that the number of weights is usually a bad proxy for the actual amount of information stored. For instance, typical weight vectors may be highly compressible. Then another question occurs: is it possible to compute the actual amount of information stored? This paper derives both a consistent estimator and a closed-form upper bound on the information content of infinitely wide neural networks. The derivation is based on an identification between neural information content and the negative log probability of a Gaussian orthant. This identification yields bounds that analytically control the generalisation behaviour of the entire solution space of infinitely wide networks. The bounds have a simple dependence on both the network architecture and the training data. Corroborating the findings of Valle-P\u00e9rez et al. (2019), who conducted a similar analysis using approximate Gaussian integration techniques, the bounds are found to be both non-vacuous and correlated with the empirical generalisation behaviour at finite width.",
        "doi": "10.48550/arXiv.2103.01045",
        "publisher": "arXiv",
        "publication_date": "2021-03-01"
    },
    {
        "id": "authors:b6a7k-k8085",
        "collection": "authors",
        "collection_id": "b6a7k-k8085",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20210225-132714927",
        "type": "monograph",
        "title": "Disentangling Observed Causal Effects from Latent Confounders using Method of Moments",
        "author": [
            {
                "family_name": "Liu",
                "given_name": "Anqi",
                "clpid": "Liu-Anqi"
            },
            {
                "family_name": "Liu",
                "given_name": "Hao",
                "orcid": "0000-0002-7405-1578",
                "clpid": "Liu-Hao"
            },
            {
                "family_name": "Li",
                "given_name": "Tongxin",
                "orcid": "0000-0002-9806-8964",
                "clpid": "Li-Tongxin"
            },
            {
                "family_name": "Karimi-Bidhendi",
                "given_name": "Saeed",
                "clpid": "Karimi-Bidhendi-Saeed"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Anandkumar",
                "given_name": "Anima",
                "clpid": "Anandkumar-A"
            }
        ],
        "abstract": "Discovering the complete set of causal relations among a group of variables is a challenging unsupervised learning problem. Often, this challenge is compounded by the fact that there are latent or hidden confounders. When only observational data is available, the problem is ill-posed, i.e. the causal relationships are non-identifiable unless strong modeling assumptions are made. When interventions are available, we provide guarantees on identifiability and learnability under mild assumptions. We assume a linear structural equation model (SEM) with independent latent factors and directed acyclic graph (DAG) relationships among the observables. Since the latent variable inference is based on independent component analysis (ICA), we call this model SEM-ICA. We use the method of moments principle to establish model identifiability. We develop efficient algorithms based on coupled tensor decomposition with linear constraints to obtain scalable and guaranteed solutions. Thus, we provide a principled approach to tackling the joint problem of causal discovery and latent variable inference.",
        "doi": "10.48550/arXiv.2101.06614",
        "publisher": "arXiv",
        "publication_date": "2021-01-17"
    },
    {
        "id": "authors:q6qth-h9r71",
        "collection": "authors",
        "collection_id": "q6qth-h9r71",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20210119-161629149",
        "type": "monograph",
        "title": "On the Benefits of Early Fusion in Multimodal Representation Learning",
        "author": [
            {
                "family_name": "Barnum",
                "given_name": "George",
                "clpid": "Barnum-George"
            },
            {
                "family_name": "Talukder",
                "given_name": "Sabera",
                "clpid": "Talukder-Sabera"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Intelligently reasoning about the world often requires integrating data from multiple modalities, as any individual modality may contain unreliable or incomplete information. Prior work in multimodal learning fuses input modalities only after significant independent processing. On the other hand, the brain performs multimodal processing almost immediately. This divide between conventional multimodal learning and neuroscience suggests that a detailed study of early multimodal fusion could improve artificial multimodal representations. To facilitate the study of early multimodal fusion, we create a convolutional LSTM network architecture that simultaneously processes both audio and visual inputs, and allows us to select the layer at which audio and visual information combines. Our results demonstrate that immediate fusion of audio and visual inputs in the initial C-LSTM layer results in higher performing networks that are more robust to the addition of white noise in both audio and visual inputs.",
        "doi": "10.48550/arXiv.2011.07191",
        "publisher": "arXiv",
        "publication_date": "2020-11-14"
    },
    {
        "id": "authors:1bqjb-6nf06",
        "collection": "authors",
        "collection_id": "1bqjb-6nf06",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20210119-161636048",
        "type": "monograph",
        "title": "Architecture Agnostic Neural Networks",
        "author": [
            {
                "family_name": "Talukder",
                "given_name": "Sabera",
                "clpid": "Talukder-Sabera"
            },
            {
                "family_name": "Raghavan",
                "given_name": "Guruprasad",
                "clpid": "Raghavan-Guruprasad"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "In this paper, we explore an alternate method for synthesizing neural network architectures, inspired by the brain's stochastic synaptic pruning. During a person's lifetime, numerous distinct neuronal architectures are responsible for performing the same tasks. This indicates that biological neural networks are, to some degree, architecture agnostic. However, artificial networks rely on their fine-tuned weights and hand-crafted architectures for their remarkable performance. This contrast begs the question: Can we build artificial architecture agnostic neural networks? To ground this study we utilize sparse, binary neural networks that parallel the brain's circuits. Within this sparse, binary paradigm we sample many binary architectures to create families of architecture agnostic neural networks not trained via backpropagation. These high-performing network families share the same sparsity, distribution of binary weights, and succeed in both static and dynamic tasks. In summation, we create an architecture manifold search procedure to discover families or architecture agnostic neural networks.",
        "doi": "10.48550/arXiv.2011.02712",
        "publisher": "arXiv",
        "publication_date": "2020-11-05"
    },
    {
        "id": "authors:nnyhw-95c64",
        "collection": "authors",
        "collection_id": "nnyhw-95c64",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20201110-082106076",
        "type": "monograph",
        "title": "Competitive Control with Delayed Imperfect Information",
        "author": [
            {
                "family_name": "Yu",
                "given_name": "Chenkai",
                "clpid": "Yu-Chenkai"
            },
            {
                "family_name": "Shi",
                "given_name": "Guanya",
                "clpid": "Shi-Guanya"
            },
            {
                "family_name": "Chung",
                "given_name": "Soon-Jo",
                "orcid": "0000-0002-6657-3907",
                "clpid": "Chung-Soon-Jo"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Wierman",
                "given_name": "Adam",
                "clpid": "Wierman-A"
            }
        ],
        "abstract": "This paper studies the impact of imperfect information in online control with adversarial disturbances. In particular, we consider both delayed state feedback and inexact predictions of future disturbances. We introduce a greedy, myopic policy that yields a constant competitive ratio against the offline optimal policy with delayed feedback and inexact predictions. A special case of our result is a constant competitive policy for the case of exact predictions and no delay, a previously open problem. We also analyze the fundamental limits of online control with limited information by showing that our competitive ratio bounds for the greedy, myopic policy in the adversarial setting match (up to lower-order terms) lower bounds in the stochastic setting.",
        "doi": "10.48550/arXiv.2010.11637",
        "publisher": "arXiv",
        "publication_date": "2020-10-22"
    },
    {
        "id": "authors:f8x2p-rtc42",
        "collection": "authors",
        "collection_id": "f8x2p-rtc42",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20201110-082336091",
        "type": "monograph",
        "title": "Iterative Amortized Policy Optimization",
        "author": [
            {
                "family_name": "Marino",
                "given_name": "Joseph",
                "orcid": "0000-0001-6387-8062",
                "clpid": "Marino-Joseph-L"
            },
            {
                "family_name": "Pich\u00e9",
                "given_name": "Alexandre",
                "clpid": "Pich\u00e9-Alexandre"
            },
            {
                "family_name": "Ialongo",
                "given_name": "Alessandro Davide",
                "clpid": "Ialongo-Alessandro-Davide"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Policy networks are a central feature of deep reinforcement learning (RL) algorithms for continuous control, enabling the estimation and sampling of high-value actions. From the variational inference perspective on RL, policy networks, when employed with entropy or KL regularization, are a form of amortized optimization, optimizing network parameters rather than the policy distributions directly. However, this direct amortized mapping can empirically yield suboptimal policy estimates. Given this perspective, we consider the more flexible class of iterative amortized optimizers. We demonstrate that the resulting technique, iterative amortized policy optimization, yields performance improvements over conventional direct amortization methods on benchmark continuous control tasks.",
        "doi": "10.48550/arXiv.2010.10670",
        "publisher": "arXiv",
        "publication_date": "2020-10-20"
    },
    {
        "id": "authors:tjpba-0za37",
        "collection": "authors",
        "collection_id": "tjpba-0za37",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20201106-120148344",
        "type": "monograph",
        "title": "Distributionally Robust Learning for Unsupervised Domain Adaptation",
        "author": [
            {
                "family_name": "Wang",
                "given_name": "Haoxuan",
                "clpid": "Wang-Haoxuan-Shanghai-Jiao-Tong"
            },
            {
                "family_name": "Liu",
                "given_name": "Anqi",
                "clpid": "Liu-Anqi"
            },
            {
                "family_name": "Yu",
                "given_name": "Zhiding",
                "clpid": "Yu-Zhiding"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Anandkumar",
                "given_name": "Anima",
                "clpid": "Anandkumar-A"
            }
        ],
        "abstract": "We propose a distributionally robust learning (DRL) method for unsupervised domain adaptation (UDA) that scales to modern computer vision benchmarks. DRL can be naturally formulated as a competitive two-player game between a predictor and an adversary that is allowed to corrupt the labels, subject to certain constraints, and reduces to incorporating a density ratio between the source and target domains (under the standard log loss). This formulation motivates the use of two neural networks that are jointly trained - a discriminative network between the source and target domains for density-ratio estimation, in addition to the standard classification network. The use of a density ratio in DRL prevents the model from being overconfident on target inputs far away from the source domain. Thus, DRL provides conservative confidence estimation in the target domain, even when the target labels are not available. This conservatism motivates the use of DRL in self-training for sample selection, and we term the approach distributionally robust self-training (DRST). In our experiments, DRST generates more calibrated probabilities and achieves state-of-the-art self-training accuracy on benchmark datasets. We demonstrate that DRST captures shape features more effectively, and reduces the extent of distributional shift during self-training.",
        "doi": "10.48550/arXiv.2010.05784",
        "publisher": "arXiv",
        "publication_date": "2020-10-08"
    },
    {
        "id": "authors:hg146-9p613",
        "collection": "authors",
        "collection_id": "hg146-9p613",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20201110-085241409",
        "type": "monograph",
        "title": "Learning Differentiable Programs with Admissible Neural Heuristics",
        "author": [
            {
                "family_name": "Shah",
                "given_name": "Ameesh",
                "clpid": "Shah-Ameesh"
            },
            {
                "family_name": "Zhan",
                "given_name": "Eric",
                "clpid": "Zhan-Eric"
            },
            {
                "family_name": "Sun",
                "given_name": "Jennifer J.",
                "orcid": "0000-0002-0906-6589",
                "clpid": "Sun-Jennifer-J"
            },
            {
                "family_name": "Verma",
                "given_name": "Abhinav",
                "orcid": "0000-0002-9820-8285",
                "clpid": "Verma-Abhinav"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Chaudhuri",
                "given_name": "Swarat",
                "clpid": "Chaudhuri-Swarat"
            }
        ],
        "abstract": "We study the problem of learning differentiable functions expressed as programs in a domain-specific language. Such programmatic models can offer benefits such as composability and interpretability; however, learning them requires optimizing over a combinatorial space of program \"architectures\". We frame this optimization problem as a search in a weighted graph whose paths encode top-down derivations of program syntax. Our key innovation is to view various classes of neural networks as continuous relaxations over the space of programs, which can then be used to complete any partial program. This relaxed program is differentiable and can be trained end-to-end, and the resulting training loss is an approximately admissible heuristic that can guide the combinatorial search. We instantiate our approach on top of the A-star algorithm and an iteratively deepened branch-and-bound search, and use these algorithms to learn programmatic classifiers in three sequence classification tasks. Our experiments show that the algorithms outperform state-of-the-art methods for program learning, and that they discover programmatic classifiers that yield natural interpretations and achieve competitive accuracy.",
        "doi": "10.48550/arXiv.2007.12101",
        "publisher": "arXiv",
        "publication_date": "2020-07-23"
    },
    {
        "id": "authors:fv4kv-5ky22",
        "collection": "authors",
        "collection_id": "fv4kv-5ky22",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20201110-154207213",
        "type": "monograph",
        "title": "Graph Neural Networks for the Prediction of Substrate-Specific Organic Reaction Conditions",
        "author": [
            {
                "family_name": "Ryou",
                "given_name": "Serim",
                "clpid": "Ryou-Serim"
            },
            {
                "family_name": "Maser",
                "given_name": "Michael R.",
                "orcid": "0000-0001-7895-7804",
                "clpid": "Maser-Michael-R"
            },
            {
                "family_name": "Cui",
                "given_name": "Alexander Y.",
                "clpid": "Cui-Alexander-Y"
            },
            {
                "family_name": "DeLano",
                "given_name": "Travis J.",
                "orcid": "0000-0002-2052-611X",
                "clpid": "DeLano-Travis-J"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Reisman",
                "given_name": "Sarah E.",
                "orcid": "0000-0001-8244-9300",
                "clpid": "Reisman-S-E"
            }
        ],
        "abstract": "We present a systematic investigation using graph neural networks (GNNs) to model organic chemical reactions. To do so, we prepared a dataset collection of four ubiquitous reactions from the organic chemistry literature. We evaluate seven different GNN architectures for classification tasks pertaining to the identification of experimental reagents and conditions. We find that models are able to identify specific graph features that affect reaction conditions and lead to accurate predictions. The results herein show great promise in advancing molecular machine learning.",
        "doi": "10.48550/arXiv.2007.04275",
        "publisher": "arXiv",
        "publication_date": "2020-07-08"
    },
    {
        "id": "authors:4ax9n-s2b88",
        "collection": "authors",
        "collection_id": "4ax9n-s2b88",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20201111-071759033",
        "type": "monograph",
        "title": "Average-case Complexity of Teaching Convex Polytopes via Halfspace Queries",
        "author": [
            {
                "family_name": "Kumar",
                "given_name": "Akash",
                "clpid": "Kumar-Akash"
            },
            {
                "family_name": "Singla",
                "given_name": "Adish",
                "clpid": "Singla-Adish"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Chen",
                "given_name": "Yuxin",
                "clpid": "Chen-Yuxin"
            }
        ],
        "abstract": "We examine the task of locating a target region among those induced by intersections of n halfspaces in R^d. This generic task connects to fundamental machine learning problems, such as training a perceptron and learning a \u03d5-separable dichotomy. We investigate the average teaching complexity of the task, i.e., the minimal number of samples (halfspace queries) required by a teacher to help a version-space learner in locating a randomly selected target. As our main result, we show that the average-case teaching complexity is \u0398(d), which is in sharp contrast to the worst-case teaching complexity of \u0398(n). If instead, we consider the average-case learning complexity, the bounds have a dependency on n as \u0398(n) for i.i.d. queries and \u0398(dlog(n)) for actively chosen queries by the learner. Our proof techniques are based on novel insights from computational geometry, which allow us to count the number of convex polytopes and faces in a Euclidean space depending on the arrangement of halfspaces. Our insights allow us to establish a tight bound on the average-case complexity for \u03d5-separable dichotomies, which generalizes the known O(d) bound on the average number of \"extreme patterns\" in the classical computational geometry literature (Cover, 1965).",
        "doi": "10.48550/arXiv.2006.14677",
        "publisher": "arXiv",
        "publication_date": "2020-06-25"
    },
    {
        "id": "authors:hzymj-wxd44",
        "collection": "authors",
        "collection_id": "hzymj-wxd44",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20201106-120215567",
        "type": "monograph",
        "title": "Competitive Policy Optimization",
        "author": [
            {
                "family_name": "Prajapat",
                "given_name": "Manish",
                "orcid": "0000-0002-3867-4575",
                "clpid": "Prajapat-Manish"
            },
            {
                "family_name": "Azizzadenesheli",
                "given_name": "Kamyar",
                "orcid": "0000-0001-8507-1868",
                "clpid": "Azizzadenesheli-Kamyar"
            },
            {
                "family_name": "Liniger",
                "given_name": "Alexander",
                "orcid": "0000-0002-7858-7900",
                "clpid": "Liniger-Alexander"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Anandkumar",
                "given_name": "Anima",
                "orcid": "0000-0002-6974-6797",
                "clpid": "Anandkumar-A"
            }
        ],
        "abstract": "A core challenge in policy optimization in competitive Markov decision processes is the design of efficient optimization methods with desirable convergence and stability properties. To tackle this, we propose competitive policy optimization (CoPO), a novel policy gradient approach that exploits the game-theoretic nature of competitive games to derive policy updates. Motivated by the competitive gradient optimization method, we derive a bilinear approximation of the game objective. In contrast, off-the-shelf policy gradient methods utilize only linear approximations, and hence do not capture interactions among the players. We instantiate CoPO in two ways:(i) competitive policy gradient, and (ii) trust-region competitive policy optimization. We theoretically study these methods, and empirically investigate their behavior on a set of comprehensive, yet challenging, competitive games. We observe that they provide stable optimization, convergence to sophisticated strategies, and higher scores when played against baseline policy gradient methods.",
        "doi": "10.48550/arXiv.2006.10611",
        "publisher": "arXiv",
        "publication_date": "2020-06-18"
    },
    {
        "id": "authors:vk7vz-zp479",
        "collection": "authors",
        "collection_id": "vk7vz-zp479",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200707-094715120",
        "type": "monograph",
        "title": "The Power of Predictions in Online Control",
        "author": [
            {
                "family_name": "Yu",
                "given_name": "Chenkai",
                "orcid": "0000-0001-8683-7773",
                "clpid": "Yu-Chenkai"
            },
            {
                "family_name": "Shi",
                "given_name": "Guanya",
                "orcid": "0000-0002-9075-3705",
                "clpid": "Shi-Guanya"
            },
            {
                "family_name": "Chung",
                "given_name": "Soon-Jo",
                "orcid": "0000-0002-6657-3907",
                "clpid": "Chung-Soon-Jo"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Wierman",
                "given_name": "Adam",
                "orcid": "0000-0002-5923-0199",
                "clpid": "Wierman-A"
            }
        ],
        "abstract": "We study the impact of predictions in online Linear Quadratic Regulator control with both stochastic and adversarial disturbances in the dynamics. In both settings, we characterize the optimal policy and derive tight bounds on the minimum cost and dynamic regret. Perhaps surprisingly, our analysis shows that the conventional greedy MPC approach is a near-optimal policy in both stochastic and adversarial settings. Specifically, for length-T problems, MPC requires only O(logT) predictions to reach O(1) dynamic regret, which matches (up to lower-order terms) our lower bound on the required prediction horizon for constant regret.",
        "doi": "10.48550/arXiv.2006.07569",
        "publisher": "arXiv",
        "publication_date": "2020-06-13"
    },
    {
        "id": "authors:7sv9q-5fv63",
        "collection": "authors",
        "collection_id": "7sv9q-5fv63",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200526-151215262",
        "type": "monograph",
        "title": "A General Large Neighborhood Search Framework for Solving Integer Programs",
        "author": [
            {
                "family_name": "Song",
                "given_name": "Jialin",
                "clpid": "Song-Jialin"
            },
            {
                "family_name": "Lanka",
                "given_name": "Ravi",
                "clpid": "Lanka-Ravi"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Dilkina",
                "given_name": "Bistra",
                "orcid": "0000-0002-6784-473X",
                "clpid": "Dilkina-Bistra"
            }
        ],
        "abstract": "This paper studies how to design abstractions of large-scale combinatorial optimization problems that can leverage existing state-of-the-art solvers in general purpose ways, and that are amenable to data-driven design. The goal is to arrive at new approaches that can reliably outperform existing solvers in wall-clock time. We focus on solving integer programs, and ground our approach in the large neighborhood search (LNS) paradigm, which iteratively chooses a subset of variables to optimize while leaving the remainder fixed. The appeal of LNS is that it can easily use any existing solver as a subroutine, and thus can inherit the benefits of carefully engineered heuristic approaches and their software implementations. We also show that one can learn a good neighborhood selector from training data. Through an extensive empirical validation, we demonstrate that our LNS framework can significantly outperform, in wall-clock time, compared to state-of-the-art commercial solvers such as Gurobi.",
        "doi": "10.48550/arXiv.2004.00422",
        "publisher": "arXiv",
        "publication_date": "2020-03-29"
    },
    {
        "id": "authors:xrywz-v0w33",
        "collection": "authors",
        "collection_id": "xrywz-v0w33",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200214-105606928",
        "type": "monograph",
        "title": "Online Optimization with Memory and Competitive Control",
        "author": [
            {
                "family_name": "Shi",
                "given_name": "Guanya",
                "orcid": "0000-0002-9075-3705",
                "clpid": "Shi-Guanya"
            },
            {
                "family_name": "Lin",
                "given_name": "Yiheng",
                "orcid": "0000-0001-6524-2877",
                "clpid": "Lin-Yiheng"
            },
            {
                "family_name": "Chung",
                "given_name": "Soon-Jo",
                "orcid": "0000-0002-6657-3907",
                "clpid": "Chung-Soon-Jo"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Wierman",
                "given_name": "Adam",
                "orcid": "0000-0002-5923-0199",
                "clpid": "Wierman-A"
            }
        ],
        "abstract": "This paper presents competitive algorithms for a novel class of online optimization problems with memory. We consider a setting where the learner seeks to minimize the sum of a hitting cost and a switching cost that depends on the previous p decisions. This setting generalizes Smoothed Online Convex Optimization. The proposed approach, Optimistic Regularized Online Balanced Descent, achieves a constant, dimension-free competitive ratio. Further, we show a connection between online optimization with memory and online control with adversarial disturbances. This connection, in turn, leads to a new constant-competitive policy for a rich class of online control problems.",
        "doi": "10.48550/arXiv.2002.05318",
        "publisher": "arXiv",
        "publication_date": "2020-02-13"
    },
    {
        "id": "authors:aptqa-b9c21",
        "collection": "authors",
        "collection_id": "aptqa-b9c21",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200214-105610460",
        "type": "monograph",
        "title": "Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis",
        "author": [
            {
                "family_name": "Park",
                "given_name": "Jung Yeon",
                "clpid": "Park-Jung-Yeon"
            },
            {
                "family_name": "Carr",
                "given_name": "Kenneth Theo",
                "clpid": "Carr-K-T"
            },
            {
                "family_name": "Zhang",
                "given_name": "Stephan",
                "clpid": "Zhang-Stephan"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Yu",
                "given_name": "Rose",
                "clpid": "Yu-Rose"
            }
        ],
        "abstract": "Efficient and interpretable spatial analysis is crucial in many fields such as geology, sports, and climate science. Large-scale spatial data often contains complex higher-order correlations across features and locations. While tensor latent factor models can describe higher-order correlations, they are inherently computationally expensive to train. Furthermore, for spatial analysis, these models should not only be predictive but also be spatially coherent. However, latent factor models are sensitive to initialization and can yield inexplicable results. We develop a novel Multi-resolution Tensor Learning (MRTL) algorithm for efficiently learning interpretable spatial patterns. MRTL initializes the latent factors from an approximate full-rank tensor model for improved interpretability and progressively learns from a coarse resolution to the fine resolution for an enormous computation speedup. We also prove the theoretical convergence and computational complexity of MRTL. When applied to two real-world datasets, MRTL demonstrates 4 ~ 5 times speedup compared to a fixed resolution while yielding accurate and interpretable models.",
        "doi": "10.48550/arXiv.2002.05578",
        "publisher": "arXiv",
        "publication_date": "2020-02-13"
    },
    {
        "id": "authors:wyfv9-end27",
        "collection": "authors",
        "collection_id": "wyfv9-end27",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200214-105602886",
        "type": "monograph",
        "title": "On the distance between two neural networks and the stability of learning",
        "author": [
            {
                "family_name": "Bernstein",
                "given_name": "Jeremy",
                "orcid": "0000-0001-9110-7476",
                "clpid": "Bernstein-Jeremy-D"
            },
            {
                "family_name": "Vahdat",
                "given_name": "Arash",
                "clpid": "Vahdat-Arash"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Liu",
                "given_name": "Ming-Yu",
                "orcid": "0000-0002-2951-2398",
                "clpid": "Liu-Ming-Yu"
            }
        ],
        "abstract": "This paper relates parameter distance to gradient breakdown for a broad class of nonlinear compositional functions. The analysis leads to a new distance function called deep relative trust and a descent lemma for neural networks. Since the resulting learning rule seems to require little to no learning rate tuning, it may unlock a simpler workflow for training deeper and more complex neural networks. The Python code used in this paper is here:  https://github.com/jxbz/fromage",
        "doi": "10.48550/arXiv.2002.03432",
        "publisher": "arXiv",
        "publication_date": "2020-02-09"
    },
    {
        "id": "authors:6vcht-tev43",
        "collection": "authors",
        "collection_id": "6vcht-tev43",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200109-100747650",
        "type": "monograph",
        "title": "Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning",
        "author": [
            {
                "family_name": "Voloshin",
                "given_name": "Cameron",
                "clpid": "Voloshin-C"
            },
            {
                "family_name": "Le",
                "given_name": "Hoang M.",
                "clpid": "Le-Hoang-M"
            },
            {
                "family_name": "Jiang",
                "given_name": "Nan",
                "clpid": "Jiang-Nan"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Off-policy policy evaluation (OPE) is the problem of estimating the online performance of a policy using only pre-collected historical data generated by another policy. Given the increasing interest in deploying learning-based methods for safety-critical applications, many recent OPE methods have recently been proposed. Due to disparate experimental conditions from recent literature, the relative performance of current OPE methods is not well understood. In this work, we present the first comprehensive empirical analysis of a broad suite of OPE methods. Based on thousands of experiments and detailed empirical analyses, we offer a summarized set of guidelines for effectively using OPE in practice, and suggest directions for future research.",
        "doi": "10.48550/arXiv.1911.06854",
        "publisher": "arXiv",
        "publication_date": "2019-11-15"
    },
    {
        "id": "authors:h450g-t2m88",
        "collection": "authors",
        "collection_id": "h450g-t2m88",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200109-085907638",
        "type": "monograph",
        "title": "Triply Robust Off-Policy Evaluation",
        "author": [
            {
                "family_name": "Liu",
                "given_name": "Anqi",
                "clpid": "Liu-Anqi"
            },
            {
                "family_name": "Liu",
                "given_name": "Hao",
                "clpid": "Liu-Hao"
            },
            {
                "family_name": "Anandkumar",
                "given_name": "Anima",
                "clpid": "Anandkumar-A"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We propose a robust regression approach to off-policy evaluation (OPE) for contextual bandits. We frame OPE as a covariate-shift problem and leverage modern robust regression tools. Ours is a general approach that can be used to augment any existing OPE method that utilizes the direct method. When augmenting doubly robust methods, we call the resulting method Triply Robust. We prove upper bounds on the resulting bias and variance, as well as derive novel minimax bounds based on robust minimax analysis for covariate shift. Our robust regression method is compatible with deep learning, and is thus applicable to complex OPE settings that require powerful function approximators. Finally, we demonstrate superior empirical performance across the standard OPE benchmarks, especially in the case where the logging policy is unknown and must be estimated from data.",
        "doi": "10.48550/arXiv.1911.05811",
        "publisher": "arXiv",
        "publication_date": "2019-11-13"
    },
    {
        "id": "authors:ndz25-g8638",
        "collection": "authors",
        "collection_id": "ndz25-g8638",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20200109-101924329",
        "type": "monograph",
        "title": "Learning Calibratable Policies using Programmatic Style-Consistency",
        "author": [
            {
                "family_name": "Zhan",
                "given_name": "Eric",
                "clpid": "Zhan-Eric"
            },
            {
                "family_name": "Tseng",
                "given_name": "Albert",
                "clpid": "Tseng-Albert"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Swaminathan",
                "given_name": "Adith",
                "orcid": "0000-0001-9935-6530",
                "clpid": "Swaminathan-A"
            },
            {
                "family_name": "Hausknecht",
                "given_name": "Matthew",
                "clpid": "Hausknecht-M"
            }
        ],
        "abstract": "We study the important and challenging problem of controllable generation of long-term sequential behaviors. Solutions to this problem would impact many applications, such as calibrating behaviors of AI agents in games or predicting player trajectories in sports. In contrast to the well-studied areas of controllable generation of images, text, and speech, there are significant challenges that are unique to or exacerbated by generating long-term behaviors: how should we specify the factors of variation to control, and how can we ensure that the generated temporal behavior faithfully demonstrates diverse styles? In this paper, we leverage large amounts of raw behavioral data to learn policies that can be calibrated to generate a diverse range of behavior styles (e.g., aggressive versus passive play in sports). Inspired by recent work on leveraging programmatic labeling functions, we present a novel framework that combines imitation learning with data programming to learn style-calibratable policies. Our primary technical contribution is a formal notion of style-consistency as a learning objective, and its integration with conventional imitation learning approaches. We evaluate our framework using demonstrations from professional basketball players and agents in the MuJoCo physics environment, and show that our learned policies can be accurately calibrated to generate interesting behavior styles in both domains.",
        "doi": "10.48550/arXiv.1910.01179",
        "publisher": "arXiv",
        "publication_date": "2019-10-02"
    },
    {
        "id": "authors:kws8t-j2v54",
        "collection": "authors",
        "collection_id": "kws8t-j2v54",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190905-154310582",
        "type": "monograph",
        "title": "Co-training for Policy Learning",
        "author": [
            {
                "family_name": "Song",
                "given_name": "Jialin",
                "clpid": "Song-Jialin"
            },
            {
                "family_name": "Lanka",
                "given_name": "Ravi",
                "clpid": "Lanka-R"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Ono",
                "given_name": "Masahiro",
                "clpid": "Ono-Masahiro"
            }
        ],
        "abstract": "We study the problem of learning sequential decision-making policies in settings with multiple state-action representations. Such settings naturally arise in many domains, such as planning (e.g., multiple integer programming formulations) and various combinatorial optimization problems (e.g., those with both integer programming and graph-based formulations). Inspired by the classical co-training framework for classification, we study the problem of co-training for policy learning. We present sufficient conditions under which learning from two views can improve upon learning from a single view alone. Motivated by these theoretical insights, we present a meta-algorithm for co-training for sequential decision making. Our framework is compatible with both reinforcement learning and imitation learning. We validate the effectiveness of our approach across a wide range of tasks, including discrete/continuous control and combinatorial optimization.",
        "doi": "10.48550/arXiv.1907.04484",
        "publisher": "arXiv",
        "publication_date": "2019-07-03"
    },
    {
        "id": "authors:607xd-w9259",
        "collection": "authors",
        "collection_id": "607xd-w9259",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190905-154307157",
        "type": "monograph",
        "title": "Robust Regression for Safe Exploration in Control",
        "author": [
            {
                "family_name": "Liu",
                "given_name": "Anqi",
                "clpid": "Liu-Anqi"
            },
            {
                "family_name": "Shi",
                "given_name": "Guanya",
                "clpid": "Shi-Guanya"
            },
            {
                "family_name": "Chung",
                "given_name": "Soon-Jo",
                "orcid": "0000-0002-6657-3907",
                "clpid": "Chung-Soon-Jo"
            },
            {
                "family_name": "Anandkumar",
                "given_name": "Anima",
                "clpid": "Anandkumar-A"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We study the problem of safe learning and exploration in sequential control problems. The goal is to safely collect data samples from an operating environment to learn an optimal controller. A central challenge in this setting is how to quantify uncertainty in order to choose provably-safe actions that allow us to collect useful data and reduce uncertainty, thereby achieving both improved safety and optimality. To address this challenge, we present a deep robust regression model that is trained to directly predict the uncertainty bounds for safe exploration. We then show how to integrate our robust regression approach with model-based control methods by learning a dynamic model with robustness bounds. We derive generalization bounds under domain shifts for learning and connect them with safety and stability bounds in control. We demonstrate empirically that our robust regression approach can outperform conventional Gaussian process (GP) based safe exploration in settings where it is difficult to specify a good GP prior.",
        "doi": "10.48550/arXiv.1906.05819",
        "publisher": "arXiv",
        "publication_date": "2019-06-13"
    },
    {
        "id": "authors:amrte-t0y10",
        "collection": "authors",
        "collection_id": "amrte-t0y10",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190205-101105728",
        "type": "monograph",
        "title": "Optimizing Photonic Nanostructures via Multi-fidelity Gaussian Processes",
        "author": [
            {
                "family_name": "Song",
                "given_name": "Jialin",
                "clpid": "Song-Jialin"
            },
            {
                "family_name": "Tokpanov",
                "given_name": "Yury S.",
                "clpid": "Tokpanov-Y-S"
            },
            {
                "family_name": "Chen",
                "given_name": "Yuxin",
                "clpid": "Chen-Yuxin"
            },
            {
                "family_name": "Fleischman",
                "given_name": "Dagny",
                "clpid": "Fleischman-D"
            },
            {
                "family_name": "Fountaine",
                "given_name": "Kate T.",
                "orcid": "0000-0002-0414-8227",
                "clpid": "Fountaine-K-T"
            },
            {
                "family_name": "Atwater",
                "given_name": "Harry A.",
                "orcid": "0000-0001-9435-0201",
                "clpid": "Atwater-H-A"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We apply numerical methods in combination with finite-difference-time-domain (FDTD) simulations to optimize transmission properties of plasmonic mirror color filters using a multi-objective figure of merit over a five-dimensional parameter space by utilizing novel multi-fidelity Gaussian processes approach. We compare these results with conventional derivative-free global search algorithms, such as (single-fidelity) Gaussian Processes optimization scheme, and Particle Swarm Optimization---a commonly used method in nanophotonics community, which is implemented in Lumerical commercial photonics software. We demonstrate the performance of various numerical optimization approaches on several pre-collected real-world datasets and show that by properly trading off expensive information sources with cheap simulations, one can more effectively optimize the transmission properties with a fixed budget.",
        "doi": "10.48550/arXiv.1811.07707",
        "publisher": "arXiv",
        "publication_date": "2018-11-15"
    },
    {
        "id": "authors:g4q3m-mgf65",
        "collection": "authors",
        "collection_id": "g4q3m-mgf65",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190205-111204454",
        "type": "monograph",
        "title": "Learning to Search via Retrospective Imitation",
        "author": [
            {
                "family_name": "Song",
                "given_name": "Jialin",
                "clpid": "Song-Jialin"
            },
            {
                "family_name": "Lanka",
                "given_name": "Ravi",
                "clpid": "Lanka-R"
            },
            {
                "family_name": "Zhao",
                "given_name": "Albert",
                "clpid": "Zhao-Albert"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Ono",
                "given_name": "Masahiro",
                "clpid": "Ono-Masahiro"
            }
        ],
        "abstract": "We study the problem of learning a good search policy from demonstrations for combinatorial search spaces. We propose retrospective imitation learning, which, after initial training by an expert, improves itself by learning from its own retrospective solutions. That is, when the policy eventually reaches a feasible solution in a search tree after making mistakes and backtracks, it retrospectively constructs an improved search trace to the solution by removing backtracks, which is then used to further train the policy. A key feature of our approach is that it can iteratively scale up, or transfer, to larger problem sizes than the initial expert demonstrations, thus dramatically expanding its applicability beyond that of conventional imitation learning. We showcase the effectiveness of our approach on two tasks: synthetic maze solving, and integer program based risk-aware path planning.",
        "doi": "10.48550/arXiv.1804.00846",
        "publisher": "arXiv",
        "publication_date": "2018-04-03"
    },
    {
        "id": "authors:jc24z-mgw41",
        "collection": "authors",
        "collection_id": "jc24z-mgw41",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190205-111434225",
        "type": "monograph",
        "title": "Generative Multi-Agent Behavioral Cloning",
        "author": [
            {
                "family_name": "Zhan",
                "given_name": "Eric",
                "clpid": "Zhan-Eric"
            },
            {
                "family_name": "Zheng",
                "given_name": "Stephan",
                "clpid": "Zheng-Stephan"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Sha",
                "given_name": "Long",
                "clpid": "Sha-Long"
            },
            {
                "family_name": "Lucey",
                "given_name": "Patrick",
                "clpid": "Lucey-P"
            }
        ],
        "abstract": "We propose and study the problem of generative multi-agent behavioral cloning, where the goal is to learn a generative, i.e., non-deterministic, multi-agent policy from pre-collected demonstration data. Building upon advances in deep generative models, we present a hierarchical policy framework that can tractably learn complex mappings from input states to distributions over multi-agent action spaces by introducing a hierarchy with macro-intent variables that encode long-term intent. In addition to synthetic settings, we show how to instantiate our framework to effectively model complex interactions between basketball players and generate realistic multi-agent trajectories of basketball gameplay over long time periods. We validate our approach using both quantitative and qualitative evaluations, including a user study comparison conducted with professional sports analysts.",
        "doi": "10.48550/arXiv.1803.07612",
        "publisher": "arXiv",
        "publication_date": "2018-03-20"
    },
    {
        "id": "authors:687er-8c179",
        "collection": "authors",
        "collection_id": "687er-8c179",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190205-112328842",
        "type": "monograph",
        "title": "Detecting Adversarial Examples via Neural Fingerprinting",
        "author": [
            {
                "family_name": "Dathathri",
                "given_name": "Sumanth",
                "clpid": "Dathathri-S"
            },
            {
                "family_name": "Zheng",
                "given_name": "Stephan",
                "clpid": "Zheng-Stephan"
            },
            {
                "family_name": "Murray",
                "given_name": "Richard M.",
                "orcid": "0000-0002-5785-7481",
                "clpid": "Murray-R-M"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "Deep neural networks are vulnerable to adversarial examples, which dramatically alter model output using small input changes. We propose Neural Fingerprinting, a simple, yet effective method to detect adversarial examples by verifying whether model behavior is consistent with a set of secret fingerprints, inspired by the use of biometric and cryptographic signatures. The benefits of our method are that 1) it is fast, 2) it is prohibitively expensive for an attacker to reverse-engineer which fingerprints were used, and 3) it does not assume knowledge of the adversary. In this work, we pose a formal framework to analyze fingerprints under various threat models, and characterize Neural Fingerprinting for linear models. For complex neural networks, we empirically demonstrate that Neural Fingerprinting significantly improves on state-of-the-art detection mechanisms by detecting the strongest known adversarial attacks with 98-100% AUC-ROC scores on the MNIST, CIFAR-10 and MiniImagenet (20 classes) datasets. In particular, the detection accuracy of Neural Fingerprinting generalizes well to unseen test-data under various black- and whitebox threat models, and is robust over a wide range of hyperparameters and choices of fingerprints.",
        "doi": "10.48550/arXiv.1803.03870",
        "publisher": "arXiv",
        "publication_date": "2018-03-11"
    },
    {
        "id": "authors:c1zrc-dn749",
        "collection": "authors",
        "collection_id": "c1zrc-dn749",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20181023-101356776",
        "type": "monograph",
        "title": "Multi-resolution Tensor Learning for Large-Scale Spatial Data",
        "author": [
            {
                "family_name": "Zheng",
                "given_name": "Stephan",
                "clpid": "Zheng-Stephan"
            },
            {
                "family_name": "Yu",
                "given_name": "Rose",
                "clpid": "Yu-Rose"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "High-dimensional tensor models are notoriously computationally expensive to train. We present a meta-learning algorithm, MMT, that can significantly speed up the process for spatial tensor models. MMT leverages the property that spatial data can be viewed at multiple resolutions, which are related by coarsening and finegraining from one resolution to another. Using this property, MMT learns a tensor model by starting from a coarse resolution and iteratively increasing the model complexity. In order to not \"over-train\" on coarse resolution models, we investigate an information-theoretic fine-graining criterion to decide when to transition into higher-resolution models. We provide both theoretical and empirical evidence for the advantages of this approach. When applied to two real-world large-scale spatial datasets for basketball player and animal behavior modeling, our approach demonstrate 3 key benefits: 1) it efficiently captures higher-order interactions (i.e., tensor latent factors), 2) it is orders of magnitude faster than fixed resolution learning and scales to very fine-grained spatial resolutions, and 3) it reliably yields accurate and interpretable models.",
        "doi": "10.48550/arXiv.1802.06825",
        "publication_date": "2018-02-19"
    },
    {
        "id": "authors:j9tmb-65s22",
        "collection": "authors",
        "collection_id": "j9tmb-65s22",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190205-113450468",
        "type": "monograph",
        "title": "Long-term Forecasting using Tensor-Train RNNs",
        "author": [
            {
                "family_name": "Yu",
                "given_name": "Rose",
                "clpid": "Yu-Rose"
            },
            {
                "family_name": "Zheng",
                "given_name": "Stephan",
                "clpid": "Zheng-Stephan"
            },
            {
                "family_name": "Anandkumar",
                "given_name": "Anima",
                "clpid": "Anandkumar-A"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "We present Tensor-Train RNN (TT-RNN), a novel family of neural sequence architectures for multivariate forecasting in environments with nonlinear dynamics. Long-term forecasting in such systems is highly challenging, since there exist long-term temporal dependencies, higher-order correlations and sensitivity to error propagation. Our proposed tensor recurrent architecture addresses these issues by learning the nonlinear dynamics directly using higher order moments and high-order state transition functions. Furthermore, we decompose the higher-order structure using the tensor-train (TT) decomposition to reduce the number of parameters while preserving the model performance. We theoretically establish the approximation properties of Tensor-Train RNNs for general sequence inputs, and such guarantees are not available for usual RNNs. We also demonstrate significant long-term prediction improvements over general RNN and LSTM architectures on a range of simulated environments with nonlinear dynamics, as well on real-world climate and traffic data.",
        "doi": "10.48550/arXiv.1711.00073",
        "publisher": "arXiv",
        "publication_date": "2017-10-31"
    },
    {
        "id": "authors:53p9x-0y271",
        "collection": "authors",
        "collection_id": "53p9x-0y271",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190205-113745110",
        "type": "monograph",
        "title": "Fine-Grained Retrieval of Sports Plays using Tree-Based Alignment of Trajectories",
        "author": [
            {
                "family_name": "Sha",
                "given_name": "Long",
                "clpid": "Sha-Long"
            },
            {
                "family_name": "Lucey",
                "given_name": "Patrick",
                "clpid": "Lucey-P"
            },
            {
                "family_name": "Zheng",
                "given_name": "Stephan",
                "clpid": "Zheng-Stephan"
            },
            {
                "family_name": "Kim",
                "given_name": "Taehwan",
                "clpid": "Kim-Taehwan"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Sridharan",
                "given_name": "Sridha",
                "clpid": "Sridharan-S"
            }
        ],
        "abstract": "We propose a novel method for effective retrieval of multi-agent spatiotemporal tracking data. Retrieval of spatiotemporal tracking data offers several unique challenges compared to conventional text-based retrieval settings. Most notably, the data is fine-grained meaning that the specific location of agents is important in describing behavior. Additionally, the data often contains tracks of multiple agents (e.g., multiple players in a sports game), which generally leads to a permutational alignment problem when performing relevance estimation. Due to the frequent position swap of agents, it is difficult to maintain the correspondence of agents, and such issues make the pairwise comparison problematic for multi-agent spatiotemporal data. To address this issue, we propose a tree-based method to estimate the relevance between multi-agent spatiotemporal tracks. It uses a hierarchical structure to perform multi-agent data alignment and partitioning in a coarse-to-fine fashion. We validate our approach via user studies with domain experts. Our results show that our method boosts performance in retrieving similar sports plays -- especially in interactive situations where the user selects a subset of trajectories compared to current state-of-the-art methods.",
        "doi": "10.48550/arXiv.1710.02255",
        "publisher": "arXiv",
        "publication_date": "2017-10-06"
    },
    {
        "id": "authors:da6hd-4mf72",
        "collection": "authors",
        "collection_id": "da6hd-4mf72",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190205-133559444",
        "type": "monograph",
        "title": "Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces",
        "author": [
            {
                "family_name": "Sui",
                "given_name": "Yanan",
                "orcid": "0000-0002-9480-627X",
                "clpid": "Sui-Yanan"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            },
            {
                "family_name": "Burdick",
                "given_name": "Joel W.",
                "clpid": "Burdick-J-W"
            }
        ],
        "abstract": "We consider sequential decision making under uncertainty, where the goal is to optimize over a large decision space using noisy comparative feedback. This problem can be formulated as a K-armed Dueling Bandits problem where K is the total number of decisions. When K is very large, existing dueling bandits algorithms suffer huge cumulative regret before converging on the optimal arm. This paper studies the dueling bandits problem with a large number of arms that exhibit a low-dimensional correlation structure. Our problem is motivated by a clinical decision making process in large decision space. We propose an efficient algorithm CorrDuel which optimizes the exploration/exploitation tradeoff in this large decision space of clinical treatments. More broadly, our approach can be applied to other sequential decision problems with large and structured decision spaces. We derive regret bounds, and evaluate performance in simulation experiments as well as on a live clinical trial of therapeutic spinal cord stimulation. To our knowledge, this marks the first time an online learning algorithm was applied towards spinal cord injury treatments. Our experimental results show the effectiveness and efficiency of our approach.",
        "doi": "10.48550/arXiv.1707.02375",
        "publisher": "arXiv",
        "publication_date": "2017-07-08"
    },
    {
        "id": "authors:egwzd-zz505",
        "collection": "authors",
        "collection_id": "egwzd-zz505",
        "cite_using_url": "https://resolver.caltech.edu/CaltechAUTHORS:20190410-120658254",
        "type": "monograph",
        "title": "Multi-dueling Bandits with Dependent Arms",
        "author": [
            {
                "family_name": "Sui",
                "given_name": "Yanan",
                "orcid": "0000-0002-9480-627X",
                "clpid": "Sui-Yanan"
            },
            {
                "family_name": "Zhuang",
                "given_name": "Vincent",
                "clpid": "Zhuang-Vincent"
            },
            {
                "family_name": "Burdick",
                "given_name": "Joel W.",
                "clpid": "Burdick-J-W"
            },
            {
                "family_name": "Yue",
                "given_name": "Yisong",
                "orcid": "0000-0001-9127-1989",
                "clpid": "Yue-Yisong"
            }
        ],
        "abstract": "The dueling bandits problem is an online learning framework for learning from pairwise preference feedback, and is particularly well-suited for modeling settings that elicit subjective or implicit human feedback. In this paper, we study the problem of multi-dueling bandits with dependent arms, which extends the original dueling bandits setting by simultaneously dueling multiple arms as well as modeling dependencies between arms. These extensions capture key characteristics found in many real-world applications, and allow for the opportunity to develop significantly more efficient algorithms than were possible in the original setting. We propose the selfsparring algorithm, which reduces the multi-dueling bandits problem to a conventional bandit setting that can be solved using a stochastic bandit algorithm such as Thompson Sampling, and can naturally model dependencies using a Gaussian process prior. We present a no-regret analysis for multi-dueling setting, and demonstrate the effectiveness of our algorithm empirically on a wide range of simulation settings.",
        "doi": "10.48550/arXiv.1705.00253",
        "publisher": "arXiv",
        "publication_date": "2017-04-29"
    }
]