<h1>Gkioxari, Georgia</h1> <h2>Monograph from <a href="https://authors.library.caltech.edu">CaltechAUTHORS</a></h2> <ul> <li>Wu, Jane and Pavlakos, Georgios, el al. (2024) <a href="https://authors.library.caltech.edu/records/0n803-rzq26">Reconstructing Hand-Held Objects in 3D</a>; <a href="https://doi.org/10.48550/arxiv.2404.06507">10.48550/arxiv.2404.06507</a></li> <li>Israel, Uriah and Marks, Markus, el al. (2024) <a href="https://authors.library.caltech.edu/records/47sqx-33w78">A Foundation Model for Cell Segmentation</a>; bioRvix; 2023.11.17.567630; PMCID PMC10690226; <a href="https://doi.org/10.1101/2023.11.17.567630">10.1101/2023.11.17.567630</a></li> <li>Talukder, Sabera and Yue, Yisong, el al. (2024) <a href="https://authors.library.caltech.edu/records/stvvz-was45">TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis</a>; <a href="https://doi.org/10.48550/arxiv.2402.16412">10.48550/arxiv.2402.16412</a></li> <li>Wu, Chao-Yuan and Johnson, Justin, el al. (2023) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20230316-204045919">Multiview Compressive Coding for 3D Reconstruction</a></li> <li>Sun, Jennifer J. and Karashchuk, Pierre, el al. (2022) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204745839">BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos</a>; <a href="https://doi.org/10.48550/arXiv.2212.07401">10.48550/arXiv.2212.07401</a></li> <li>Brazil, Garrick and Straub, Julian, el al. (2022) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204749212">Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild</a>; <a href="https://doi.org/10.48550/arXiv.2207.10660">10.48550/arXiv.2207.10660</a></li> <li>Gkioxari, Georgia and Ravi, Nikhila, el al. (2022) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204752587">Learning 3D Object Shape and Layout without 3D Supervision</a>; <a href="https://doi.org/10.48550/arXiv.2206.07028">10.48550/arXiv.2206.07028</a></li> <li>Qian, Shengyi and Kirillov, Alexander, el al. (2021) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204759329">Recognizing Scenes from Novel Viewpoints</a>; <a href="https://doi.org/10.48550/arXiv.2112.01520">10.48550/arXiv.2112.01520</a></li> <li>Goel, Shubham and Gkioxari, Georgia, el al. (2021) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204755957">Differentiable Stereopsis: Meshes from multiple views using differentiable rendering</a>; <a href="https://doi.org/10.48550/arXiv.2110.05472">10.48550/arXiv.2110.05472</a></li> <li>Ravi, Nikhila and Reizenstein, Jeremy, el al. (2020) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204802712">Accelerating 3D Deep Learning with PyTorch3D</a>; <a href="https://doi.org/10.48550/arXiv.2007.08501">10.48550/arXiv.2007.08501</a></li> <li>Smith, Edward J. and Calandra, Roberto, el al. (2020) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204806086">3D Shape Reconstruction from Vision and Touch</a>; <a href="https://doi.org/10.48550/arXiv.2007.03778">10.48550/arXiv.2007.03778</a></li> <li>Wiles, Olivia and Gkioxari, Georgia, el al. (2019) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204809456">SynSin: End-to-end View Synthesis from a Single Image</a>; <a href="https://doi.org/10.48550/arXiv.1912.08804">10.48550/arXiv.1912.08804</a></li> <li>Gkioxari, Georgia and Malik, Jitendra, el al. (2019) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-223825499">Mesh R-CNN</a>; <a href="https://doi.org/10.48550/arXiv.1906.02739">10.48550/arXiv.1906.02739</a></li> <li>Yu, Licheng and Chen, Xinlei, el al. (2019) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204819572">Multi-Target Embodied Question Answering</a>; <a href="https://doi.org/10.48550/arXiv.1904.04686">10.48550/arXiv.1904.04686</a></li> <li>Wijmans, Erik and Datta, Samyak, el al. (2019) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204816196">Embodied Question Answering in Photorealistic Environments with Point Cloud Perception</a>; <a href="https://doi.org/10.48550/arXiv.1904.03461">10.48550/arXiv.1904.03461</a></li> <li>Das, Abhishek and Gkioxari, Georgia, el al. (2018) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204822934">Neural Modular Control for Embodied Question Answering</a>; <a href="https://doi.org/10.48550/arXiv.1810.11181">10.48550/arXiv.1810.11181</a></li> <li>Wu, Yi and Wu, Yuxin, el al. (2018) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204826312">Building Generalizable Agents with a Realistic and Rich 3D Environment</a>; <a href="https://doi.org/10.48550/arXiv.1801.02209">10.48550/arXiv.1801.02209</a></li> <li>Girdhar, Rohit and Gkioxari, Georgia, el al. (2017) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204836433">Detect-and-Track: Efficient Pose Estimation in Videos</a>; <a href="https://doi.org/10.48550/arXiv.1712.09184">10.48550/arXiv.1712.09184</a></li> <li>Radosavovic, Ilija and Dollár, Piotr, el al. (2017) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204839800">Data Distillation: Towards Omni-Supervised Learning</a>; <a href="https://doi.org/10.48550/arXiv.1712.04440">10.48550/arXiv.1712.04440</a></li> <li>Das, Abhishek and Datta, Samyak, el al. (2017) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204833051">Embodied Question Answering</a>; <a href="https://doi.org/10.48550/arXiv.1711.11543">10.48550/arXiv.1711.11543</a></li> <li>Gkioxari, Georgia and Girshick, Ross, el al. (2017) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204829682">Detecting and Recognizing Human-Object Interactions</a></li> <li>He, Kaiming and Gkioxari, Georgia, el al. (2017) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204843170">Mask R-CNN</a>; <a href="https://doi.org/10.48550/arXiv.1703.06870">10.48550/arXiv.1703.06870</a></li> <li>Gkioxari, Georgia and Toshev, Alexander, el al. (2016) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204846541">Chained Predictions Using Convolutional Neural Networks</a>; <a href="https://doi.org/10.48550/arXiv.1605.02346">10.48550/arXiv.1605.02346</a></li> <li>Gkioxari, Georgia and Girshick, Ross, el al. (2015) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204849905">Contextual Action Recognition with R*CNN</a>; <a href="https://doi.org/10.48550/arXiv.1505.01197">10.48550/arXiv.1505.01197</a></li> <li>Gkioxari, Georgia and Girshick, Ross, el al. (2014) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204853268">Actions and Attributes from Wholes and Parts</a>; <a href="https://doi.org/10.48550/arXiv.1412.2604">10.48550/arXiv.1412.2604</a></li> <li>Gkioxari, Georgia and Malik, Jitendra (2014) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204856628">Finding Action Tubes</a>; <a href="https://doi.org/10.48550/arXiv.1411.6031">10.48550/arXiv.1411.6031</a></li> <li>Gkioxari, Georgia and Hariharan, Bharath, el al. (2014) <a href="https://resolver.caltech.edu/CaltechAUTHORS:20221219-204859995">R-CNNs for Pose Estimation and Action Detection</a>; <a href="https://doi.org/10.48550/arXiv.1406.5212">10.48550/arXiv.1406.5212</a></li> </ul>