Publications
Year 2
Peer-Reviewed Journals
- Mao, J., Rothkopf, C. A., & Stocker, A. A. (2025). Adaptation optimizes sensory encoding for future stimuli. PLOS Computational Biology, 21(1), e1012746. https://doi.org/10.1371/journal.pcbi.1012746
- Zolfaghari, H., Ebrahimi, N., Ji, Y., Pitkow, X., & Davoodi, M. (2025). Integrated Analytical Modeling and Numerical Simulation Framework for Design Optimization of Electromagnetic Soft Actuators. Actuators, 14(3), 128. https://doi.org/10.3390/act14030128
Peer-reviewed Conference Publications
- Ananthram, A., Pantalony, R., Bradford, L., Demarest, J., Purvis, A., Krut, K., Stein, R., & McKeown, K. (2025, April). A Benchmark and a Metric for Granular and Coarse Detailed Caption Evaluation. ICLR 2025, Singapore.
- Ananthram, A., Stengel-Eskin, E., Vondrick, C., Bansal, M., & McKeown, K. (2025, April). Try Seeing It from My Perspective: Characterizing Western Bias in Image Understanding in Large Multimodal Models. ICLR 2025, Singapore.
- Azabou, M., Pan, K. X., Arora, V., Knight, I. J., Dyer, E. L., & Richards, B. A. (2024a, October 4). Multi-session, multi-task neural decoding from distinct cell-types and brain regions. The Thirteenth International Conference on Learning Representations. ICLR 2025. https://openreview.net/forum?id=IuU0wcO0mo
- Bencomo, G., Gupta, M., Marinescu, I., McCoy, R. T., & Griffiths, T. L. (2025). Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks (No. arXiv:2502.20237). Proceedings of the 47th Annual Meeting of the Cognitive Science Society (CogSci 2025). arXiv. https://doi.org/10.48550/arXiv.2502.20237
- Deng, Z., Zollo, T., Eyre, B., Inamdar, A., Madras, D., & Zemel, R. (2025). QuEst: Enhancing Estimates of Quantile-Based Distributional Measures Using Model Predictions. International Conference on Machine Learning (ICML), Vancouver.
- Jesson, A., Beltran-Velez, N., & Blei, D. (2024). Can Generative AI Solve Your In-Context Learning Problem? A Martingale Perspective (No. arXiv:2412.06033). arXiv. ICLR, 2025. https://doi.org/10.48550/arXiv.2412.06033
- Jesson, A., Beltran-Velez, N., Chu, Q., Karlekar, S., Kossen, J., Gal, Y., Cunningham, J. P., & Blei, D. (2024). Estimating the Hallucination Rate of Generative AI (No. arXiv:2406.07457). arXiv. NeurIPS, 2024. https://doi.org/10.48550/arXiv.2406.07457
- Liang, J., Liu, R., Ozguroglu, E., Sudhakar, S., Dave, A., Tokmakov, P., Song, S., & Vondrick, C. (2024). Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (No. arXiv:2406.16862). arXiv. CoRL 2024. https://doi.org/10.48550/arXiv.2406.16862
- Rooke, S., Wang, Z., Tullio, R. W. D., & Balasubramanian, V. (2024, November 6). Trading Place for Space: Increasing Location Resolution Reduces Contextual Capacity in Hippocampal Codes. The Thirty-eighth Annual Conference on Neural Information Processing Systems. NeurIPS 2024.https://openreview.net/forum?id=REIK4SZMJt
- Siam, M. A., & Fahim, D. (2025). Self-Supervised Learning for SAR Target Recognition with Multi-Task Pretext Training. 1207–1213. https://doi.org/10.1109/SoutheastCon56624.2025.10971440
- Tyulina, N., Emmanouil, T. A., & Levitan, S. I. (2024). Understanding Linguistic and Visual Factors that Affect Human Trust Perception of Virtual Agents. ACM Conversational User Interfaces 2024, 1–6. https://doi.org/10.1145/3640794.3665581
- Wang, Z., Tullio, R. W. D., Rooke, S., & Balasubramanian, V. (2025). Time Makes Space: Emergence of Place Fields in Networks Encoding Temporally Continuous Sensory Experiences. NeurIPS 2024. (arXiv:2408.05798). arXiv. NeurIPS 2024. https://doi.org/10.48550/arXiv.2408.05798
- Wang, J., Zollo, T., Zemel, R., & Namkoong, H. (2025). Adaptive Elicitation of Latent Information Using Natural Language (No. arXiv:2504.04204). arXiv. https://doi.org/10.48550/arXiv.2504.04204
- Xu, K., Zhang, L., & Shi, J. (2024a). Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond (arXiv:2403.19653). arXiv. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025. https://doi.org/10.48550/arXiv.2403.19653
- Xu, K., Zhang, L., & Shi, J. (2024b). Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models (arXiv:2405.14828). arXiv. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025. https://doi.org/10.48550/arXiv.2405.14828
- Zhang, Y., Wang, Y., Jimenez-Beneto, D., Wang, Z., Azabou, M., Richards, B., Winter, O., Laboratory, I. B., Dyer, E., Paninski, L., & Hurwitz, C. (2024). Towards a “universal translator” for neural dynamics at single-cell, single-spike resolution (No. arXiv:2407.14668). arXiv. NeurIPS 2024. https://doi.org/10.48550/arXiv.2407.14668
Other Conference Publications/Presentations
- Abe, E., Zhang, C., Chhibber, R., Chou, G., Foat, J., Truong, D., Olveczky, B., Sniadecki, N., Tuthill, J., Pereira, T., & Brunton, B. (2025). Deep imitation learning for neuromechanical control: Realistic walking in an embodied fly. COSYNE, Montreal, QC.
- Zhang, C., Abe, E., Foat, J., Burton, B., Pereira, T., Olveczky, B., & Warnberg, E. (2025.). A GPU-Accelerated Deep Reinforcement Learning Pipeline for Simulating Animal Behavior. COSYNE, Montreal, QC.
Preprint
- Ding, Z., Tran, D. T., Ponder, K., Cobos, E., Ding, Z., Fahey, P. G., Wang, E., Muhammad, T., Fu, J., Cadena, S. A., Papadopoulos, S., Patel, S., Franke, K., Reimer, J., Sinz, F. H., Ecker, A. S., Pitkow, X., & Tolias, A. S. (2023). Bipartite invariance in mouse primary visual cortex. bioRxiv. Preprint. https://doi.org/10.1101/2023.03.15.532836
- Olivos-Castillo, I., Schrater, P., & Pitkow, X. (2024). Control when confidence is costly (arXiv:2406.14427). arXiv. Preprint. https://doi.org/10.48550/arXiv.2406.14427
- Ramesh, R., Bisulco, A., DiTullio, R. W., Wei, L., Balasubramanian, V., Daniilidis, K., & Chaudhari, P. (2024). Many Perception Tasks are Highly Redundant Functions of their Input Data (arXiv:2407.13841). arXiv. https://doi.org/10.48550/arXiv.2407.13841
- Yang, H., Gee, J., & Shi, J. (2024). AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space (arXiv:2406.18344). arXiv. Preprint. https://doi.org/10.48550/arXiv.2406.18344
- Zhang, Y., Wang, Y., Azabou, M., Andre, A., Wang, Z., Lyu, H., Laboratory, T. I. B., Dyer, E., Paninski, L., & Hurwitz, C. (2025). Neural Encoding and Decoding at Scale (No. arXiv:2504.08201). arXiv. Preprint. https://doi.org/10.48550/arXiv.2504.08201
Under Review
- Chiquier, M., Avrech, O., Gandelsman, Y., Feng, B., Bouman, K., & Vondrick, C. (2025). Teaching Humans Subtle Differences with DIFFusion (arXiv:2504.08046). arXiv. In submission to ICCV 2025. https://doi.org/10.48550/arXiv.2504.08046
- Lu, A., Liao, W., Yang, H., & Shi, J. (Under Review). Artifacts and Attention Sinks: Structured Approximations for Efficient Vision Transformers. In submission to ICCV 2025.
- Yang, H., Xu, K., Shi, J., Grossberg, M. D., & Bai, Y. (Under Review). “I Know It When I See It”: Mood Spaces for Connecting and Expressing Visual Concepts".
Year 1
Peer-Reviewed Journals
- H. Yang, J. Gee and J. Shi, "Brain Decodes Deep Nets," 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 2024, pp. 23030-23040, doi: 10.1109/CVPR52733.2024.02173.
- Webb, T. W., Frankland, S. M., Altabaa, A., Segert, S., Krishnamurthy, K., Campbell, D., Russin, J., Giallanza, T., O’Reilly, R., Lafferty, J., & Cohen, J. D. (2024). The relational bottleneck as an inductive bias for efficient abstraction. Trends in Cognitive Sciences, 28(9), 829–843. https://doi.org/10.1016/j.tics.2024.04.00
Peer-reviewed Publications
- Altabaa, A., & Lafferty, J. (2024b). Learning Hierarchical Relational Representations through Relational Convolutions (arXiv:2310.03240). arXiv. http://arxiv.org/abs/2310.03240
https://doi.org/10.48550/arXiv.2310.03240
- Altabaa, A., Webb, T., Cohen, J., & Lafferty, J. (2023). Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers (arXiv:2304.00195). arXiv. http://arxiv.org/abs/2304.00195
https://doi.org/10.48550/arXiv.2304.00195
- Ananthram, A., Stengel-Eskin, E., Vondrick, C., Bansal, M., & McKeown, K. (2024). Try Seeing It from My Perspective: Characterizing Western Bias in Image Understanding in Large Multimodal Models. Transactions of the Association for Computational Linguistics (TACL).
- Chiquier, M., Mall, U., & Vondrick, C. (2024). Evolving Interpretable Visual Classifiers with Large Language Models. https://doi.org/10.48550/ARXIV.2404.09941. (In submission to European Conference on Computer Vision (ECCV) 2024)
- Eyre, B., Creager, E., Madras, D., Papyan, V., & Zemel, R. (2023). Out of the Ordinary: Spectrally Adapting Regression for Covariate Shift (arXiv:2312.17463). arXiv. http://arxiv.org/abs/2312.17463
https://doi.org/10.48550/arXiv.2312.17463
- Tyulina, N., Emmanouil, T. A., & Levitan, S. I. 2024. ACM Conversational User Interfaces 2024. In Understanding Linguistic and Visual Factors that Affect Human Trust Perception of Virtual Agents. Luxembourg City.
- McGaughey, K. D., & Gold, J. (2023). Neuroscience 2023. Contributions of sensory adaptation and pupil-linked arousal to perceptual decisions about uncertain and unstable visual stimuli. Society for Neuroscience. (poster presentation)
Under Review
- Altabaa, A., & Lafferty, J. (2024a). Approximation of relation functions and attention mechanisms (arXiv:2402.08856). arXiv. http://arxiv.org/abs/2402.08856
https://doi.org/10.48550/arXiv.2402.08856
- Mahdaviyeh, Y., Lucas, J., Ren. M., Tolias, A., Zemel, R., Pitassi,. T. (2024). Replay Can Probably Increase Forgetting. Submitted to NeurIPS.