Publications

Year 2

Peer-Reviewed Journals

Mao, J., Rothkopf, C. A., & Stocker, A. A. (2025). Adaptation optimizes sensory encoding for future stimuli. PLOS Computational Biology, 21(1), e1012746. https://doi.org/10.1371/journal.pcbi.1012746

Zolfaghari, H., Ebrahimi, N., Ji, Y., Pitkow, X., & Davoodi, M. (2025). Integrated Analytical Modeling and Numerical Simulation Framework for Design Optimization of Electromagnetic Soft Actuators. Actuators, 14(3), 128. https://doi.org/10.3390/act14030128

Peer-reviewed Conference Publications

Ananthram, A., Pantalony, R., Bradford, L., Demarest, J., Purvis, A., Krut, K., Stein, R., & McKeown, K. (2025, April). A Benchmark and a Metric for Granular and Coarse Detailed Caption Evaluation. ICLR 2025, Singapore.

Ananthram, A., Stengel-Eskin, E., Vondrick, C., Bansal, M., & McKeown, K. (2025, April). Try Seeing It from My Perspective: Characterizing Western Bias in Image Understanding in Large Multimodal Models. ICLR 2025, Singapore.

Azabou, M., Pan, K. X., Arora, V., Knight, I. J., Dyer, E. L., & Richards, B. A. (2024a, October 4). Multi-session, multi-task neural decoding from distinct cell-types and brain regions. The Thirteenth International Conference on Learning Representations. ICLR 2025. https://openreview.net/forum?id=IuU0wcO0mo

Bencomo, G., Gupta, M., Marinescu, I., McCoy, R. T., & Griffiths, T. L. (2025). Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks (No. arXiv:2502.20237). Proceedings of the 47th Annual Meeting of the Cognitive Science Society (CogSci 2025). arXiv. https://doi.org/10.48550/arXiv.2502.20237

Deng, Z., Zollo, T., Eyre, B., Inamdar, A., Madras, D., & Zemel, R. (2025). QuEst: Enhancing Estimates of Quantile-Based Distributional Measures Using Model Predictions. International Conference on Machine Learning (ICML), Vancouver.

Jesson, A., Beltran-Velez, N., & Blei, D. (2024). Can Generative AI Solve Your In-Context Learning Problem? A Martingale Perspective (No. arXiv:2412.06033). arXiv. ICLR, 2025. https://doi.org/10.48550/arXiv.2412.06033

Jesson, A., Beltran-Velez, N., Chu, Q., Karlekar, S., Kossen, J., Gal, Y., Cunningham, J. P., & Blei, D. (2024). Estimating the Hallucination Rate of Generative AI (No. arXiv:2406.07457). arXiv. NeurIPS, 2024. https://doi.org/10.48550/arXiv.2406.07457

Liang, J., Liu, R., Ozguroglu, E., Sudhakar, S., Dave, A., Tokmakov, P., Song, S., & Vondrick, C. (2024). Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (No. arXiv:2406.16862). arXiv. CoRL 2024. https://doi.org/10.48550/arXiv.2406.16862

Rooke, S., Wang, Z., Tullio, R. W. D., & Balasubramanian, V. (2024, November 6). Trading Place for Space: Increasing Location Resolution Reduces Contextual Capacity in Hippocampal Codes. The Thirty-eighth Annual Conference on Neural Information Processing Systems. NeurIPS 2024.https://openreview.net/forum?id=REIK4SZMJt

Siam, M. A., & Fahim, D. (2025). Self-Supervised Learning for SAR Target Recognition with Multi-Task Pretext Training. 1207–1213. https://doi.org/10.1109/SoutheastCon56624.2025.10971440

Tyulina, N., Emmanouil, T. A., & Levitan, S. I. (2024). Understanding Linguistic and Visual Factors that Affect Human Trust Perception of Virtual Agents. ACM Conversational User Interfaces 2024, 1–6. https://doi.org/10.1145/3640794.3665581

Wang, Z., Tullio, R. W. D., Rooke, S., & Balasubramanian, V. (2025). Time Makes Space: Emergence of Place Fields in Networks Encoding Temporally Continuous Sensory Experiences. NeurIPS 2024. (arXiv:2408.05798). arXiv. NeurIPS 2024. https://doi.org/10.48550/arXiv.2408.05798

Wang, J., Zollo, T., Zemel, R., & Namkoong, H. (2025). Adaptive Elicitation of Latent Information Using Natural Language (No. arXiv:2504.04204). arXiv. https://doi.org/10.48550/arXiv.2504.04204

Xu, K., Zhang, L., & Shi, J. (2024a). Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond (arXiv:2403.19653). arXiv. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025. https://doi.org/10.48550/arXiv.2403.19653

Xu, K., Zhang, L., & Shi, J. (2024b). Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models (arXiv:2405.14828). arXiv. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025. https://doi.org/10.48550/arXiv.2405.14828

Zhang, Y., Wang, Y., Jimenez-Beneto, D., Wang, Z., Azabou, M., Richards, B., Winter, O., Laboratory, I. B., Dyer, E., Paninski, L., & Hurwitz, C. (2024). Towards a “universal translator” for neural dynamics at single-cell, single-spike resolution (No. arXiv:2407.14668). arXiv. NeurIPS 2024. https://doi.org/10.48550/arXiv.2407.14668

Other Conference Publications/Presentations

Abe, E., Zhang, C., Chhibber, R., Chou, G., Foat, J., Truong, D., Olveczky, B., Sniadecki, N., Tuthill, J., Pereira, T., & Brunton, B. (2025). Deep imitation learning for neuromechanical control: Realistic walking in an embodied fly. COSYNE, Montreal, QC.

Zhang, C., Abe, E., Foat, J., Burton, B., Pereira, T., Olveczky, B., & Warnberg, E. (2025.). A GPU-Accelerated Deep Reinforcement Learning Pipeline for Simulating Animal Behavior. COSYNE, Montreal, QC.

Preprint

Ding, Z., Tran, D. T., Ponder, K., Cobos, E., Ding, Z., Fahey, P. G., Wang, E., Muhammad, T., Fu, J., Cadena, S. A., Papadopoulos, S., Patel, S., Franke, K., Reimer, J., Sinz, F. H., Ecker, A. S., Pitkow, X., & Tolias, A. S. (2023). Bipartite invariance in mouse primary visual cortex. bioRxiv. Preprint. https://doi.org/10.1101/2023.03.15.532836
Olivos-Castillo, I., Schrater, P., & Pitkow, X. (2024). Control when confidence is costly (arXiv:2406.14427). arXiv. Preprint. https://doi.org/10.48550/arXiv.2406.14427

Ramesh, R., Bisulco, A., DiTullio, R. W., Wei, L., Balasubramanian, V., Daniilidis, K., & Chaudhari, P. (2024). Many Perception Tasks are Highly Redundant Functions of their Input Data (arXiv:2407.13841). arXiv. https://doi.org/10.48550/arXiv.2407.13841
Yang, H., Gee, J., & Shi, J. (2024). AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space (arXiv:2406.18344). arXiv. Preprint. https://doi.org/10.48550/arXiv.2406.18344

Zhang, Y., Wang, Y., Azabou, M., Andre, A., Wang, Z., Lyu, H., Laboratory, T. I. B., Dyer, E., Paninski, L., & Hurwitz, C. (2025). Neural Encoding and Decoding at Scale (No. arXiv:2504.08201). arXiv. Preprint. https://doi.org/10.48550/arXiv.2504.08201

Under Review

Chiquier, M., Avrech, O., Gandelsman, Y., Feng, B., Bouman, K., & Vondrick, C. (2025). Teaching Humans Subtle Differences with DIFFusion (arXiv:2504.08046). arXiv. In submission to ICCV 2025. https://doi.org/10.48550/arXiv.2504.08046
Lu, A., Liao, W., Yang, H., & Shi, J. (Under Review). Artifacts and Attention Sinks: Structured Approximations for Efficient Vision Transformers. In submission to ICCV 2025.

Yang, H., Xu, K., Shi, J., Grossberg, M. D., & Bai, Y. (Under Review). “I Know It When I See It”: Mood Spaces for Connecting and Expressing Visual Concepts".

Year 1

Peer-Reviewed Journals

H. Yang, J. Gee and J. Shi, "Brain Decodes Deep Nets," 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 2024, pp. 23030-23040, doi: 10.1109/CVPR52733.2024.02173.

Webb, T. W., Frankland, S. M., Altabaa, A., Segert, S., Krishnamurthy, K., Campbell, D., Russin, J., Giallanza, T., O’Reilly, R., Lafferty, J., & Cohen, J. D. (2024). The relational bottleneck as an inductive bias for efficient abstraction. Trends in Cognitive Sciences, 28(9), 829–843. https://doi.org/10.1016/j.tics.2024.04.00

Peer-reviewed Publications

Altabaa, A., & Lafferty, J. (2024b). Learning Hierarchical Relational Representations through Relational Convolutions (arXiv:2310.03240). arXiv. http://arxiv.org/abs/2310.03240
https://doi.org/10.48550/arXiv.2310.03240

Altabaa, A., Webb, T., Cohen, J., & Lafferty, J. (2023). Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers (arXiv:2304.00195). arXiv. http://arxiv.org/abs/2304.00195
https://doi.org/10.48550/arXiv.2304.00195

Ananthram, A., Stengel-Eskin, E., Vondrick, C., Bansal, M., & McKeown, K. (2024). Try Seeing It from My Perspective: Characterizing Western Bias in Image Understanding in Large Multimodal Models. Transactions of the Association for Computational Linguistics (TACL).

Chiquier, M., Mall, U., & Vondrick, C. (2024). Evolving Interpretable Visual Classifiers with Large Language Models. https://doi.org/10.48550/ARXIV.2404.09941. (In submission to European Conference on Computer Vision (ECCV) 2024)

Eyre, B., Creager, E., Madras, D., Papyan, V., & Zemel, R. (2023). Out of the Ordinary: Spectrally Adapting Regression for Covariate Shift (arXiv:2312.17463). arXiv. http://arxiv.org/abs/2312.17463
https://doi.org/10.48550/arXiv.2312.17463

Tyulina, N., Emmanouil, T. A., & Levitan, S. I. 2024. ACM Conversational User Interfaces 2024. In Understanding Linguistic and Visual Factors that Affect Human Trust Perception of Virtual Agents. Luxembourg City.

McGaughey, K. D., & Gold, J. (2023). Neuroscience 2023. Contributions of sensory adaptation and pupil-linked arousal to perceptual decisions about uncertain and unstable visual stimuli. Society for Neuroscience. (poster presentation)

Under Review

Altabaa, A., & Lafferty, J. (2024a). Approximation of relation functions and attention mechanisms (arXiv:2402.08856). arXiv. http://arxiv.org/abs/2402.08856
https://doi.org/10.48550/arXiv.2402.08856

Mahdaviyeh, Y., Lucas, J., Ren. M., Tolias, A., Zemel, R., Pitassi,. T. (2024). Replay Can Probably Increase Forgetting. Submitted to NeurIPS.