Itai Gat

I am a Ph.D. student at the Technion - Israel Institute of Technology, supervised by Professor Tamir Hazan. I received my BSc in Data Science and Engineering from the Technion. My research focuses on the perception of deep learning classifiers, primarily with applications to multi-modal tasks. Across a variety of well-known benchmarks, we uncover surprising insights and achieve state-of-the-art results.



itaigat dot mail at gmail dot com

Publications

    2024

  1. Masked Audio Generation using a Single Non-Autoregressive Transformer. Alon Ziv, Itai Gat, Gael Le Lan, Tal Remez, Felix Kreuk, Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi. International Conference on Learning Representations (ICLR), 2024
     [ PDF, Code and Models, Website, BibTeX ]
  2. Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation. Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz, Yossi Adi. The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024
     [ PDF, Code and Models, Website, BibTeX ]
  3. Layer Collaboration in the Forward-Forward Algorithm. Guy Lorberbom*, Itai Gat*, Yossi Adi, Alex Schwing, Tamir Hazan. The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024
     [ PDF, BibTeX ]

    2023

  4. Code Llama: Open Foundation Models for Code. Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, Jingyu Liu, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom, Gabriel Synnaeve. arXiv, 2023
     [ PDF, Code and Models, Blog, BibTeX ]
  5. Simple and Controllable Music Generation. Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez. Advances in Neural Information Processing Systems (NeurIPS), 2023
     [ PDF, Demo, Code, BibTeX ]
  6. Textually Pretrained Speech Language Models. Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi. Advances in Neural Information Processing Systems (NeurIPS), 2023
     [ PDF, Samples, BibTeX ]
  7. Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. Tu Anh Nguyen, Wei-Ning Hsu, Antony d'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux. International Speech Communication Association (Interspeech), 2023
     [ PDF, BibTeX ]
  8. AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation. Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz. International Speech Communication Association (Interspeech), 2023
     [ PDF, Page, Code, BibTeX ]
  9. Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling (Oral). Itai Gat, Felix Kreuk, Tu Anh Nguyen, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi. International Conference on Spoken Language Translation (IWSLT), 2023
     [ PDF, BibTeX ]

    2022

  10. On the Importance of Gradient Norm in PAC-Bayesian Bounds. Itai Gat, Yossi Adi, Alex Schwing, Tamir Hazan. Advances in Neural Information Processing Systems (NeurIPS), 2022
      [ PDF, BibTeX ]
  11. On The Robustness of Self-Supervised Representations for Spoken Language Modeling. Itai Gat, Felix Kreuk, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi. arXiv, 2022
      [ PDF, BibTeX ]
  12. A Functional Information Perspective on Model Interpretation. Itai Gat, Nitay Calderon, Roi Reichart, Tamir Hazan. Proceedings of the International Conference on Machine Learning (ICML), 2022
      [ PDF, Code, BibTeX ]
  13. Speech Emotion Recognition using Self-Supervised Features. Edmilson Morais, Ron Hoory, Weizhong Zhu, Itai Gat, Matheus Damasceno, Hagai Aronowitz. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
      [ PDF, BibTeX ]
  14. Speaker Normalization for Self-supervised Speech Emotion Recognition. Itai Gat, Hagai Aronowitz, Weizhong Zhu, Edmilson Morais, Ron Hoory. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
      [ PDF, BibTeX ]
  15. Towards a Common Speech Analysis Engine. Hagai Aronowitz, Itai Gat, Edmilson Morais, Weizhong Zhu, Ron Hoory. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
      [ PDF, BibTeX ]

    2021

  16. Latent Space Explanation by Intervention. Itai Gat*, Guy Lorberbom*, Idan Schwartz, Tamir Hazan. Proceedings of the AAAI Conference on Artificial Intelligence, 2021
      [ PDF, BibTeX ]
  17. Perceptual Score: What Data Modalities Does Your Model Perceive? Itai Gat, Idan Schwartz, Alexander Schwing. Advances in Neural Information Processing Systems (NeurIPS), 2021
      [ PDF, Code, BibTeX ]
  18. Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions. Daniel Rosenberg, Itai Gat, Amir Feder, Roi Reichart. Association for Computational Linguistics (ACL), 2021
      [ PDF, Page, BibTeX ]

    2020

  19. Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies. Itai Gat, Idan Schwartz, Alexander Schwing, Tamir Hazan. Advances in Neural Information Processing Systems (NeurIPS), 2020
      [ PDF, Code, BibTeX ]