Itai Gat

I am a Ph.D. student under the supervision of Professor Tamir Hazan at the Technion - Israel Institute of Technology. I graduated with my BSc degree in Data Science and Engineering from the Technion. My research is focused on the perception of deep learning classifiers, primarily with applications to multi-modal tasks. On a variety of well-known benchmarks, we discover surprising insights and achieve state-of-the-art results.



itaigat dot mail at gmail dot com

Publications

    2024

  1. Discrete Flow Matching (Spotlight). Itai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman. Advances in Neural Information Processing Systems (NeurIPS), 2024
  2. [ PDF, BibTeX ]
  3. Generator Matching: Generative modeling with arbitrary Markov processes. Peter Holderrieth, Marton Havasi, Jason Yim, Neta Shaul, Itai Gat, Tommi Jaakkola, Brian Karrer, Ricky TQ Chen, Yaron Lipman.
  4. [ PDF, BibTeX ]
  5. Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles. Buu Phan, Brandon Amos, Itai Gat, Marton Havasi, Matthew Muckley, Karen Ullrich.
  6. [ PDF, BibTeX ]
  7. The Llama 3 Herd of Models. Llama Team, AI @ Meta.
  8. [ PDF, Code and Models, Website, BibTeX ]
  9. D-Flow: Differentiating through Flows for Controlled Generation. Heli Ben-Hamu, Omri Puny, Itai Gat, Brian Karrer, Uriel Singer, Yaron Lipman. International Conference on Machine Learning (ICML), 2024
  10. [ PDF, BibTeX ]
  11. SpiRit-LM: Interleaved Spoken and Written Language Model. Tu Anh Nguyen, Benjamin Muller, Bokai Yu, Marta R. Costa-jussa, Maha Elbayad, Sravya Popuri, Paul-Ambroise Duquenne, Robin Algayres, Ruslan Mavlyutov, Itai Gat, Gabriel Synnaeve, Juan Pino, Benoit Sagot, Emmanuel Dupoux. Transactions of the Association for Computational Linguistics (TACL), 2024
  12. [ PDF, Website, BibTeX ]
  13. Masked Audio Generation using a Single Non-Autoregressive Transformer. Alon Ziv, Itai Gat, Gael Le Lan, Tal Remez, Felix Kreuk, Alexandre Defossez, Jade Copet, Gabriel Synnaeve, Yossi Adi. International Conference on Learning Representations (ICLR), 2024
  14. [ PDF, Code and Models, Website, BibTeX ]
  15. Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation. Or Tal, Alon Ziv, Itai Gat, Felix Kreuk, Yossi Adi. International Society for Music Information Retrieval (ISMIR)
  16. [ PDF, Code and Models, Website, BibTeX ]
  17. Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation. Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz, Yossi Adi. The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024
  18. [ PDF, Code and Models, Website, BibTeX ]
  19. Layer Collaboration in the Forward-Forward Algorithm. Guy Lorberbom*, Itai Gat*, Yossi Adi, Alex Schwing, Tamir Hazan. The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI), 2024
  20. [ PDF, BibTeX ]

    2023

  21. Code Llama: Open Foundation Models for Code. Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, Jingyu Liu, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom, Gabriel Synnaeve. arXiv, 2023
  22. [ PDF, Code and Models, Blog, BibTeX ]
  23. Simple and Controllable Music Generation. Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez. Advances in Neural Information Processing Systems (NeurIPS), 2023
  24. [ PDF, Demo, Code, BibTeX ]
  25. Textually Pretrained Speech Language Models. Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Defossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi. Advances in Neural Information Processing Systems (NeurIPS), 2023
  26. [ PDF, Samples, BibTeX ]
  27. Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. Tu Anh Nguyen, Wei-Ning Hsu, Antony d'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux. International Speech Communication Association (Interspeech), 2023
  28. [ PDF, BibTeX ]
  29. AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation. Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz. International Speech Communication Association (Interspeech), 2023
  30. [ PDF, Page, Code, BibTeX ]
  31. Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling (Oral). Itai Gat, Felix Kreuk, Tu Anh Nguyen, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi. International Conference on Spoken Language Translation (IWSLT), 2023
  32. [ PDF, BibTeX ]

    2022

  33. On the Importance of Gradient Norm in PAC-Bayesian Bounds. Itai Gat, Yossi Adi, Alex Schwing, Tamir Hazan. Advances in Neural Information Processing Systems (NeurIPS), 2022
  34. [ PDF, BibTeX ]
  35. On The Robustness of Self-Supervised Representations for Spoken Language Modeling. Itai Gat, Felix Kreuk, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi. arXiv, 2022
  36. [ PDF, BibTeX ]
  37. A Functional Information Perspective on Model Interpretation. Itai Gat, Nitay Calderon, Roi Reichart, Tamir Hazan. Proceedings of the International Conference on Machine Learning (ICML), 2022
  38. [ PDF, Code, BibTeX ]
  39. Speech Emotion Recognition using Self-Supervised Features. Edmilson Morais, Ron Hoory, Weizhong Zhu, Itai Gat, Matheus Damasceno, Hagai Aronowitz. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
  40. [ PDF, BibTeX ]
  41. Speaker Normalization for Self-supervised Speech Emotion Recognition. Itai Gat, Hagai Aronowitz, Weizhong Zhu, Edmilson Morais, Ron Hoory. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
  42. [ PDF, BibTeX ]
  43. Towards a Common Speech Analysis Engine. Hagai Aronowitz, Itai Gat, Edmilson Morais, Weizhong Zhu, Ron Hoory. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
  44. [ PDF, BibTeX ]

    2021

  45. Latent Space Explanation by Intervention. Itai Gat*, Guy Lorberbom*, Idan Schwartz, Tamir Hazan. Proceedings of the AAAI Conference on Artificial Intelligence, 2021
  46. [ PDF, BibTeX ]
  47. Perceptual Score: What Data Modalities Does Your Model Perceive?. Itai Gat, Idan Schwartz, Alexander Schwing. Advances in Neural Information Processing Systems (NeurIPS), 2021
  48. [ PDF, Code, BibTeX ]
  49. Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions. Daniel Rosenberg, Itai Gat, Amir Feder, Roi Reichart. Association for Computational Linguistics (ACL), 2021
  50. [ PDF, Page, BibTeX ]

    2020

  51. Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies. Itai Gat, Idan Schwartz, Alexander Schwing, Tamir Hazan. Advances in Neural Information Processing Systems (NeurIPS), 2020
  52. [ PDF, Code, BibTeX ]