Blaauw, Merlijn, and Jordi Bonada. 2017.
“A Neural Parametric Singing Synthesizer.” arXiv:1704.03809 [Cs], April.
Carr, C. J., and Zack Zukowski. 2018.
“Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands.” arXiv:1811.06633 [Cs, Eess], November.
Chen, Nanxin, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, and William Chan. 2020.
“WaveGrad: Estimating Gradients for Waveform Generation.” arXiv.
Dieleman, Sander, Aäron van den Oord, and Karen Simonyan. 2018.
“The Challenge of Realistic Music Generation: Modelling Raw Audio at Scale.” In
Advances In Neural Information Processing Systems, 11.
Du, Yilun, Katherine M. Collins, Joshua B. Tenenbaum, and Vincent Sitzmann. 2021.
“Learning Signal-Agnostic Manifolds of Neural Fields.” In
Advances in Neural Information Processing Systems.
Dupont, Emilien, Hyunjik Kim, S. M. Ali Eslami, Danilo Jimenez Rezende, and Dan Rosenbaum. 2022.
“From Data to Functa: Your Data Point Is a Function and You Can Treat It Like One.” In
Proceedings of the 39th International Conference on Machine Learning, 5694–5725. PMLR.
Elbaz, Dan, and Michael Zibulevsky. 2017.
“Perceptual Audio Loss Function for Deep Learning.” In
Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR’2017), Suzhou, China.
Engel, Jesse, Cinjon Resnick, Adam Roberts, Sander Dieleman, Douglas Eck, Karen Simonyan, and Mohammad Norouzi. 2017.
“Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders.” In
PMLR.
Goel, Karan, Albert Gu, Chris Donahue, and Christopher Ré. 2022.
“It’s Raw! Audio Generation with State-Space Models.” arXiv.
Grais, Emad M., Dominic Ward, and Mark D. Plumbley. 2018.
“Raw Multi-Channel Audio Source Separation Using Multi-Resolution Convolutional Auto-Encoders.” arXiv:1803.00702 [Cs], March.
Hernandez-Olivan, Carlos, Javier Hernandez-Olivan, and Jose R. Beltran. 2022.
“A Survey on Artificial Intelligence for Music Generation: Agents, Domains and Perspectives.” arXiv.
Kong, Zhifeng, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. 2021.
“DiffWave: A Versatile Diffusion Model for Audio Synthesis.” arXiv.
Kreuk, Felix, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, and Yossi Adi. 2022.
“AudioGen: Textually Guided Audio Generation.” arXiv.
Kreuk, Felix, Yaniv Taigman, Adam Polyak, Jade Copet, Gabriel Synnaeve, Alexandre Défossez, and Yossi Adi. 2022.
“Audio Language Modeling Using Perceptually-Guided Discrete Representations.” arXiv.
Lee, Junhyeok, and Seungu Han. 2021.
“NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling.” In
Interspeech 2021, 1634–38.
Liutkus, Antoine, Roland Badeau, and Gaël Richard. 2011.
“Gaussian Processes for Underdetermined Source Separation.” IEEE Transactions on Signal Processing 59 (7): 3155–67.
Luo, Andrew, Yilun Du, Michael J. Tarr, Joshua B. Tenenbaum, Antonio Torralba, and Chuang Gan. 2021.
“Learning Neural Acoustic Fields.” In.
Mehri, Soroush, Kundan Kumar, Ishaan Gulrajani, Rithesh Kumar, Shubham Jain, Jose Sotelo, Aaron Courville, and Yoshua Bengio. 2017.
“SampleRNN: An Unconditional End-to-End Neural Audio Generation Model.” In
Proceedings of International Conference on Learning Representations (ICLR) 2017.
Pascual, Santiago, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, and Joan Serrà. 2022.
“Full-Band General Audio Synthesis with Score-Based Diffusion.” arXiv.
Platen, Patrick von, Suraj Patil, Anton Lozhkov, Pedro Cuenca, Nathan Lambert, Kashif Rasul, Mishig Davaadorj, and Thomas Wolf. 2022.
“Diffusers: State-of-the-Art Diffusion Models.” GitHub.
Sarroff, Andy M., and Michael Casey. 2014.
“Musical Audio Synthesis Using Autoencoding Neural Nets.” In. Ann Arbor, MI: Michigan Publishing, University of Michigan Library.
Schlüter, J., and S. Böck. 2014.
“Improved Musical Onset Detection with Convolutional Neural Networks.” In
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6979–83.
Sprechmann, Pablo, Joan Bruna, and Yann LeCun. 2014.
“Audio Source Separation with Discriminative Scattering Networks.” arXiv:1412.7022 [Cs], December.
Stöter, Fabian-Robert, Stefan Uhlich, Antoine Liutkus, and Yuki Mitsufuji. 2019.
“Open-Unmix - A Reference Implementation for Music Source Separation.” Journal of Open Source Software 4 (41): 1667.
Tenenbaum, J. B., and W. T. Freeman. 2000.
“Separating Style and Content with Bilinear Models.” Neural Computation 12 (6): 1247–83.
Tzinis, Efthymios, Zhepei Wang, and Paris Smaragdis. 2020.
“Sudo Rm -Rf: Efficient Networks for Universal Audio Source Separation.” In, 6.
Venkataramani, Shrikant, and Paris Smaragdis. 2017.
“End to End Source Separation with Adaptive Front-Ends.” arXiv:1705.02514 [Cs], May.
Venkataramani, Shrikant, Y. Cem Subakan, and Paris Smaragdis. 2017.
“Neural Network Alternatives to Convolutive Audio Models for Source Separation.” arXiv:1709.07908 [Cs, Eess], September.
Verma, Prateek, and Julius O. Smith. 2018.
“Neural Style Transfer for Audio Spectograms.” In
31st Conference on Neural Information Processing Systems (NIPS 2017).
Wyse, L. 2017.
“Audio Spectrogram Representations for Processing with Convolutional Neural Networks.” In
Proceedings of the First International Conference on Deep Learning and Music, Anchorage, US, May, 2017 (arXiv:1706.08675v1 [Cs.NE]).
Xu, Dejia, Peihao Wang, Yifan Jiang, Zhiwen Fan, and Zhangyang Wang. 2022.
“Signal Processing for Implicit Neural Representations.” In.