Human Voice: Legal Protection Challenges
Abstract
Focused on the phenomenon of human voice, the paper purports to develop approaches to legal protection of ordinary people's voices and those of artists as the technologies for sound synthesis, music and vocal performance are becoming more sophisticated. It was demonstrated the problem of legal protection of human voice has become especially pressing one in the context of artificial neural networks such as vocaloid designed for sound synthesis and voice cloning. The study provides a systemic arrangement of legally meaningful knowledge of human voice useful for the development of legal provisions to protect this personal good. The legal substance of vocal impersonation as a way to simulate and manipulate a synthesized voice was explored. Theoretically applicable legal constructs for protection of voice and computerized cloning technologies were analyzed. A trend to use copyright rather than patent for protecting vocaloid-like generative neural networks and other technological solutions for vocal synthesis was identified. The concept of voice was critically analyzed to propose a viable legal provision for its protection. The primary and auxiliary features of the concept of voice were parametrized for possible use in disputes on the legitimacy of voice cloning or vocal impersonation. The concept of vocal identity of ordinary people and artists was proposed for legal protection of this personal good as a set of performative sonic features including the basic parameters of singing voice as well as acoustic-phonetic and articulatory features of vocalization. A comprehensive legal and technological methodology for the protection of vocal identity was proposed.
References
Abel J. E. (2022) The New Real: Media and Mimesis in Japan from Stereographs to Emoji. Minneapolis: University of Minnesota Press, 360 p.
Amiot E. (2016) Music Through Fourier Space.
Anisimova S.V., Golovatiy A.I. (2021) Musical Fragments, Notes and Chord Recognition Algorithms. Vestnik Polotskogo gosudarstvennogo univeristeta. Seria S. Fundamentalnye nauki=Polotsk State University Bulletin. Series S. Fundamental Sciences, no. 4, pp. 45-49 (in Russ.)
Aznauryan L.A. (2017) Study in models and algorithmic methods for analysis of musical works. Papers of Annual Research Conference PhysicMinneapolis:al, Mathematical and Natural Sciences. Yerevan: Russian-Armenian (Slavonic) University, pp. 67-76.
Balashova A.I. (2022) AI in copyright and patent law: range of objects and subjects, legal protection term. Zhurnal suda po intellectualnym pravam=Journal of Intellectual Property Rights Court, no. 2, pp. 90-98 (in Russ.)
Beliavskiy D.M., Darbinyan S.S. et al. (2016) Digital Identification of Objects: Beyond Technologies. Moscow: Nauchnoe obozrenie, 252 p. (in Russ.)
Bridy A. (2016) The evolution of authorship: works made by code. Columbia Journal of Law and the Arts, vol. 39, pp. 395, 397.
Boden M. (2004) The creative mind. Myths and mechanisms. London-New York: Routledge, 344 p.
Calo R. (2014) Robotics and the Lessons of Cyberlaw (2014) California Law Review, vol. 103, no. 3, pp. 513-563. University of Washington School of Law. Research Paper No. 2014-08.
Cheng H., Guo Y., Wang T. et al. (2023) Voice-face homogeneity tells deepfake. ACM Transactions on Multimedia Computing, Communications and Applications, no. 3, pp. 1-22.
Davis H. (2014) Creating music from literature. Papers of EACL Workshop on computer linguistics for literature, pp. 1-10. arXiv: 1403.2124. Bibcode: 2014arXiv1403.2124D. 10.3115/v1/W14-0901. S2CID 9028922.
Davis H., Mohammad S. (2014) Generating Music from Literature. In Proceedings of the Workshop on Computational Linguistics for Literature. Gothenburg: Association for Computational Linguistics, pp. 1-10.
Deleuze G. (2011) Logique du sens. Мoscow: Akademicheskiy Proekt, 472 p. (in Russ.)
Denikin А.А. (2013) Model of diegetic analysis of the sound in screen media. Mediamuzyka research e-journal=Media Music Research E-Journal, no. 2. (in Russ.)
Drenten J., Brooks G. (2020) Celebrity 2.0: Lil Miquela and the rise of a virtual star system. Feminist Media Studies, no.8, pp. 1319-1323.
Eck D., Schmidhuber J. (2002) Finding temporal structure in music: Blues improvisation with LSTM recurrent networks. In: Proceedings of the 2002 IEEE Workshop on Neural Networks for Signal Processing, pp. 747-756.
Eidsheim N. S. (2018) Widening Rings of Being: Singer as Stylist and Technician. In: The Race of Sound: Listening, Timbre, and Vocal in African-American Music. Durham: Duke University Press, pp. 177-200.
Emelianov V.V. (2023) Development of Voice. Manual for Vocal Control and Training. Saint Petersburg: Lan' Press, 168 p. (in Russ.)
Ivanov S.M. (2018) Neural network-generated polyphonic music. Papers of Lomonosov Conference Computational mathematics and cybernetics panel. Moscow: MGY Press, pp. 110-111 (in Russ.)
Ivanov A.P., Nikiforov V. (2006) Vocal Art. Moscow: Golos-Press, p. 37 (in Russ.)
Jones A., Bennett R., Cross S. (2015) 10 — Keeping it real? Life, death, and holograms on the live music stage. In: A. Jones, R. Bennett (eds.). The Digital Evolution of Live Music. Whitney: Chandos Publishing, pp. 123-138.
Khachatryan А.А. (2023) AI neural network as a tool for recognition of musical genres. Applied mathematics: current problems of mathematics, computer science and simulation: papers of the All-Russia research workshop of young scientists. Krasnodar: Russian Energy Agency Press, pp. 479-483 (in Russ.)
Khrisanfova L.A., Diveeva A.S. (2017) Perceiving the peculiarities of human psychology from face photos and voice recordings. Yearbook of Psychological and Psychoanalytical Studies. Moscow: Institute of Psychoanalysis, pp. 157-164 (in Russ.)
Kolonei V.A.(2017) Function of musical intonation components. Muzykalnoe iskusstvo=Music Art, no. 16, pp. 18-31 (in Russ.)
Levy D. (2005) Robots Unlimited: Life in a Virtual Age.
Lund J. (2012) An Empirical Examination of the Lay Listener Test in Music Composition Copyright Infringement.
Makarevich I.V. (2022) Prospects of technologies for recognition of human emotions by facial expressions and voice. Papers of Regional Conference for young researchers. Volgograd: State Technological University Press, pp. 156157 (in Russ.)
Malakhov V.E. (2022) Applying decomposition to study generative music creation processes. Informatsionnye tekhnologii i telekommunikatsii=Information and Communication, vol. 10, no. 4, pp. 69-78.
March L. (2023) Wrap You Up in My Blue Hair: Vocaloid, Hyperpop, and Identity in “Ashnikko Feat. Hatsune Miku — Daisy 2.0”. Television and New Media, no. 8, pp. 894-910.
Markina Yu.Yu., Belov Yu.S. (2018) Spectral coefficients as a necessary parameter in creating human voice cloning systems using deep learning methods. Mezhdunarodnyi studencheskiy nauchnyi vestnik=International Bulletin of Student Research, no. 1. Available at: URL: https://eduherald.ru/ru/article/view?id=18125 (accessed: 26.01.2023)
Mazzei L.A. (2016) Voice Without a Subject. Cultural Studies. Critical Methodologies, no. 2, pp. 151-161.
Nikolskiy A.V., Alexeev E.E. et al. (2019) A Jew's Harp Story: Jew's harp and individual song as a basis of tone-oriented musical systems. Yazyki i folklor ko-rennykh narodov Sibiri=Languages and Folklore of Aboriginal Siberian Peoples, no. 37, pp. 5-32.
Perduto P. (2022) NFT — Income for Creative Minds. How to Generate Additional Income as a Photographer, Graphic Designer, Composer, Musician or other Art Creator. S.L.: Books on demand Publishers, 156 p.
Rais М. (2016) Musical modes: classification, characteristics, properties, history, current status. Israel XXI. Music Magazine, no. 3, p. 9 (in Russ.)
Ramalho A. (2017) Will Robots Rule the (Artistic) World? A Proposed Model for the Legal Status of Creations by Artificial Intelligence Systems.
Rinkerman G. (2023) Artificial Intelligence and evolving issues under US copyright and patent law. Interactive Entertainment Law Review, no. 2, pp. 4865.
Rudin L.B. (2009) Principles of Voiceology. Moscow: Granitza, 104 p. (in Russ.).
Sivolap Т.Е. (2019) Audiovisual works: peculiarities of legal protection. Current issues of radio and cinema technologies: papers of International Conference of Technological Research to the Centenary of the Cinema and Television Institute. D.P. Barsukov et al. (eds.). Saint Petersburg: State Institute of Cinema and Television Press, pp. 237-242 (in Russ.)
Stulova G.P. (2014) Pedagogical factors affecting the singing voice timbre. UNESCO Chair of Musical Art and Education Bulletin, no. 4, pp. 90-104 (in Russ.)
Tikhomirova N.F. (2021) Conceptual principles of classification in the analysis of creative products of musical culture. Vestnik Donetskogo natcionalnogo Universiteta=Donetsk National University Bulletin, no. 1, pp. 123-128 (in Russ.)
Tormozova VA. (2023) Current issues of audiovisual works as an object of copyright. Trudy po intellektualnoy sobstvennosti=Works on Intellectual Property, no. 4, pp. 54-67.
Santiago J. (2017) The “Blurred Lines” of Copyright Law: Setting a New Standard for Copyright Infringement in Music. Brooklyn Law Review, vol. 83, pp. 289321.
Toropova A.V (2017) Musical sensation as a basis of efficient musical therapy A study in cross-cultural interpretation of L.S. Vygotsky's psychological framework in context of variable models of musical therapy. In: Art studies in the context of other sciences in Russia and elsewhere: parallels and interactions. Papers of the international research conference. Moscow: Soglasie Press, pp. 394-405 (in Russ.)
Umetalieva-Bayalieva Ch.T. (2015) Musical language as an object of cultural studies. The means and elements of musical expression (musical language). Nauka i novye tehnologii=Science and New Technologies, no. 2, pp. 204-207 (in Russ.)
Vanyukov P.D. (2023) The legal regime for AI and its outcomes. Intellektual-naya sobstvennost=Intellectual Property, no. 3, pp. 58-68 (in Russ.)
Vasilevskaya L.Yu. et al. (2021) Digitizing Civil Law Transactions: Legal Characteristics of “Artificial Intelligence” and “Digital” Subjects (a civil study in 5 vols.). Vol. 1. Moscow: Prospekt, 288 p. (in Russ.)
Watt D., Harrison P., Cabot-King L. (2020) Who owns your voice? Linguistic and legal perspectives of relationship between vocal distinctiveness and the rights of the individual speaker. International Journal of Speech, Language and the Law, no. 1, pp. 137-180.
Yuan J.E. (2024) Making virtual celebrity: Platformization and intermediation in digital cultural production.
Zakharov А.О. (2020) Neural networks in music. Week of Science and Creativity. Papers of international research workshop for students, postgraduate students and young scientists. Saint Petersburg: State Institute of Cinema and Television Publishers Press, pp. 36-39 (in Russ.)
Copyright (c) 2024 Budnik R.A., Evpak E.G.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the Licensing, Copyright, Open Access and Repository Policy.