{"id":38,"date":"2021-05-24T16:36:46","date_gmt":"2021-05-24T16:36:46","guid":{"rendered":"https:\/\/educate.winona.edu\/idmaa\/?page_id=38"},"modified":"2021-06-15T14:12:12","modified_gmt":"2021-06-15T14:12:12","slug":"session-7","status":"publish","type":"page","link":"https:\/\/educate.winona.edu\/idmaa\/2021-glitch\/schedule\/session-7\/","title":{"rendered":"Session 7"},"content":{"rendered":"<p><strong><span lang=\"EN\">Dongchen Hou: The Brokenness in Mediation: Speech-to-Text Technology in Chinese<\/span><\/strong><\/p>\n<p>When technological media become omnipresent and function perfectly, their presence goes unnoticeable. The popularity of speech-to-text technology testifies the technological convenience to human life: it can smoothly transfer vocal information to a visual, textual mode of representation. Speech-to-text technology, though only emerged in the recent decade, follows a historical genealogy that contains other mediating technologies such as stenography and typewriter. Stenographers and typists took the mediating position in transcribing audial information into visual written texts, either in handwritten or typed forms. With the development of Artificial Intelligence (AI) and machine deep learning, Google, Amazon Alexa, OS Siri, and Microsoft have developed technologies that can achieve simultaneous transcription as did by stenographers and typists.\u00a0It is undeniable that speech-to-text technologies have significantly changed human communicative modes with both humans and machines in general, however, the technology also poses challenges to users in different linguistic and social contexts. This research juxtaposes speech-to-text technology with other mediating technologies, including stenography and typewriter, to explore the ontological significance of broken media\/mediation. Morador (2015) defines a technology that is \u201cbroken\u201d as \u201cthose activities, directed towards the satisficing of human wants that are intended to produce changes in the material world that either do not manage to satisfy these wants or do not produce changes in the material world, or both.\u201d (17). This definition, however, fails to depict and understand the non-human side of the picture. In this paper, I draw on Harman\u2019s object-oriented ontology (2018), which provides another picture in the significance of non-human objects in discourse. I argue that mediated technologies gain presence and significance in the \u201cbrokenness.\u201d In other words, only when technologies break do they appear. I look at how digital technologies, after its total replacement of the human mediator, such as stenographer and typists, function in mediating different modalities of communication.\u00a0This research examines, to be specific, when speech-to-text technology breaks, and to whom it is broken. With a strong alphanumeric mechanism built in the technology in its invention, the brokenness is prominent and more visible in the context of non-alphabetic linguistic practices, such as the Chinese, or translingual practices. Incorporating Chinese writing into techno- linguistic modernity was never easy in history due to the Chinese language\u2019s \u201cincompatibility\u201d with alphabets and alphanumeric technologies (Mullaney 2017; Tsu 2014; Hou, 2019). In the Chinese language context, speech-to-text technologies are often \u201cbroken\u201d: either the technology cannot accurately identify the correct pronunciation of Chinese words, or wrong Chinese characters or expressions are chosen in texts, or the voice-recognition fails to capture non- standard, dialect Chinese. By so doing, this research aims to provide an alternative picture in understanding the significance of brokenness in media technology with a general ontological concern, exemplified in speech-to-text problems in the context of the Chinese language.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Valeria Lopez Torres:\u00a0On glitches as indicators of authenticity in AI-generated images\u00a0<\/strong><\/p>\n<p>Ever since technology was first introduced into daily life, people have learned to co-exist with technological error. Thus, the cultural significance of these errors\u2014also called \u200b<em>glitches<\/em>\u2014<em>\u200b\u00a0<\/em>has shifted along with technology, as they have been manipulated for various purposes\u2014seeking and embracing them to create art; introducing them intentionally to improve interaction; or working endlessly towards their eradication. While one definition of glitch is related to a \u201cminor malfunction\u201d (Merriam-Webster Dictionary), another definition relates to authenticity, as in \u201ca false or spurious electronic signal\u201d (idem). This paper examines the cultural role of the glitch as a marker of authenticity and as a device to aid viewers in discerning truth from fakery in a context in which sophisticated algorithms are able to generate highly-realistic (and convincing) images of human-looking faces through the use of Artificial Intelligence (AI).\u00a0As these AI-generated images enter the current visual landscape, viewers must arm themselves against deception, thus looking for glitches, which usually manifest as irregularities in the background and with skin texture, interruption of patterns, and unexplained blobs, among other inconsistencies. For the computer scientist who strives to refine these technologies to generate more convincing and realistic images, glitches are undesirable. Conversely, for the lay viewer who is confronted for the first time by these hyper-realistic AI-generated images, glitches are desirable, as they allow for verification of their origin and authenticity (where\u00a0<em>authentic=human\u200b<\/em>while<em>u<\/em>\u200b\u00a0<em>nauthentic=computer-generated\u200b<\/em>).\u00a0In an image-driven society,\u200b\u00a0<em>\u200b<\/em>where the boundaries between the virtual and non-virtual blur, and traditional notions of truth and reality are challenged, these technologies are reminders of the exciting possibilities of their positive applications, as well as the dangers of potential misuses, particularly in light of the emergence of \u200b<em>deep fakes \u200b<\/em>(videos in which AI algorithms are used to show someone saying or doing something that they did not in fact do or say). \u200bThe ethical consequences of these images and their potential uses are largely related to surveillance, control, and ultimately, power. Thus, d esigners, scientists, artists and scholars alike must think seriously about h\u200b ow technological progress might enable higher degrees of control in detriment of human rights and freedom. Similarly, e\u200bducators must ponder how to respond to the many ethical issues brought about by these technologies, and teach emerging designers how to engage, view, and work critically with these images as they permeate our visual cultural landscape.<\/p>\n<p><strong>Zach Whalen: The 2020 Presidential Debate But Only The Parts Where Someone Breathes Loudly Or Sighs<\/strong><\/p>\n<p>\u201cThis is the hard hitting debate that the government doesn&#8217;t want you to see.\u201d- X Caines (7 months ago)<\/p>\n<p>\u201cThis makes me feel like I can\u2019t breathe.\u201d- echilcote423 (3 weeks ago (edited))<\/p>\n<p>The quotes above are two comments representing the (currently) 2,293 comments of a\u00a0video &#8212; a video I created in October 2020 that currently has around 250,000 views on\u00a0YouTube (https:\/\/www.youtube.com\/watch?v=bBNV32aeVfE). This video lasts for one\u00a0minute and thirty-five seconds, and other than a very brief title, it consists of exactly\u00a0what the title describes: breathing from mostly Donald Trump and Joe Biden during one\u00a0of their debates leading up to the 2020 presidential election. I created the supercut with a\u00a0Python program called videogrep, and uploaded it to the YouTube channel that I use\u00a0mostly for uploading lectures and tutorials for my classes. The video languished until\u00a0some time in early April 2021 when &#8212; for reasons I don\u2019t fully understand &#8212; Youtube\u00a0began recommending it to certain viewers and the view count started ticking up rapidly.\u00a0What does this video\u2019s popularity reveal about algorithmically hegemonic taste silos and\u00a0the current state of Youtube shitposting memes? What role does technological literacy\u00a0play in viewers\u2019 interest in the video? And, more importantly, how does this video refract\u00a0or resonate with the figure of \u201cbreathing\u201d in the midst of a global pandemic respiratory\u00a0disease and while Black men are still being killed by police with their last words \u201cI can\u2019t\u00a0breathe\u201d galvanizing national and global protests?\u00a0In this presentation, I will demonstrate the method I used to create the video and argue\u00a0that it fits into a particular category of Youtube videos that emphasize arbitrary\u00a0technological interventions (for example, \u201cThe bee movie\u201d but only when they say &#8220;E&#8221; or\u00a0\u201c\u2018All Star\u2019 by Smashmouth but every word is someBODY\u201d ). Next, although the\u00a0much-criticized recommendation engine in Youtube is a black box, the analytics\u00a0provided for my video suggest some inferences that can be made about who that engine\u00a0perceives as the audience for this video. Finally, I will use the demographic information\u00a0included with those analytics to reflect on the meaning of this video in a cultural context.\u00a0Ultimately, I feel ambivalent about the \u201csuccess\u201d of this video, but I believe exploring and reflecting on the implications of its popularity will provide an interesting insight into\u00a0Youtube as a slice of contemporary digital culture.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Dongchen Hou: The Brokenness in Mediation: Speech-to-Text Technology in Chinese When technological media become omnipresent and function perfectly, their presence goes unnoticeable. The popularity of speech-to-text technology testifies the technological convenience to human life: it can smoothly transfer vocal information to a visual, textual mode of representation. Speech-to-text technology, though only emerged in the recent &hellip; <a href=\"https:\/\/educate.winona.edu\/idmaa\/2021-glitch\/schedule\/session-7\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Session 7<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":346,"featured_media":0,"parent":2,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-38","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/pages\/38","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/users\/346"}],"replies":[{"embeddable":true,"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/comments?post=38"}],"version-history":[{"count":3,"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/pages\/38\/revisions"}],"predecessor-version":[{"id":160,"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/pages\/38\/revisions\/160"}],"up":[{"embeddable":true,"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/pages\/2"}],"wp:attachment":[{"href":"https:\/\/educate.winona.edu\/idmaa\/wp-json\/wp\/v2\/media?parent=38"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}