动漫 av 探花偷拍以下这项发展中的合成媒体技能分支具备交易用途,但也有可能被用来扯后腿选举和分布假话。 译者 | 王婕 作家 | DJ PANGBURN 来源 | Fast Company 裁剪 | 蒲蒲 俄罗斯演义家维克多·佩列文(Victor Pelevin)的赛博一又克演义《Homo Zapiens》中,一位名叫Babylen Tatarsky的诗东谈主在苏联解体,俄罗斯经济靠近崩溃之际,被一位在莫斯科的大学挚友聘为告白案牍撰稿东谈主。 Tatarsky凭借着秘密的翰墨禀赋一齐水长船高,而他也逐步发现了这样一个事实:时任俄罗斯总统的叶利钦等政要和那时的要紧政事事件,实践上王人是造谣仿真的产物。放眼面前,跟着日益纯熟的“深度换脸”技能出现,佩列文的遐想似乎正在冉冉变为现实。 (数据瞻念凝视:【赛博一又克演义】赛博一又克演义属于科幻演义的类型,兴起于上世纪七十年代的好意思国,这一类故事里有巨额对新兴信息技能和生物科技的形容,连接触及跨国财团操纵高新技能,故事的主角一般会设定成游走在社会主流以外的边际东谈主,他们活在异日社会的阴雨面,可爱修改电脑的软硬件建立,珍贵改造躯壳,拒绝融入主流体制,靠着正当或者违规的技能技巧铤而走险,无意不吝与超等大公司拒抗。这种高与低并存产生的反差,酿成了一种特殊的好意思学效果,被空洞为“高技术、低生计”六个字。) 在“深度换脸”(亦或被计议东谈主员称之为“合成媒体”)的边界内,世东谈主的安静力主要鸠合在可能对政事现实酿成严重阻扰的“伪善神情”上,以及那些刻意师法东谈主写稿作风和声息的深度学习算法。 但是,如今合成媒体技能的另一个分支——“深度换身”正在速即发展。 2018年8月,加州大学伯克利分校的计议东谈主员发表了一篇题为《东谈主东谈主王人在跳舞》的论文和视频,展示了深度学习算法怎么将专科舞者的动作挪动到业余舞者身上。天然这一计议效果还有待完善,但已标明机器学习的计议东谈主员正在入部属手更具挑战的任务——“深度换身”。 同庚,德国海德堡大学的比约恩·奥默博士指引的一个计议团队发表了一篇对于教训机器传神规复东谈主类动作的论文。 本年4月,日本东谈主工智能公司Data Grid征战了一种东谈主工智能技能,不错自动生成不存在的东谈主体全身模子,并阐发了它在前锋和服装边界中的实践应用。 光显,“深度换身”的确不错打造部分兴致的交易应用,比如换脸跳舞应用技艺,或者被应用在体育和生物医学计议上,但坏心应用的案例在如今充斥着假话和假新闻的政事配景下也愈发受到关注。 天然脚下“深度换身”还不行十足掩东谈主耳目,但就像任何深度学习技能不异,它也终将卓绝,“深度换身”想要凑数其间,仅仅时辰问题。 东谈主体合成 为了已毕深度换脸,估计机科学家使用了生成式拒抗汇集(GAN),它由两个神经汇集构成:合成器/生成汇集,以及检测器/辩别汇集。这些神经汇集在精致的反馈回路中运行,生成确切的合成图像和视频。合成器从数据库创建图像,而检测器则在另一个数据库责任,用以细目合成器制造的图像是否准确真实。 “深度换脸”的初次坏心应用出面前Reddit(一个应答新闻站点)上,那时斯嘉丽·约翰逊等女演员的脸被移植到了色情电影演员的脸上。 Fast.AI公司鸠合首创东谈主瑞秋·托马斯表示,在面前已存在的“深度换脸”制品中,95%王人是想通过“伪善”的瞻念素材来进行个东谈主散乱词语。托马斯说:“其中一些深度换脸视频并不一定就使用了至极精致复杂的技能。” 但是,这种情况正启动接济。 法里德(新罕布什尔州汉诺威达特茅斯学院估计机科学解释)指出,中国的“深度换脸”应用技艺“Zao”就很好地讲明了这项技能在不到两年的时辰里发展得有多快。 “那些来自Zao的换脸视频看起来真的至极棒,况兼这些东谈主造品有好多就跟在电影版块当中呈现的画面不异。”法里德以为,“这无疑是一个卓绝,要知谈想让这款app大范围应用且至极百万东谈主的下载量并进犯易。这是‘深度换脸’技能走向熟习的象征。” “通过深度换脸的图像和视频,咱们基本上已毕了CGI技能(通用网关接口,是一种伏击的互联网技能,不错让一个客户端,从网页浏览器向彭胀在汇集做事器上的技艺苦求数据。CGI描摹了做事器和苦求处理技艺之间传输数据的一种圭臬)的环球化。”他进一步表示,“咱们把CGI技能从好莱坞的电影公司中带出来,交到了YouTube视频制作家的手中。” 海德堡大学图像处理(HCI)和跨学科科学估计中心(IWR)估计机视觉解释比约恩?奥默指引了一个计议和征战东谈主体合成媒体的团队。与该边界的大多数计议东谈主员不异,该小组的总体方针是领路图像,并教训机器怎么领会图像和视频。最终,他但愿团队大致更好地了解东谈主类是怎么领路图像的。 “咱们还是看到了东谈主体合成的化身不仅为游戏行业,还有许多其他边界王人创造了营收,”奥默表示,“尤其是对我的团队来说,咱们商量的是十足不同的边界,比如生物医学计议。咱们但愿更详确地了解东谈主类以致动物跟着时辰的推移,在残疾等雷同情况下,躯壳姿态的演进。” 东谈主脸合成与东谈主体合成的进程有着雄壮的互异。奥默表示,现时东谈主们还是对东谈主脸合成进行了更多的计议,这其中有几个原因。 开端,任何数码相机或智高手机王人有内置的面部检测技能,这种技能不错用于检测像含笑这样的任务,也不错用来识别瞻念众的目视对象。这样的应用技艺大致在产生营收的同期带动更多计议。但正如奥默所说,它们也导致了“巨额的数据聚积、数据整理和东谈主脸图像得回,而这些王人是征战深度学习计议的基础。” 其次,对奥默来说更兴致的是,天然每个东谈主的脸看起来王人不不异,但当把脸和通盘躯壳放在沿途比较时,变化其实并不大。“这等于为什么我说面部计议还是到了一定阶段,与通盘东谈主体比较,它创造了至极好的终局,因为东谈主体的可变性要大得多,处理起来愈加复杂,淌若你朝着这个方上前进,还需要学习更多”,奥默说。 奥默也不知谈东谈主体合成何时才略够达到他和计议东谈主员想要的圭臬。但是,纵瞻念那些慷慨解囊的深度换脸日益熟习,奥默指出,淌若莫得通过深度学习估计机视觉智能、东谈主工智能或其他技能制造的伪造品来一窥究竟,东谈主类可能早就受骗了。 “但是,淌若你想让它在更大的社会层面上被接受,那还需要几年的时辰,”奥默说,“深度换身”和其他深度作秀将变得愈加便宜和更精深。“计议界自身还是朝着一个好的标的发展,这小数得到了许多计议团体的高度赞叹,且这些团体对咱们大致愈加肤浅地得回算法这一进度的结识发展表现了很大的作用,比如github等。是以,你不错从一些论文高下载最新的代码,然后在不太了解荫藏内容的情况下,径直应用它。” 感到“力不从心” 不是每个东谈主王人能创造出“哄动一时”的深度换脸。但是,奥默许为,跟着时辰的推移,财富将不再成为得回估计资源方面的闭塞,软件的适用性也将变得容易得多。法里德说,有了“深度换身”,慷慨解囊的东谈主就不错利用深度换脸技能中的典型静止图像径直在摄像中启齿谈话,让“方针对象”为所欲为。 VRT电台(佛兰德播送公司)的考查记者兼驻外记者汤姆范德韦赫惦记,记者、还有东谈主权行动东谈主士和握不同政见者们,王人有可能被“深度换身”刀兵化。 汤姆范德韦赫(图片由本东谈主提供) 2016年大选时代假新闻的激增,以及2017年“深度换脸”的兴起,引发了范德韦赫对合成媒体的计议。2018年夏天,他在斯坦福大学启动了一项旨在拒抗坏心使用“深度换脸”的门径计议。 “受约束最大的不是大东谈主物、政客和名东谈主,”范德韦赫表示,“惟有正常东谈主——像你、我、女记者,以及那些可能成为或还是成为深度换脸受害者的边际群体。” 两周前,荷兰新闻主播迪翁·斯塔克斯发现我方的脸被“深度换脸”技能映射到了又名色情女演员的身上,该视频还被上传到PornHub网站(全球最大的色情视频分享类网站之一)并在互联网上等闲传播。尽管PornHub很快就删除了这段视频,但范德韦赫表示,她的声誉已接受到了毁伤。 为了更好地遐想“深度换身”是怎么责任的,范德韦赫提到了2018年CNN首席白宫记者吉姆·阿科斯塔的镜头。在贪念论网站Infowars,裁剪保罗约瑟夫沃森上传了一段视频:阿科斯塔似乎咄咄逼东谈主地推着又名试图拿他麦克风的白宫责任主谈主员。 这与C-SPAN(好意思国一家提供公众做事的非渔利性的媒体公司)播出的原始片断有显着不同。Infowars的裁剪宣称他并莫得批改视频,并将系数互异王人归因于“视频压缩”。 但是,正如《颓唐报》对视频进行的时辰轴裁剪分析知道,沃森的视频的确穷乏了原视频的其中几帧。“深度换身”就像裁剪视频时对帧数进行转换不异,不错改变事件的确切性。 竖立于2018年的Deeptrace Labs是一家汇集安全公司,正在征战基于估计机视觉和深度学习的器具,以分析和领路视频,尤其是那些不错被任何东谈主工智能支配或合成的视频。 该公司首创东谈主乔治?帕特里尼曾在阿姆斯特丹大学德尔塔实验室从事深度学习的博士后计议。他表示,几年前我方启动计议技能怎么防御或驻防异日合成媒体的浪掷。 帕特里尼以为,由东谈主体合成、东谈主脸合成和音频合成构成的坏心深度作秀,将很快被用来时弊记者和政客。 他提到了一段深度换脸的色情视频,视频中印度记者拉娜·阿尤布的脸被换成了又名色情女演员的躯壳,当作这场伪善信息领略的一部分,这一转为的宗旨就在于抹黑她的考查报谈。 此前,她公开条目对强奸和谋杀又名8岁克什米尔女孩的行动进行王法审判。本年3月,Deeptrace Labs对加蓬总统阿里·邦戈的“深度换脸视频”进行了考查。 尽管这个非洲国度的许多东谈主,包括加蓬部队在内王人以为邦戈一动不动的脸、眼睛和躯壳干涉着一个深度骗局,并基于此发动了一场不收效的政变,帕特里尼仍向《琼斯母亲》杂志表示,他不肯定总统的视频是合成的。 “咱们找不到任何意义肯定这是深度换脸的终局。我以为总统还谢世,这一料到随后也被阐发,不外他实践上是中风了。”帕特里尼说:“我想在这里指出的重心是,问题不在于视频到底是真实假,伏击的是东谈主们很明晰它会在公众公论中引发怀疑,在某些方位还可能引发暴力。” 动漫 av (图片由Deeptrace Labs提供) 最近,范德韦赫了解到,又名政党东谈主士正在宣战“深度换脸”最受宽宥的创造者之一,并条目其愚弄这项技能来中伤某东谈主。这种定制的“深度换脸”可能会成为一门大生意。 “‘深度换脸’会成为东谈主们谋利的器具,”范德韦赫说,“东谈主们会为它买单。是以,政府并不需要切身上阵,他们只需要说合一个专诚干这行的东谈主就不错了。” 《华尔街日报(博客,微博)》最近的报谈称,一家英国动力公司的首席彭胀官被骗,将24.3万好意思元转入了一家匈牙利供应商的账户。这位高管说,他肯定我方是在和雇主谈话,况兼他的雇主似乎也还是批准了这笔走动。 面前,这位首席彭胀官还是意志到他碰到了一种名为“汇集垂纶”的深度换音作秀。法里德以为,深度作秀技能,以致包括“深度换身”技能,在金融边界的讹诈很有可能呈淡漠之势。 “我不错制作一个杰夫·贝佐斯的深度换脸视频,让他在里面说亚马逊的股票正鄙人跌。”法里德说,爱唯侦察论坛bt工厂“想想看,作念空亚马逊股票能赚若干钱。当你法规它的时候,伤害还是酿成了……面前再遐想一下,当你看到一个民主党候选东谈主说一些违规或漠不柔和的话的视频时,你还以为你不行在选举前一天晚上傍边千千万万选民的投票吗?” 法里德以为,应答媒体和深度作秀视频的衔尾,不管是“深度换脸”如故“深度换身”,王人很容易产生极大的不良影响。应答媒体公司频繁无力或不肯治疗他们的平台和内容,因此深度换脸不错像野火不异延伸。 “当你把深度换脸的才略与在全球分布和消耗这些内容的才略衔尾起来时,进犯就来了。”他表示,“出于好多原因,咱们生计在一个高度分化的社会,也因此东谈主们连接会把意见相左的东谈主往坏处想。” 但对于Fast.AI公司鸠合首创东谈主瑞秋·托马斯来说,在新的汇集打破中,深度换脸对政事进度产生的负面影响简直不错忽略不计,因为政府和行业还是在与书面形式的伪善信息作斗争。 她说,这些风险不仅与技能相关,还与东谈主为要素相关。社会南北极分化配景下,好意思国和其他国度的大片地区不再有不错十足信任的事实来源。 这种不信任可能会让有政事动机的“深度换脸”制造者无孔不入。正如隐秘学者丹妮尔?西特龙所指出的,当深度换脸被揭穿时,它不错向那些肯定假话的东谈主表示,假话是有一定道理的。西特恩称这是“说谎者的红利”。 法里德以为,“深度换身”技能的卓绝将在全体上使这类坏心深度作秀的问题变得更糟。这项技能如今正在快速发展,很有可能在诸如《东谈主东谈主王人在跳舞》等高校计议和“Zao”APP征战商的教导下,将“深度换脸”正当化。 “一朝能对全身动作进行合成师法,那时画面上就不再仅仅出现一个讲话的脑袋了,你以致不错假装成别东谈主作念瞻念事或杀东谈主。”法里德说:“是不是还是不错这样操作了?面前可能还不行已毕。但一两年后,东谈主们就能作念到全身深度师法,这一料到并不是莫得道理的,况兼一朝已毕会表现至极强盛的作用。” 行业复兴 面前,科技行业还莫得达成肃除深度换脸的共鸣,许多不同的技能正在计议和测试中。 举例,范德韦赫的计议团队创造了各式里面挑战,并探索了不同的门径。其中一个计议小组计议了胶片的数字水印以识别深度换脸。另一个团队则试图使用区块链技能来征战信任,这亦然区块链技能自身的上风之一。另外,还有一个团队通过使用与当先“深度换脸”换取的深度学习技能来识别伪物。 “Sherlock AI是一个自动检测深度换脸的器具,由来自斯坦福大学的辍学生征战,”范德韦赫先容,“因此,他们取样了一些卷积模子,然后在视频中寻找额外。这一进程也被其他深度换脸检测器使用,比如Deeptrace Labs。他们使用名为FaceForensics++的数据集,然后对其进行测试,其准确率高达97%,对东谈主脸的识别效果也很好。” Deeptrace Lab基于API(应用技艺接口)的监控系统不错监测到深度伪造视频的创建、上传和分享。自2018年竖立以来,该公司还是在互联网上发现了卓绝1.4万个伪善视频。 Deeptrace Lab的系统网罗到的信息不错告诉公司过甚客户,深度伪造品的制作家在作念什么,伪造品来自那里,他们在使用什么算法,以及这些器具的可访谒性怎么。 帕特里尼说,他的团队发现,95%的深度伪造品王人是伪善色情类的“深度换脸”产物,大多数视频来自于一小撮名东谈主。到面前为止,Deeptrace Lab还莫得看到任安在田野应用的全身合成技能产物。 “你不行用单一的算法或想法往返首这些问题的不断有谋略,”帕特里尼表示,“这与征战一些能告诉你合成媒体不珍爱况的器具相关。” 范德韦赫以为反深度换脸技能的下一个要紧发明将是软生物特征识别技能。每个东谈主王人有我方特有的面部色调——扬起的眉毛、嘴唇的动作、手部的动作,这些王人不错当作某种个东谈主特征。 加州大学伯克利分校的计议东谈主员施卢蒂·阿加瓦尔使用了软生物计量模子来细目一些画面里的面部抽搐是否是为了视频效果而东谈主为的终局。(阿加瓦尔的论文导师是深度作秀视频民众、达特茅斯大学解释哈尼·法里德。) “基本念念路是,咱们不错征战相关这些寰宇列国指引东谈主的软生物识别模子,比如2020年总统候选东谈主,然后倘若视频启动失真,咱们不错对它们进行分析,来细目它们的确切性,”阿加瓦尔本年6月向伯克利新闻表示。 尽管商量到不同的东谈主在不同的环境下可能会呈现不同的面部抽搐,阿加瓦尔的模子并不十足可靠,但范德韦赫以为公司将来不错提供用于身份考证的软生物特征签名,这种特征可能是无人不晓的眼睛扫描或全身扫描。 “我以为这是咱们前进的标的:与学术界和大型科技公司协调,以创建更大的数据集。”范德韦赫表示,“当作新绅士,咱们应该勤劳匡助媒体加深对深度伪造的了解。” 最近,Facebook和微软与大学联手发起了深度换脸检测挑战。另一个值得安静的勤劳是国防高档计议缱绻局的方针,即用语义鉴证法来处理深度换脸伪物,寻找酿成不实的算法。 举例,一个东谈主在深度换脸视频中戴了与其不相当的耳饰。而在2018年9月,东谈主工智能基金会筹集了1000万好意思元,通过机器学习和东谈主类和谐员创建了一个识别深度换脸和其他坏心内容的器具。 但是,托马斯仍然怀疑技能是否能十足不断深度换脸的问题,不管它们摄取什么形式。她以为征战更好的系统来识别深度换脸是有价值的,但她重申,其他类型的不实信息也很放荡。 托马斯说,利益关联者应该探索社会和形式要素,因为这些要素也会导致严重的深度换脸和其他不实信息。 为什么对深度换脸的监管难度很大? 托马斯、范德韦赫和法里德一致以为,政府将不得不介入并监管深度换脸技能,因为放大此类挑动性内容的应答媒体平台要么无力监管,要么不肯意监管我方的内容。 本年6月,众议院谍报委员会主席、民主党众议员亚当·希夫就深度换脸技能酿成的伪善信息和伪善信息约束举行了初次听证会。希夫在开场白中指出,科技公司对此前的假视频作念出了不同的反馈。 YouTube立即删除了这段慢速播放的视频,而Facebook将其标注为假,并为止了它在通盘平台上的传播速率。这些不同的反馈导致希夫条目应答媒体公司制定策略,改造深度换脸视频的上传和传播。 “在短期内,彭胀伪善信息和其他无益的、挑动性的内容对这些平台来说是成心可图的,因此咱们的激励机制是十足错位的。”托马斯表示,“我不以为这些平台应该对它们所承载的内容承担包袱,但我如实以为它们应该对它们积极彭胀的内容承担包袱(举例,YouTube将亚历克斯?琼斯的视频保举给那些以致莫得在寻找他的东谈主160亿次)。” 托马斯补充谈:“总的来说,我以为,商量一下咱们怎么通过立法来处理那些将多半社会老本外部化、同期暗里条目利润的其它行业(如工业羞辱、大型香烟和快餐/垃圾食物),是有匡助的。” 帕特里尼表示,对合成媒体的监管可能会变得很复杂。但是他也以为,面前的一些法律,比如那些触及中伤、指责和版权的法律,不错用来监管坏心的深度换脸。 帕特里尼说,出台一项全面防止深度换脸的法律将是不实的行动。相背,他认识政府维持成心于社会的合成媒体应用,同期资助计议征战检测深度换脸的器具,并饱读动初创企业和其他公司也这样作念。 “政府还不错栽植公民这种技能的存在,因此咱们需要再行历练咱们的耳朵和眼睛,不要肯定咱们在互联网上看到和听到的一切。”帕特里尼说:“咱们需要给东谈主们和社会先打好防御针,而不是在可能两年后因为浪掷这项技能而发生至极横祸性或有争议的事情时才一火羊补牢。” 奥默表示,估计机视觉计议东谈主员很明晰深度换脸技能的坏心应用。他以为政府应该为怎么使用深度换脸技能征战问责制。 “咱们王人看到了图像领路的应用,以及它可能带来的公正,”奥默说,“但其中一个至极伏击的部分是要明确承担哪些包袱,以及谁将承担这一包袱?采访过我的政府机构等光显看到他们也负有这一包袱。公司也许为了鼓动的利益,他们可能也不得不表示他们看到了我方的包袱;但是,到面前为止,咱们心里王人很明晰他们是怎么处理这一包袱的。” “这是一件很辣手的事情,”奥默接着表示,“仅仅但愿这一切王人会往日……但是咱们知谈它将愈演愈烈。” You’ve been warned: Full body deepfakes are the next step in AI-based human mimicry This developing branch of synthetic media technology has commercial applications—but also has the potential to disrupt elections and spread disinformation. In Russian novelist Victor Pelevin’s cyberpunk novel, Homo Zapiens, a poet named Babylen Tatarsky is recruited by an old college buddy to be an advertising copywriter in Moscow amid post-Soviet Russia’s economic collapse. With a talent for clever wordplay, Tatarsky quickly climbs the corporate ladder, where he discovers that politicians like then-Russian president Boris Yeltsin and major political events are, in fact, virtual simulations. With the advent of ever-more sophisticated deepfakes, it feels as if something like Pelevin’s vision is slowly coming true. Within the field of deepfakes, or “synthetic media” as researchers call it, much of the attention has been focused full body deepfakes. In August 2018, University of California Berkeley researchers released a paper and video titled “Everybody Dance Now,” demonstrating how deep learning algorithms can transfer a professional dancers’ moves developed an AI that can automatically generate whole body models of nonexistent persons, identifying practical applications in the fashion and apparel industries. While it’s clear that full body deepfakes have interesting commercial applications, like deepfake dancing apps or in fields like athletics and biomedical research, malicious use cases are an increasing concern amid today’s polarized political climate riven by disinformation and fake news. For now, full body deepfakes aren’t capable of completely fooling the eye, but like any deep learning technology, advances will be made. It’s only a question of how soon full body deepfakes will become indistinguishable from the real. SYNTHESIZING ENTIRE HUMAN BODIES To create deepfakes, computer scientists use Generative Adversarial Networks, or GANs. Comprised of two neural networks—a synthesizer or generative network, and a detector or discriminative network—these neural networks work in a feedback loop of refinement to create realistic synthetic images and video. The synthesizer creates an image from a database, while the latter, working from another database, determines whether the synthesizer’s image is accurate and believable. The first malicious use of deepfakes appeared Scarlett Johansson were mapped Fast.AI says that 95% of the deepfakes in existence are pornographic material meant to harass certain individuals with fake sexual acts. “Some of these deepfakes videos aren’t necessarily using very sophisticated techniques,” says Thomas. But, that is starting to change. Farid points to the Chinese deepfake app Zao as being illustrative of how quickly the technology has evolved in less than than two years. “The ones that I saw [from Zao] looked really, really good, and got around a lot of the artifacts, like in the movie versions where the face flickered,” says Farid. “It’s improving. Getting this as an app working at scale, downloading to millions of people, is hard. It’s a sign of the maturity of the deepfake technology.” “With deepfake images and videos, we’ve essentially democratized CGI technology,” he says. “We’ve taken it out of the hands of Hollywood studios and put it in the hands of YouTube video creators.” Bj?rn Ommer, professor for computer vision at the Heidelberg University Collaboratory for Image Processing (HCI) & Interdisciplinary Center for Scientific Computing (IWR), leads a team that is researching and developing full body synthetic media. Like most researchers in the field, the group’s overall goal is to understand images and to teach machines how to understand images and video. Ultimately, he hopes the team gains a better understanding of how human beings understand images. “We’ve seen synthetic avatars that have been created not just in the gaming industry but a lot of other fields that are creating revenue,” says Ommer. “For my group, in particular, it’s entirely different fields that we are considering, like biomedical research. We want to get a more detailed understanding of human or even animal posture over time, relating to disabilities and the like.” There are critical differences between the processes of synthesizing faces and entire bodies. Ommer says that more research into face synthesis has been carried out. And there are a few reasons for this. First, any digital camera or smartphone has built-in face detection, technology that can be used for tasks like smile detection or to identify the person a viewer is looking at. Such applications can generate revenue, leading to more research. But they have also led to, as Ommer says, “a lot of data set assembly, data curation, and obtaining face images—the substrate upon which deep learning research is built.” Secondly, and more interesting to Ommer, is that while each human face looks different, there isn’t much variability when the face is compared to an entire human body. “That is why the research on faces has come to a stage where I would say it is creating really decent results compared to entire human bodies with much more variability being there, much more complicated to handle, and much more to learn if you head in that direction,” says Ommer. Ommer isn’t sure when full synthesized bodies will be of the quality that he and researchers want. Looking at the maturation of malicious deepfakes, however, Ommer notes that humans can already be tricked quite easily without fakes created by deep learning computer vision intelligence, artificial intelligence, or other technologies. “But, if you want to make it appealing to larger society, it will take a few more years,” says Ommer, who says full body and other deepfakes will become cheaper and more prevalent. “The research community itself has moved in a direction—and this is very much appreciated by much of the community that is responsible for a lot of this steady progress that we see—where the algorithms are easily available, like on Github and so on. So, you can just download the most recent code from some paper, and then, without much knowledge of what’s under the hood, just apply it.” FEELING “POWERLESS AND PARALYZED” Not every person will be able to create a “blockbuster deepfake.” But, given more time, Ommer says money will no longer be an issue in terms of computational resources, and the applicability of software will also become much easier. Farid says that with full body deepfakes, malicious creators will be able to work deepfake technology’s typically stationary figure talking directly into the camera, making targets do and say things they never would. Tom Van de Weghe, an investigative journalist and foreign correspondent for VRT (the Flemish Broadcasting Corporation), worries that journalists, but also human rights activists and dissidents, could have footage of them weaponized by full body deepfakes. The explosion of fake news during the 2016 election, and the rise of deepfakes in 2017 inspired Van de Weghe to research synthetic media. In the summer of 2018, he began a research fellowship at Stanford University to study ways of battling the malicious use of deepfakes. “It’s not the big shots, the big politicians, and the big famous guys who are the most threatened,” says Van de Weghe. “It’s the normal people—people like you, me, female journalists, and sort of marginalized groups that could become or are already becoming the victims of deepfakes.” Two weeks ago, Dutch news anchor Dionne Stax discovered her face “deepfaked” onto a porn actress’s body, after the video was uploaded to PornHub and distributed on the internet. Although PornHub quickly removed the video, Van de Weghe says that the damage to her reputation had already been done. He also points to China’s AI public broadcasters as proof that the Chinese government has the capability to pull off realistic deepfakes. To imagine how a full body deepfake might work, Van de Weghe points to 2018 footage of Jim Acosta, CNN’s chief White House correspondent. In a video clip uploaded by Paul Joseph Watson, an editor at conspiracy theory site Infowars, Acosta seems to aggressively push a white house staffer trying to take his microphone. The original clip, broadcast by C-SPAN, differs markedly from Watson’s. The Infowars editor claimed he didn’t doctor the footage and attributed any differences to “video compression” artifacts. But, as The Independent demonstrated in a side-by-side analysis of the videos in an editing timeline, Watson’s video is missing several frames from the original. A full body deepfake could, like editing video frames, alter the reality of an event. Deeptrace Labs, founded in 2018, is a cybersecurity company that is building tools based on computer vision and deep learning to analyze and understand videos, particularly those that could be manipulated or synthesized by any sort of AI. Company founder Giorgio Patrini, previously a postdoc researcher on deep learning at the DELTA Lab, University of Amsterdam, says that a few years ago he started investigating how technology could prevent or defend against future misuse of synthetic media. Patrini believes that malicious deepfakes, made up of a combination of synthetic full bodies, faces, and audio, will soon be used to target journalists and politicians. He pointed to a deepfake porn video that featured Indian journalist Rana Ayyub’s face swapped >told Mother Jones that he did not believe the video of the president had been synthesized. “We couldn’t find any reasons to believe it was a deepfake, and I think that was later confirmed that the president is still alive but that he’d had a stroke,” says Patrini. “The main point I want to make here is that it doesn’t matter if a video is a deepfake or not yet—it’s that people know that it can spark doubt in public opinion and potentially violence in some places.” Recently, Van de Weghe learned that a political party operative approached one of the most popular deepfake creators, requesting a deepfake to damage a certain individual. Such custom, made-to-order deepfakes could become big business. “There is money to be earned with deepfakes,” says Van de Weghe. “People will order it. So, a government doesn’t have to create a deepfake—they just have to contact a person who is specialized in deepfakes to create one.” The Wall Street Journal recently reported that a UK energy company CEO was fooled into transferring $243,000 to the account of a Hungarian supplier. The executive said he believed he was talking to his boss, who had seemingly approved the transaction. Now, the CEO believes he was the victim of an audio deepfake scam known as vishing. Farid believes other fraudulent deepfake financial schemes, which might include full body deepfakes, are only a matter of time. “I could create a deepfake video of Jeff Bezos where he says that Amazon stock is going down,” says Farid. “Think of all of the money that could be made shorting Amazon stock. By the time you rein it in, the damage has already been done. . . . Now imagine a video of a Democratic party nominee saying illegal or insensitive things. You don’t think you can swing the vote of hundreds of thousands of voters the night before an election?” Farid thinks a combination of social media and deepfake videos, whether of faces or full bodies, could easily wreak havoc. Social media companies are largely unable or unwilling to moderate their platforms and content, so deepfakes can spread like wildfire. “When you pair the ability to create deepfake content with the ability to distribute and consume it globally, it’s problematic,” he says. “We live in a highly polarized society, for a number of reasons, and people are going to think the worst of the people they disagree with.” But for Fast.AI’s Thomas, deepfakes are almost unnecessary in the new cyber skirmishes to negatively influence the political process, as governments and industry already struggle with fake information in the written form. She says the risks aren’t just about technology but human factors. Society is polarized, and vast swaths of the United States (and other countries) no longer have shared sources of truth that they can trust. This mistrust can play into the hands of politically motivated deepfake creators. When a deepfake is debunked, as privacy scholar Danielle Citron noted, it can suggest to those who bought the lie that there is some truth to it. Citron calls this “the liar’s dividend.” Farid thinks advancements in full body deepfake technology will make the overall problem of this type of nefarious deepfakery worse. The technology is evolving fast, spurred by university research like “Everybody Dance Now” and private sector initiatives such as Zao to monetize deepfakes. “Once you can do full body, it’s not just talking heads anymore: you can simulate people having sex or killing someone,” Farid says. “Is it just around the corner? Probably not. But eventually it’s not unreasonable that in a year or two that people will be able to do full body deepfakes, and it will be incredibly powerful.” INDUSTRY RESPONSE Currently, no consensus approach to rooting out deepfakes exists within the tech industry. A number of different techniques are being researched and tested. Van de Weghe’s research team, for instance, created a variety of internal challenges that explored different approaches. One team investigated digital watermarking of footage to identify deepfakes. Another team used blockchain technology to establish trust, which is one of its strengths. And yet another team identified deepfakes by using the very same deep learning techniques that created them in the first place. “Some Stanford dropouts created Sherlock AI, an automatic deepfake detection tool,” says Van de Weghe. “So, they sampled some convolutional models and then they look for anomalies in a video. It’s a procedure being used by other deepfake detectors, like Deeptrace Labs. They use the data sets called FaceForensics++, and then they test it. They’ve got like 97% accuracy and work well with faces.” Deeptrace Labs’ API-based monitoring system can see the creation, upload, and sharing of deepfake videos. Since being founded in 2018, the company has found over 14,000 fake videos on the internet. Insights gleaned by Deeptrace Labs’ system can inform the company and its clients about what deepfake creators are making, where the fakes came from, what algorithms they are using, and how accessible these tools are. Patrini says his team found that 95% of deepfakes are face swaps in the fake porn category, with most of them being a narrow subset of celebrities. So far, Deeptrace Labs hasn’t seen any full body synthesis technology being used out in the wild. “You cannot really summarize a solution for these problems in a single algorithm or idea,” says Patrini. “It’s about building several tools that can tell you different things about synthetic media overall.” Van de Weghe thinks the next big thing in anti-deepfake technology will be soft biometric signatures. Every person has their own unique facial tics—raised brows, lip movements, hand movements—that function as personal signatures of sorts. Shruti Agarwal, a researcher at UC-Berkeley, used soft biometric models to determine if such facial tics have been artificially created for videos. (Agarwal’s thesis adviser is fake video expert and Dartmouth professor Hany Farid.) “The basic idea is we can build these soft biometric models of various world leaders, such as 2020 presidential candidates, and then as the videos start to break, for example, we can analyze them and try to determine if we think they are real or not,” Agarwal told Berkeley News in June of this year. Although Agarwal’s models aren’t fullproof, since people in different circumstances might use different facial tics, Van de Weghe think companies could offer soft biometric signatures for identity verification purposes in the future. Such a signature could be something as well-known as eye scans or a full body scan. “I think that’s the way forward: create bigger data sets in cooperation with academics and big tech companies,” Van de Weghe says. “And we as newsrooms should try and train people and build media literacy about deepfakes.” Recently, Facebook and Microsoft teamed up with universities to launch the Deepfake Detection Challenge. Another notable effort is the Defense Advanced Research Projects Agency’s (DARPA) goal of tackling deepfakes with semantic forensics, which looks for algorithmic errors that create, for instance, mismatched earrings worn by a person in a deepfake video. And in September 2018, the AI Foundation raised $10 million to create a tool that identifies deepfakes and other malicious content through both machine learning and human moderators. But, Fast.AI’s Thomas remains skeptical that technology can fully solve the problem of deepfakes, whatever form they might take. She sees value in creating better systems for identifying deepfakes but reiterates that other types of misinformation are already rampant. Thomas says stakeholders should explore the social and psychological factors that play into deepfakes and other misinformation as well. WHY IT’S TOUGH TO REGULATE DEEPFAKES Thomas, Van de Weghe, and Farid all agree that governments will have to step in and regulate deepfake technology because social media platforms, which amplify such incendiary content, are either unable or unwilling to police their own content. In June, Rep. Adam Schiff (D-CA), chair of the House Intelligence Committee, held the first hearing on the misinformation and disinformation threats posed by deepfakes. In his opening remarks, Schiff made note of how tech companies responded differently to the fake Pelosi video. YouTube immediately deleted the slowed-down video, while Facebook labeled it false and throttled back the speed at which it spread across the platform. These disparate reactions led Schiff to demand social media companies establish policies to remedy the upload and spread of deepfakes. “In the short-term, promoting disinformation and other toxic, incendiary content is profitable for the major platforms, so we have a total misalignment of incentives,” says Fast.AI’s Thomas. “I don’t think that the platforms should be held liable for content that they host, but I do think they should be held liable for content they actively promote (e.g. YouTube recommended Alex Jones’ videos 16 billion times to people who weren’t even looking for him).” “And, in general, I think it can be helpful to consider how we’ve [legislatively] dealt with other industries that externalize large costs to society while privately claiming the profits (such as industrial pollution, big tobacco, and fast food/junk food),” Thomas adds. Deeptrace Labs’ Patrini says regulation of synthetic media could prove complicated. But, he believes some current laws, like those covering defamation, libel, and copyright, could be used to police malicious deepfakes. A blanket law to stop deepfakes would be misguided, says Patrini. Instead, he advocates government support for synthetic media applications that benefit society, while funding research into creating tools to detect deepfakes and encouraging startups and other companies to do the same. “[Government] can also educate citizens that this technology is already here and that we need to retrain our ears and eyes to not believe everything we see and hear on the internet,” says Patrini. “We need to inoculate people and society instead of repairing things in maybe two years when something very catastrophic or controversial might happen because of misuse of this technology.” Ommer says computer vision researchers are well aware of the malicious applications of deepfakes. And he sees a role for government to play in creating accountability for how deepfakes are used. “We all see applications of image understanding and the benefits that it can potentially have,” says Ommer. “A very important part of this is responsibility and who will take a share in this responsibility? Government agencies and so on who have interviewed me obviously see their share in this responsibility. Companies say and probably—in the interest of their stockholders—have to say that they see their responsibility; but, we all know how they have handled this responsibility up until now.” “It’s a tricky thing,” Ommer says. “Just hoping that this will all go away . . . it won’t.” 本文首发于微信公众号:数据瞻念。著作内容属作家个东谈主瞻念点,不代表和讯网态度。投资者据此操作动漫 av,风险请自担。 |