{"id":109,"date":"2015-09-12T12:17:03","date_gmt":"2015-09-12T10:17:03","guid":{"rendered":"http:\/\/aireligion.org\/?p=109"},"modified":"2015-09-12T12:17:03","modified_gmt":"2015-09-12T10:17:03","slug":"apple-tv-highlights-how-far-ai-has-come-and-how-far-it-has-to-go","status":"publish","type":"post","link":"https:\/\/aireligion.org\/?p=109","title":{"rendered":"Apple TV highlights how far AI has come \u2014 and how far it has to go"},"content":{"rendered":"<header class=\"article-header\">\n<div class=\"article-byline\">\n<div class=\"social-icons article-byline-social article-social\">\n<div class=\"article-social-wrapper\">\n<p><img src=\"https:\/\/fortunedotcom.files.wordpress.com\/2015\/09\/gettyimages-487395820.jpg?quality=80&amp;w=840&amp;h=485&amp;crop=1\" alt=\"\" \/><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/header>\n<div class=\"article-body\">\n<div class=\"article-bottom\">\n<div class=\"article-body-text rail-offset\">\n<p>Voice recognition is great, but Siri isn\u2019t making movies \u2026 yet?<\/p>\n<p>If you watched, read or even heard about Apple\u2019s big event on Wednesday, chances are you noticed that <a href=\"http:\/\/fortune.com\/2015\/09\/09\/apple-tv\/\">the upcoming Apple TV upgrade<\/a> sounds impressive. One of its big draws is that users can search for shows and control the experience using their voice rather than relying on a remote control. This new Siri-powered voice search capability is a feature that demonstrates just how far artificial intelligence technology has come, as well as how far it has to go before it rivals the full power of the human mind.<\/p>\n<p><!--more--><\/p>\n<p>In a nutshell, the new Apple TV voice search feature looks a lot like using <a href=\"http:\/\/fortune.com\/2015\/06\/23\/amazon-echo-review\/\">Amazon\u2019s music-playing, news-reading Echo intelligent speaker<\/a>\u2014only, you know, on a TV. Viewers will be able to search for shows or movies by name or actress or genre, or some combination of them. They\u2019ll be able to fast-forward and rewind by exact times (by saying, for example, \u201cFast-forward serven minutes\u201d) and even call up the weather or other information, all by pressing a button and speaking like they might to another human being. Or, I guess, Siri.<\/p>\n<figure><a href=\"https:\/\/fortunedotcom.files.wordpress.com\/2015\/09\/apple-tv.jpg?quality=80\"><img loading=\"lazy\" class=\"wp-image-1297234 size-full\" src=\"https:\/\/fortunedotcom.files.wordpress.com\/2015\/09\/apple-tv.jpg?quality=80&amp;w=770&amp;h=540\" alt=\"Apple TV\" width=\"770\" height=\"540\" \/><\/a><figcaption class=\"image-caption\"><span class=\"credit\">Image courtesy of Apple<\/span><\/figcaption><\/figure>\n<p>It would all be pretty remarkable if speech recognition wasn\u2019t already so commonplace. Whether we use iOS, Android or even Windows, we can all talk to our phones and get answers. On the TV, Roku 3 and Amazon Fire TV already support voice search (albeit with fewer bells and whistles than Apple showed Wednesday). And Amazon <span class=\"tickershortcode quotecard_hook\" data-symbol=\"AMZN\"><span class=\"wrapper trend-wrapper increase\"> <a href=\"http:\/\/fortune.com\/company\/AMZN\">AMZN<\/a><\/span> <span class=\"change percent\">1.37%<\/span> <\/span> has its aforementioned Echo device that lets users play music, get the news, dim their lights and a do whole lot more using their voice.<\/p>\n<h2>Artificial intelligence is all around us<\/h2>\n<p>This is not an indictment of Apple\u2019s innovative spirit, but rather an acknowledgement of how amazing advances in artificial intelligence have been in recent years. Mostly, the pervasiveness of high-quality speech recognition owes to the untold millions of dollars that companies\u2014primarily Google <span class=\"tickershortcode quotecard_hook\" data-symbol=\"GOOG\"><span class=\"wrapper trend-wrapper increase\"> <a href=\"http:\/\/fortune.com\/company\/GOOG\">GOOG<\/a><\/span> <span class=\"change percent\">0.71%<\/span> <\/span>, Microsoft <span class=\"tickershortcode quotecard_hook\" data-symbol=\"MSFT\"><span class=\"wrapper trend-wrapper increase\"> <a href=\"http:\/\/fortune.com\/company\/MSFT\">MSFT<\/a><\/span> <span class=\"change percent\">0.37%<\/span> <\/span>, Facebook <span class=\"tickershortcode quotecard_hook\" data-symbol=\"FB\"><span class=\"wrapper trend-wrapper increase\"> <a href=\"http:\/\/fortune.com\/company\/FB\">FB<\/a><\/span> <span class=\"change percent\">0.08%<\/span> <\/span> and Baidu\u2014have spent researching and commercializing a field of AI called deep learning.<\/p>\n<p>Putting aside thin comparisons to how brains work (an unavoidable consequence of so-called artificial \u201cneural network\u201d algorithms that form the technology\u2019s foundation), <a href=\"https:\/\/gigaom.com\/2015\/01\/29\/new-to-deep-learning-here-are-4-easy-lessons-from-google\/\">the reality of learning models<\/a> is that they\u2019re very good at recognizing patterns. Train a voice-recognition system with enough voice samples, and it will <a href=\"http:\/\/usa.baidu.com\/deep-speech-lessons-from-deep-learning\/\">learn to recognize spoken words<\/a>. Train a computer vision system on enough images and it will learn to <a href=\"http:\/\/fortune.com\/2015\/03\/17\/google-facenet-artificial-intelligence\/\">recognize the objects (or faces) in them<\/a>. The same goes for <a href=\"http:\/\/google-opensource.blogspot.com\/2013\/08\/learning-meaning-behind-words.html\">the meanings of words in text<\/a>, the <a href=\"http:\/\/benanne.github.io\/2014\/08\/05\/spotify-cnns.html\">sounds in different types of music<\/a>, <a href=\"http:\/\/googleresearch.blogspot.com\/2015\/02\/from-pixels-to-actions-human-level.html\">the rules of video games<\/a> \u2014 you name it.<\/p>\n<figure><a href=\"https:\/\/fortunedotcom.files.wordpress.com\/2015\/09\/baseball.jpg?quality=80\"><img loading=\"lazy\" class=\"size-large wp-image-1299040\" src=\"https:\/\/fortunedotcom.files.wordpress.com\/2015\/09\/baseball.jpg?quality=80&amp;w=1024\" alt=\"When I search Google Photos for &quot;baseball.&quot; \" width=\"1024\" height=\"614\" \/><\/a><figcaption class=\"image-caption\"><span class=\"caption\">When I search Google Photos for \u201cbaseball.\u201d<\/span><\/figcaption><\/figure>\n<p>Once reports started emerging about the successes these companies achieved with deep learning, their peers caught on pretty quickly. Apple, Amazon, Netflix <span class=\"tickershortcode quotecard_hook\" data-symbol=\"NFLX\"><span class=\"wrapper trend-wrapper decrease\"> <a href=\"http:\/\/fortune.com\/company\/NFLX\">NFLX<\/a><\/span> <span class=\"change percent\">-1.98%<\/span> <\/span>, Pinterest, Twitter <span class=\"tickershortcode quotecard_hook\" data-symbol=\"TWTR\"><span class=\"wrapper trend-wrapper decrease\"> <a href=\"http:\/\/fortune.com\/company\/TWTR\">TWTR<\/a><\/span> <span class=\"change percent\">-1.15%<\/span> <\/span> and other companies began buying up startups and hiring experts to get their own deep learning efforts off the ground. That\u2019s why advanced speech recognition, computer vision and text analysis are so pervasive now \u2014 from Google Photos to Microsoft\u2019s <a href=\"http:\/\/www.skype.com\/en\/translator-preview\/\">Skype Translate<\/a> to <a href=\"https:\/\/swiftkey.com\/en\/\">SwiftKey\u2019s predictive keyboard app<\/a> that knows which word you\u2019ll type next.<\/p>\n<p>There are also artificial intelligence startups, often based on deep learning, that specialize in outsourcing a variety of these sci-fi tasks. <a href=\"https:\/\/www.expectlabs.com\/\">Expect Labs<\/a> specializes in voice search. Earlier this year, <a href=\"https:\/\/wit.ai\/blog\/2015\/01\/05\/wit-ai-facebook\">Facebook acquired Wit.AI<\/a>, a startup building a speech-recognition system that lets developers turn regular applications into voice-powered ones. <a href=\"http:\/\/www.clarifai.com\/\">Clarifai<\/a>analyzes images; <a href=\"https:\/\/www.metamind.io\/\">MetaMind<\/a> analyzes images and text;<a href=\"https:\/\/www.dextro.co\/\">Dextro<\/a> analyzes video; and AlchemyAPI, <a href=\"https:\/\/www-03.ibm.com\/press\/us\/en\/pressrelease\/46205.wss\">acquired earlier this year by IBM<\/a><span class=\"tickershortcode quotecard_hook\" data-symbol=\"IBM\"><span class=\"wrapper trend-wrapper increase\"> <a href=\"http:\/\/fortune.com\/company\/IBM\">IBM<\/a><\/span> <span class=\"change percent\">0.77%<\/span> <\/span>, analyzes images, text and news articles online.<\/p>\n<p>Oh, yeah, IBM has Watson, too. Since winning at<em>Jeopardy! <\/em>in 2011, the <a href=\"http:\/\/fortune.com\/video\/2015\/04\/14\/ibm-jumps-into-health-data\/\">Watson machine-learning software has been busy reading and learning text documents<\/a> in fields from retail to oncology.<\/p>\n<h2>Leaving art to the artists<\/h2>\n<p>When I watched Apple\u2019s <span class=\"tickershortcode quotecard_hook\" data-symbol=\"AAPL\"><span class=\"wrapper trend-wrapper increase\"> <a href=\"http:\/\/fortune.com\/company\/AAPL\">AAPL<\/a><\/span> <span class=\"change percent\">1.31%<\/span> <\/span> event Wednesday, though, I was also reminded that AI will probably never be the star of the show when it comes to entertainment\u2014at least not anytime soon. Content is still king, and when we\u2019re using Apple TV, or any AI for that matter, we\u2019ll be using it to sort through and analyze creative content that no AI can yet create.<\/p>\n<p>We want Apple TV to help us sort through the shows and movies on Netflix, Hulu and iTunes. We want Spotify to help us find new music we\u2019ll like. We want Google Photos and Facebook to help us find and organize our favorite photos.<\/p>\n<p>Sure, you\u2019ll occasionally read headlines <a href=\"http:\/\/www.technologyreview.com\/view\/537716\/machine-learning-algorithm-mines-rap-lyrics-then-writes-its-own\/\">about computers that can rap<\/a>, <a href=\"http:\/\/www.technologyreview.com\/view\/541091\/how-machine-vision-is-about-to-change-the-fashion-world\/\">identify fashion trends<\/a> or even devise recipes, but take a look at the finished products and they\u2019re all a little less impressive. Mashing up lines and rhymes from a collection of rap songs is not a particularly creative endeavor, especially when Eminem\u2019s creative way of rhyming words confounds the algorithms. Confirming correlations between what clothing designers come up with and what people wear is not the same as developing a new fall lineup that blows people\u2019s minds.<\/p>\n<p>One could argue that people are <a href=\"http:\/\/fortune.com\/2015\/05\/27\/ibm-watson-recipes\/\">generally optimistic about IBM\u2019s Chef Watson cookbook<\/a>\u2014an attempt to show that an AI system can mine recipes and then come up with its own unique ones\u2014but at least one reviewer called its Austrian Chocolate Burrito <a href=\"http:\/\/www.fastcodesign.com\/3045147\/ibms-watson-designed-the-worst-burrito-ive-ever-had\">the worst he\u2019s ever had<\/a>. Perhaps Watson should have listened to its human collaborator, chef Michael Laiskonis, and included cotija cheese in the recipe rather than cheese curds.<\/p>\n<figure><a href=\"https:\/\/fortunedotcom.files.wordpress.com\/2015\/09\/ibmburrito.jpg?quality=80\"><img loading=\"lazy\" class=\"wp-image-1299044 size-large\" src=\"https:\/\/fortunedotcom.files.wordpress.com\/2015\/09\/ibmburrito.jpg?quality=80&amp;w=1024\" alt=\"ibmburrito\" width=\"1024\" height=\"683\" \/><\/a><figcaption class=\"image-caption\"><span class=\"credit\">Image Courtesy IBM<\/span><\/figcaption><\/figure>\n<p>If there\u2019s a moral to all of this, it\u2019s that we should be amazed by AI and how prevalent it is becoming in our lives. When it comes to recognizing distinct breeds of dogs or tens of thousands of human faces, AI systems <a href=\"http:\/\/fortune.com\/2015\/03\/17\/google-facenet-artificial-intelligence\/\">are actually better than people<\/a>. The 21st century is shaping up to be a lot more like <em>The Jetsons<\/em> than many of could have predicted even at the turn of the millennium (we didn\u2019t even have the <em>iPod<\/em> in 2000, much less the<em> iPhone <\/em>or Siri).<\/p>\n<p>But make no mistake: AI today is often just serving and surfacing the genius of the human mind. I\u2019m happy for Apple TV and Siri to elevate my movie watching experience, but it will be a while before I\u2019ll be watching a<em>good<\/em> movie made by Siri.<\/p>\n<p><em>For more about the new Apple TV, watch this Fortune video:<\/em><\/p>\n<div class=\"video-wrapper\" data-ratio=\"0.56363636363636\" data-pos=\"1\"><object id=\"bc-video-4473784820001-1\" class=\"BrightcoveExperience\" data=\"http:\/\/c.brightcove.com\/services\/viewer\/federated_f9?&amp;width=550&amp;height=310&amp;flashID=bc-video-4473784820001-1&amp;bgcolor=%23FFFFFF&amp;playerID=3160175193001&amp;playerKey=AQ~~%2CAAAB668kGak~%2CLMlvL4u4ShOTHD9z00VquajMOcH97tcW&amp;isVid=true&amp;isUI=true&amp;videoSmoothing=true&amp;seamlessTabbing=false&amp;swliveconnect=true&amp;dynamicStreaming=true&amp;autoStart=false&amp;%40videoPlayer=4473784820001&amp;linkBaseURL=http%3A%2F%2Ffor.tn%2F1JUqXcJ&amp;includeAPI=true&amp;templateLoadHandler=Fortune_onTemplateLoad&amp;templateReadyHandler=brightcove%5B%22templateReadyHandlerbc-video-4473784820001-1%22%5D&amp;wmode=opaque&amp;adServerURL=http%3A%2F%2Fpubads.g.doubleclick.net%2Fgampad%2Fads%3Fenv%3Dvp%26gdfp_req%3D1%26impl%3Ds%26output%3Dxml_vast2%26iu%3D%2F8484%2Ffort%2Fvideo_bc%2Ftech_bc%26sz%3D1000x1%26cust_params%3Dtags%253Dapple%252Cappletv%252Cappletvsiri%252Cnewappletv%252Cstevejobs%252Ctimcook%2526ch%253Dtech%2526topic%253Dappletv%26unviewed_position_start%3D1%26correlator%3Dtimestamp&amp;debuggerID=&amp;originalTemplateReadyHandler=Fortune_onTemplateReady&amp;startTime=1442052878505\" type=\"application\/x-shockwave-flash\" width=\"550\" height=\"310\"><\/object><\/div>\n<\/div>\n<div class=\"video-wrapper\" data-ratio=\"0.56363636363636\" data-pos=\"1\"><\/div>\n<\/div>\n<p><a href=\"http:\/\/fortune.com\/2015\/09\/10\/apple-tv-artificial-intelligence\/\">http:\/\/fortune.com\/2015\/09\/10\/apple-tv-artificial-intelligence\/<\/a><\/p>\n<\/div>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Voice recognition is great, but Siri isn\u2019t making movies \u2026 yet? If you watched, read or even heard about Apple\u2019s big event on Wednesday, chances are you noticed that the upcoming Apple TV upgrade sounds impressive. One of its big draws is that users can search for shows and control the experience using their voice &hellip; <a href=\"https:\/\/aireligion.org\/?p=109\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Apple TV highlights how far AI has come \u2014 and how far it has to go<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[2],"tags":[],"_links":{"self":[{"href":"https:\/\/aireligion.org\/index.php?rest_route=\/wp\/v2\/posts\/109"}],"collection":[{"href":"https:\/\/aireligion.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aireligion.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aireligion.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/aireligion.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=109"}],"version-history":[{"count":1,"href":"https:\/\/aireligion.org\/index.php?rest_route=\/wp\/v2\/posts\/109\/revisions"}],"predecessor-version":[{"id":110,"href":"https:\/\/aireligion.org\/index.php?rest_route=\/wp\/v2\/posts\/109\/revisions\/110"}],"wp:attachment":[{"href":"https:\/\/aireligion.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=109"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aireligion.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=109"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aireligion.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=109"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}