计算视觉:改变日常生活的关键技术

需积分: 10 5 下载量 15 浏览量 更新于2024-07-26 收藏 22.94MB PDF 举报
“Computer Vision in Daily Life”讨论了计算视觉在日常生活中的应用,包括广告、社交媒体、人机交互、生物识别、监控、军事智能、机器人、电子遗产、生物医学成像、娱乐、遥感、移动通信和智能视觉计算等多个领域。该文由香港中文大学信息工程系的Xiaoou Tang撰写。 计算机视觉是一种让机器“看”并理解周围环境的技术。在这个领域,研究主要集中在三个方面:面部识别、图像处理和视频分析。 1. 面部识别:在照片中自动识别人脸(Face Detection),通过聚类算法(Cluster Annotation)将相似的脸分组以便标记(Face Clustering)。这在社交媒体上的照片标签、生物识别系统以及安全监控中都有广泛应用。 2. 图像处理:传统的基于文本的图像搜索往往结果模糊,例如搜索“苹果”可能得到各种与苹果公司、苹果产品或水果相关的图片。为了解决这个问题,可以使用多特征搜索(Search Photos using Multiple Features),结合语义签名(Semantic Signatures)进行更精确的图像检索。 3. 视频分析:高阶照片质量评估(High-Level Photo Quality Assessment)是自动评价照片质量的一种方法,有助于优化图像呈现,如通过一键重排名(One-click Re-Ranking)和语义签名进行照片的再排序,提供更愉悦的视觉体验。 这些技术在日常生活中扮演着重要角色,例如,广告行业利用计算机视觉进行精准投放;社交媒体上,自动人脸检测和标签功能方便用户管理照片;在人机交互中,面部识别技术被用于解锁设备或身份验证;生物医学成像则在医疗诊断中发挥关键作用;而遥感和移动通信则依赖于计算机视觉来获取和解析地理信息。 总结来说,计算视觉不仅改变了我们查找和分享信息的方式,还在安全监控、个人隐私保护、智能设备交互等方面产生了深远影响。随着技术的不断发展,计算视觉将在更多领域带来创新和变革,比如自动驾驶、智能家居、虚拟现实等,持续改变我们的日常生活。
2017-12-21 上传
Humans perceive the three-dimensional structure of the world with apparent ease. However, despite all of the recent advances in computer vision research, the dream of having a computer interpret an image at the same level as a two-year old remains elusive. Why is computer vision such a challenging problem and what is the current state of the art? 'Computer Vision: Algorithms and Applications' explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos. More than just a source of 'recipes,' this exceptionally authoritative and comprehensive textbook/reference also takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene. These problems are also analyzed using statistical models and solved using rigorous engineering techniques. Topics and features: Structured to support active curricula and project-oriented courses, with tips in the Introduction for using the book in a variety of customized courses; Presents exercises at the end of each chapter with a heavy emphasis on testing algorithms and containing numerous suggestions for small mid-term projects; Provides additional material and more detailed mathematical topics in the Appendices, which cover linear algebra, numerical techniques, and Bayesian estimation theory; Suggests additional reading at the end of each chapter, including the latest research in each sub-field, in addition to a full Bibliography at the end of the book; and, Supplies supplementary course material for students at the associated website. Suitable for an upper-level undergraduate or graduate-level course in computer science or engineering, this textbook focuses on basic techniques that work under real-world conditions and encourages students to push their creative boundaries. Its design and exposition also make it eminently suitable as a unique reference to the fundamental techniques and current research literature in computer vision.