|
|
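Lecturer: Professor John E. Hopcroft is a renowned professor of computer science at Cornell University and the recipient of the 1986 Turing Award. He is a member of the US National Academy of Sciences and the National Academy of Engineering, and a fellow of the American Academy of Arts and Sciences. He has worked at Princeton University, Cornell University, and Stanford University, and has served at scientific bodies such as the National Science Foundation and the National Research Council. In 1992 he was appointed by President Bush to the National Science Board, which oversees the National Science Foundation, and he served through May 1998. In 2005 he received the IEEE Harry Goode Memorial Award, and in 2007 the Computing Research Association's Distinguished Service Award. |
Since 2011, Professor Hopcroft has been invited to teach students at Shanghai Jiao Tong University. Over these six years he has devoted great effort to student training and faculty development at the university. In recognition of his outstanding contributions to education and research in China, he was awarded the Chinese Government Friendship Award on the eve of National Day 2016. The award was presented by Premier Li Keqiang and Vice Premiers Zhang Gaoli and Ma Kai, and on that occasion he offered advice on China's research and education. In the 2017 summer term he teaches this course, which is open to students across the university. |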
|
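Course description: This course introduces the foundations of modern data science, with an emphasis on machine learning. A central question of the coming information age is how to use computers to understand and extract useful information from massive data. The course aims to prepare students for applications and research in computing over the next forty years. Its emphasis shifts from traditional discrete mathematics toward probability, statistics, and numerical methods. |
Prerequisites: Discrete Mathematics, Probability Theory, Linear Algebra, Mathematical Analysis |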
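Syllabus:
The course moves from basic to advanced material, covering the theoretical foundations of modern data science and some frontier topics, including: 1. Probability theory 2. High-dimensional data 3. Singular value decomposition 4. Random graphs 5. Markov chains and Monte Carlo 6. Machine learning 7. Data streams 8. Other related topics |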
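Textbook: Foundations of Data Science, by Avrim Blum, John Hopcroft, and Ravindran Kannan, 2017. Chinese edition: 《信息时代的计算机科学理论》. |
Time and location: Weeks 19-22 (June 26 to July 23, 2017), ShangYuan 500, Shanghai Jiao Tong University. Classes meet Monday, Tuesday, Wednesday, and Thursday, 10:00-11:40. |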
Homework HW1: 2.1 2.5 2.11 2.12 2.14 (Due: July 3) HW2: 2.15 2.16 2.21 2.27, and either 2.33 or 2.38 (Due: July 5) HW3: 3.5 3.8 3.11 3.13 3.18 3.27 (optional) (Due: July 10) HW4: 4.2 4.3 4.6 4.7 4.8 (Due: July 12) HW5: 4.16 4.20 (part 2) 4.23 4.25 4.47 (Due: July 17) HW6: 5.5 5.9 5.10 5.11 HW7: 6.1 6.3 6.8 6.9 6.10 (Due: July 21) |
Homework answers: HW1 HW2 HW3 HW4 HW5 HW7 |
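Announcements June 27: The lectures on high dimensional space and the first lecture on SVD will be completed on June 28. June 28: Sections 3.6-3.8 will be completed on June 29. June 30: The Markov chain lectures will be completed on July 3. Final grade: attendance (20%) + homework (40%) + midterm (20%) + final (20%). July 4: Starting July 5, the class moves to ShangYuan 100. The midterm exam will be held on July 6. July 19: The final exam will be held on July 21, 10:00-12:00, in ShangYuan 100. Students who need help before the exam should email the teaching assistants directly. The final exam mainly covers material taught after the midterm, including MCMC. July 21: The notebooks for the last homework are in Room East 309, Building 3 (电群3号楼东309). Students who need them may collect them there. |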
Teaching Assistants: Huan Long (龙环; Email: longhuan@sjtu.edu.cn) Yajing Chen (陈雅静; Email: cyj907@sjtu.edu.cn) Hailun Yan (闫海伦; Email: m_yanhailun@126.com) Youjing Lu (陆尤静; Email: 15215601516@163.com) Zhangxuan Gu (顾章轩; Email: zhangxgu@126.com) Yuanyuan Li (李元媛; Email: 412890161@qq.com) Kaihua Sun (孙凯华; Email: skh199139@163.com) Yun Guo (郭韵; Email: 18818262874@163.com) Tianfan Fu (符天凡; Email: futianfan@gmail.com) Nan Hao (郝楠; Email: 1107569284@sjtu.edu.cn) |
|
|
Lecturer: Prof. John E. Hopcroft was honored with the A. M. Turing Award in 1986. He is a member of the National Academy of Sciences (NAS) and the National Academy of Engineering (NAE), and a fellow of the American Academy of Arts and Sciences (AAAS), the American Association for the Advancement of Science, the Institute of Electrical and Electronics Engineers (IEEE), and the Association for Computing Machinery (ACM). In 1992, he was appointed by President Bush to the National Science Board (NSB), which oversees the National Science Foundation (NSF), and served through May 1998. From 1995 to 1998, Hopcroft served on the National Research Council's Commission on Physical Sciences, Mathematics, and Applications. |
In addition to these appointments, Hopcroft serves as a member of the Scientific Advisory Committee for the David and Lucile Packard Fellowships in Science and Engineering, the SIAM financial management committee, the IIIT New Delhi advisory board, the technical advisory board of Microsoft Research Asia, the Engineering Advisory Board of Seattle University, and the program committee for the Chile Millennium Science Initiative. In late 2016, he was honored with the Chinese Government Friendship Award for his contributions to education and research in China. |
Description: While traditional areas of computer science remain highly important, researchers of the future will increasingly be involved with using computers to understand and extract usable information from massive data arising in applications, not just with making computers useful on specific well-defined problems. This course intends to cover the theory likely to be useful in the next 40 years, just as automata theory, algorithms, and related topics gave students an advantage in the last 40 years. One of the major changes is the switch from discrete mathematics to more of an emphasis on probability, statistics, and numerical methods. |
Prerequisites: Discrete Mathematics, Probability Theory, Linear Algebra, Mathematical Analysis |
Syllabus: 1. The probabilistic method 2. High dimensional space 3. Singular value decomposition 4. Random graphs 5. Markov chain Monte Carlo 6. Machine learning 7. Data streams 8. Other topics |
Textbook: Foundations of Data Science, by Avrim Blum, John Hopcroft, and Ravindran Kannan, 2017. Chinese edition: 《信息时代的计算机科学理论》 (Computer Science Theory for the Information Age). |
Time and Location: Weeks 19-22 (June 26th to July 23rd), ShangYuan 500. Classes meet Monday, Tuesday, Wednesday, and Thursday, 10:00-11:40. |
Homework HW1: 2.1 2.5 2.11 2.12 2.14 (Due: July 3rd) HW2: 2.15 2.16 2.21 2.27, and either 2.33 or 2.38 (Due: July 5th) HW3: 3.5 3.8 3.11 3.13 3.18 3.27 (optional) (Due: July 10th) HW4: 4.2 4.3 4.6 4.7 4.8 (Due: July 12th) HW5: 4.16 4.20 (part 2) 4.23 4.25 4.47 (Due: July 17th) HW6: 5.5 5.9 5.10 5.11 HW7: 6.1 6.3 6.8 6.9 6.10 (Due: July 21st) |
Homework Answers: HW1 HW2 HW3 HW4 HW5 HW7 |
Announcements June 27th: High dimensional space and the first lecture on SVD will be completed on June 28th. June 28th: Sections 3.6-3.8 will be completed on June 29th. June 30th: Markov chains will be completed on July 3rd. Final grade: attendance (20%) + homework (40%) + midterm (20%) + final (20%). July 4th: Starting July 5th, the class moves to ShangYuan 100. The midterm exam will be held on July 6th. July 19th: The final exam will be held on July 21st (10:00 am-12:00 pm) in ShangYuan 100. If you have questions before the exam, please contact the TAs by e-mail. The final exam mainly covers material taught after the midterm, including MCMC. July 21st: The notebooks for the last homework are in Room East 309, Building 3 (电群3号楼东309). Students who need them may collect them there. |
Teaching Assistants: Huan Long (Email: longhuan@sjtu.edu.cn) Yajing Chen (Email: cyj907@sjtu.edu.cn) Hailun Yan (Email: m_yanhailun@126.com) Youjing Lu (Email: 15215601516@163.com) Zhangxuan Gu (Email: zhangxgu@126.com) Yuanyuan Li (Email: 412890161@qq.com) Kaihua Sun (Email: skh199139@163.com) Yun Guo (Email: 18818262874@163.com) Tianfan Fu (Email: futianfan@gmail.com) Nan Hao (Email: 1107569284@sjtu.edu.cn) |