author | Hui Lan <lanhui@zjnu.edu.cn> | 2019-04-14 21:10:25 +0800 |
---|---|---|
committer | Hui Lan <lanhui@zjnu.edu.cn> | 2019-04-14 21:10:25 +0800 |
commit | 90739f99b9c9e8d40daaf601e63d734bd396b24a (patch) | |
tree | 3efdf389406ef2bfe07f9f1ed708eb533c667e9c | |
parent | 72339b6b3668f85868bdae96e292e91ed7dd2871 (diff) |
Fixed untracked files
-rw-r--r-- | LectureNotesOnPython.html | 1699 |
1 files changed, 0 insertions, 1699 deletions
diff --git a/LectureNotesOnPython.html b/LectureNotesOnPython.html deleted file mode 100644 index aff0a09..0000000 --- a/LectureNotesOnPython.html +++ /dev/null @@ -1,1699 +0,0 @@ -<?xml version="1.0" encoding="utf-8" ?> -<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> -<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"> -<head> -<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> -<meta name="generator" content="Docutils 0.14: http://docutils.sourceforge.net/" /> -<title>Lecture Notes on Python</title> -<meta name="authors" content="蓝珲 (lanhui AT zjnu.edu.cn)" /> -<style type="text/css"> - -/* -:Author: David Goodger (goodger@python.org) -:Id: $Id: html4css1.css 7952 2016-07-26 18:15:59Z milde $ -:Copyright: This stylesheet has been placed in the public domain. - -Default cascading style sheet for the HTML output of Docutils. - -See http://docutils.sf.net/docs/howto/html-stylesheets.html for how to -customize this style sheet. -*/ - -/* used to remove borders from tables and images */ -.borderless, table.borderless td, table.borderless th { - border: 0 } - -table.borderless td, table.borderless th { - /* Override padding for "table.docutils td" with "! important". - The right padding separates the table cells. */ - padding: 0 0.5em 0 0 ! important } - -.first { - /* Override more specific margin styles with "! important". */ - margin-top: 0 ! important } - -.last, .with-subtitle { - margin-bottom: 0 ! 
important } - -.hidden { - display: none } - -.subscript { - vertical-align: sub; - font-size: smaller } - -.superscript { - vertical-align: super; - font-size: smaller } - -a.toc-backref { - text-decoration: none ; - color: black } - -blockquote.epigraph { - margin: 2em 5em ; } - -dl.docutils dd { - margin-bottom: 0.5em } - -object[type="image/svg+xml"], object[type="application/x-shockwave-flash"] { - overflow: hidden; -} - -/* Uncomment (and remove this text!) to get bold-faced definition list terms -dl.docutils dt { - font-weight: bold } -*/ - -div.abstract { - margin: 2em 5em } - -div.abstract p.topic-title { - font-weight: bold ; - text-align: center } - -div.admonition, div.attention, div.caution, div.danger, div.error, -div.hint, div.important, div.note, div.tip, div.warning { - margin: 2em ; - border: medium outset ; - padding: 1em } - -div.admonition p.admonition-title, div.hint p.admonition-title, -div.important p.admonition-title, div.note p.admonition-title, -div.tip p.admonition-title { - font-weight: bold ; - font-family: sans-serif } - -div.attention p.admonition-title, div.caution p.admonition-title, -div.danger p.admonition-title, div.error p.admonition-title, -div.warning p.admonition-title, .code .error { - color: red ; - font-weight: bold ; - font-family: sans-serif } - -/* Uncomment (and remove this text!) to get reduced vertical space in - compound paragraphs. 
-div.compound .compound-first, div.compound .compound-middle { - margin-bottom: 0.5em } - -div.compound .compound-last, div.compound .compound-middle { - margin-top: 0.5em } -*/ - -div.dedication { - margin: 2em 5em ; - text-align: center ; - font-style: italic } - -div.dedication p.topic-title { - font-weight: bold ; - font-style: normal } - -div.figure { - margin-left: 2em ; - margin-right: 2em } - -div.footer, div.header { - clear: both; - font-size: smaller } - -div.line-block { - display: block ; - margin-top: 1em ; - margin-bottom: 1em } - -div.line-block div.line-block { - margin-top: 0 ; - margin-bottom: 0 ; - margin-left: 1.5em } - -div.sidebar { - margin: 0 0 0.5em 1em ; - border: medium outset ; - padding: 1em ; - background-color: #ffffee ; - width: 40% ; - float: right ; - clear: right } - -div.sidebar p.rubric { - font-family: sans-serif ; - font-size: medium } - -div.system-messages { - margin: 5em } - -div.system-messages h1 { - color: red } - -div.system-message { - border: medium outset ; - padding: 1em } - -div.system-message p.system-message-title { - color: red ; - font-weight: bold } - -div.topic { - margin: 2em } - -h1.section-subtitle, h2.section-subtitle, h3.section-subtitle, -h4.section-subtitle, h5.section-subtitle, h6.section-subtitle { - margin-top: 0.4em } - -h1.title { - text-align: center } - -h2.subtitle { - text-align: center } - -hr.docutils { - width: 75% } - -img.align-left, .figure.align-left, object.align-left, table.align-left { - clear: left ; - float: left ; - margin-right: 1em } - -img.align-right, .figure.align-right, object.align-right, table.align-right { - clear: right ; - float: right ; - margin-left: 1em } - -img.align-center, .figure.align-center, object.align-center { - display: block; - margin-left: auto; - margin-right: auto; -} - -table.align-center { - margin-left: auto; - margin-right: auto; -} - -.align-left { - text-align: left } - -.align-center { - clear: both ; - text-align: center } - -.align-right { - 
text-align: right } - -/* reset inner alignment in figures */ -div.align-right { - text-align: inherit } - -/* div.align-center * { */ -/* text-align: left } */ - -.align-top { - vertical-align: top } - -.align-middle { - vertical-align: middle } - -.align-bottom { - vertical-align: bottom } - -ol.simple, ul.simple { - margin-bottom: 1em } - -ol.arabic { - list-style: decimal } - -ol.loweralpha { - list-style: lower-alpha } - -ol.upperalpha { - list-style: upper-alpha } - -ol.lowerroman { - list-style: lower-roman } - -ol.upperroman { - list-style: upper-roman } - -p.attribution { - text-align: right ; - margin-left: 50% } - -p.caption { - font-style: italic } - -p.credits { - font-style: italic ; - font-size: smaller } - -p.label { - white-space: nowrap } - -p.rubric { - font-weight: bold ; - font-size: larger ; - color: maroon ; - text-align: center } - -p.sidebar-title { - font-family: sans-serif ; - font-weight: bold ; - font-size: larger } - -p.sidebar-subtitle { - font-family: sans-serif ; - font-weight: bold } - -p.topic-title { - font-weight: bold } - -pre.address { - margin-bottom: 0 ; - margin-top: 0 ; - font: inherit } - -pre.literal-block, pre.doctest-block, pre.math, pre.code { - margin-left: 2em ; - margin-right: 2em } - -pre.code .ln { color: grey; } /* line numbers */ -pre.code, code { background-color: #eeeeee } -pre.code .comment, code .comment { color: #5C6576 } -pre.code .keyword, code .keyword { color: #3B0D06; font-weight: bold } -pre.code .literal.string, code .literal.string { color: #0C5404 } -pre.code .name.builtin, code .name.builtin { color: #352B84 } -pre.code .deleted, code .deleted { background-color: #DEB0A1} -pre.code .inserted, code .inserted { background-color: #A3D289} - -span.classifier { - font-family: sans-serif ; - font-style: oblique } - -span.classifier-delimiter { - font-family: sans-serif ; - font-weight: bold } - -span.interpreted { - font-family: sans-serif } - -span.option { - white-space: nowrap } - -span.pre { - 
white-space: pre } - -span.problematic { - color: red } - -span.section-subtitle { - /* font-size relative to parent (h1..h6 element) */ - font-size: 80% } - -table.citation { - border-left: solid 1px gray; - margin-left: 1px } - -table.docinfo { - margin: 2em 4em } - -table.docutils { - margin-top: 0.5em ; - margin-bottom: 0.5em } - -table.footnote { - border-left: solid 1px black; - margin-left: 1px } - -table.docutils td, table.docutils th, -table.docinfo td, table.docinfo th { - padding-left: 0.5em ; - padding-right: 0.5em ; - vertical-align: top } - -table.docutils th.field-name, table.docinfo th.docinfo-name { - font-weight: bold ; - text-align: left ; - white-space: nowrap ; - padding-left: 0 } - -/* "booktabs" style (no vertical lines) */ -table.docutils.booktabs { - border: 0px; - border-top: 2px solid; - border-bottom: 2px solid; - border-collapse: collapse; -} -table.docutils.booktabs * { - border: 0px; -} -table.docutils.booktabs th { - border-bottom: thin solid; - text-align: left; -} - -h1 tt.docutils, h2 tt.docutils, h3 tt.docutils, -h4 tt.docutils, h5 tt.docutils, h6 tt.docutils { - font-size: 100% } - -ul.auto-toc { - list-style-type: none } - -</style> -</head> -<body> -<div class="document" id="lecture-notes-on-python"> -<h1 class="title">Lecture Notes on Python</h1> -<table class="docinfo" frame="void" rules="none"> -<col class="docinfo-name" /> -<col class="docinfo-content" /> -<tbody valign="top"> -<tr><th class="docinfo-name">Authors:</th> -<td>蓝珲 (lanhui AT zjnu.edu.cn)</td></tr> -<tr><th class="docinfo-name">Version:</th> -<td>0.1.2 of 2019-04-14</td></tr> -</tbody> -</table> -<div class="contents topic" id="id1"> -<p class="topic-title first">Contents</p> -<ul class="simple"> -<li><a class="reference internal" href="#id2" id="id25">Preface</a></li> -<li><a class="reference internal" href="#python" id="id26">Correcting the Pronunciation of Python</a></li> -<li><a class="reference internal" href="#id3" id="id27">The Origins of Python</a></li> -<li><a class="reference internal" href="#id4" 
id="id28">Python Keywords</a></li> -<li><a class="reference internal" href="#id5" id="id29">Types of Values</a></li> -<li><a class="reference internal" href="#variable" id="id30">Variables</a></li> -<li><a class="reference internal" href="#mutable" id="id31">Mutable and Immutable Types</a></li> -<li><a class="reference internal" href="#id6" id="id32">Numbers and Formatted Output</a></li> -<li><a class="reference internal" href="#strings" id="id33">Strings</a></li> -<li><a class="reference internal" href="#concatenation" id="id34">String Concatenation</a></li> -<li><a class="reference internal" href="#slice" id="id35">Substrings (Slices)</a></li> -<li><a class="reference internal" href="#id7" id="id36">Searching a String</a></li> -<li><a class="reference internal" href="#string" id="id37">Methods of the String Class (Object)</a></li> -<li><a class="reference internal" href="#in" id="id38">The in Operator</a></li> -<li><a class="reference internal" href="#id8" id="id39">String Comparison</a></li> -<li><a class="reference internal" href="#id9" id="id40">Lists</a></li> -<li><a class="reference internal" href="#id10" id="id41">Lists as Arguments</a></li> -<li><a class="reference internal" href="#append" id="id42">Note the Difference Between <tt class="docutils literal">append</tt> and the <tt class="docutils literal">+</tt> Operator</a><ul> -<li><a class="reference internal" href="#tdd-test-driven-development" id="id43">TDD - Test-driven Development</a></li> -<li><a class="reference internal" href="#id11" id="id44">Computational Complexity</a></li> -</ul> -</li> -<li><a class="reference internal" href="#dictionary" id="id45">Dictionaries</a><ul> -<li><a class="reference internal" href="#incremental-development" id="id46">Incremental Development</a></li> -<li><a class="reference internal" href="#keyvalue" id="id47">Swapping Keys and Values</a></li> -<li><a class="reference internal" href="#id12" id="id48">Dictionaries Can Contain Dictionaries</a></li> -</ul> -</li> -<li><a class="reference internal" href="#id13" id="id49">Functions</a><ul> -<li><a class="reference internal" href="#id14" id="id50">Local Variables</a></li> -<li><a class="reference internal" href="#id15" 
id="id51">Global Variables</a></li> -<li><a class="reference internal" href="#id16" id="id52">Calling Functions and Passing Arguments</a></li> -<li><a class="reference internal" href="#flow-of-execution" id="id53">Flow of Execution</a></li> -</ul> -</li> -<li><a class="reference internal" href="#id17" id="id54">Files</a></li> -<li><a class="reference internal" href="#id18" id="id55">Sorting</a><ul> -<li><a class="reference internal" href="#id19" id="id56">Sorting a List of Numbers or Strings</a></li> -<li><a class="reference internal" href="#id20" id="id57">Writing Your Own Sorting Algorithms</a><ul> -<li><a class="reference internal" href="#id21" id="id58">Selection Sort</a></li> -<li><a class="reference internal" href="#merge-sort" id="id59">Merge Sort</a></li> -</ul> -</li> -<li><a class="reference internal" href="#id22" id="id60">Comparing Sorting Speeds</a></li> -<li><a class="reference internal" href="#id23" id="id61">Sorting a List of Tuples</a></li> -<li><a class="reference internal" href="#lambda" id="id62">Flexible Sorting with lambda Functions</a></li> -</ul> -</li> -<li><a class="reference internal" href="#id24" id="id63">References</a></li> -</ul> -</div> -<div class="section" id="id2"> -<h1><a class="toc-backref" href="#id25">Preface</a></h1> -<p>Not written by a pedant; no empty prose.</p> -<p>Python has a concise syntax and a comprehensive, powerful library; programs are quick to write and do not run slowly either.</p> -<p>At university, it is usually experts who teach beginners. Experts were once beginners too, but they often -forget this. Bear in mind that a beginner may have written fewer than 10 lines of code, whereas an expert has written at least -10000. Their brains are wired differently. Learning is hard, so education and training should slow the -expert's workflow down 100 times! 
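What a beginner should do is reach 1000 lines of their own code as quickly as possible, -including the countless rounds of debugging along the way; each round is a small lesson. To feel comfortable, -there is no shortcut other than accumulating your own experience.</p> -<p>Faced with something new, beginners tend to ask these questions (ask the student's name):</p> -<ul class="simple"> -<li>What does this thing mean?</li> -<li>Where does it come from?</li> -<li>What is it for?</li> -<li>How is it used?</li> -<li>When should it be used?</li> -<li>Why does the program need it?</li> -</ul> -<p>A dynamic way of teaching works better: the expert and the beginner sit at the computer together and work through the questions above step by step, until -the beginner says “I get it.”</p> -</div> -<div class="section" id="python"> -<h1><a class="toc-backref" href="#id26">Correcting the Pronunciation of Python</a></h1> -<p>Chinese speakers commonly pronounce th as s. That is not quite correct.</p> -<p>ˈpī-ˌthän , -thən <a class="reference external" href="https://cn.bing.com/search?q=define%20python&tf=U2VydmljZT1EaWN0aW9uYXJ5QW5zd2VyVjIgU2NlbmFyaW89RGVmaW5pdGlvblNjZW5hcmlvIFBvc2l0aW9uPU5PUCBSYW5raW5nRGF0YT1UcnVlIEZvcmNlUGxhY2U9RmFsc2UgUGFpcnM9RGljdGlvbmFyeVdvcmQ6cHl0aG9uO3NjbjpEZWZpbml0aW9uU2NlbmFyaW87cDpRQVM7IHw%3d&hs=hyRBF0mYq9hrfQUq66DIZnFVta1ZGRfBiBks25oUguk%3d">pronunciation</a></p> -</div> -<div class="section" id="id3"> -<h1><a class="toc-backref" href="#id27">The Origins of Python</a></h1> -<p>Guido van Rossum, the father of Python, is Dutch and was born in 1956. In 1982 he received a -master's degree in mathematics and computer science from the University of Amsterdam. He had previously worked on the ABC language. -He designed the Python language in 1989.</p> -<p>Python has a concise syntax and a large, comprehensive and useful standard library.</p> -<p>Natural languages. Characteristics: ambiguity and redundancy. “The penny dropped.” “不要。” (“Don't.” or “I don't want it.”)</p> -<p>Formal languages. Characteristic: only the literal meaning counts.</p> -<p>Computer organization in brief: CPU, bus, memory, hard disk.</p> -<p>Conversions between bit, byte, KB, MB, GB and TB.</p> -<p>Naming variables. For example, for a tiled strategy (层叠策略), should the name be CDCL (the pinyin initials) or TiledStrategy?</p> -<p>The academic administration system, <a class="reference external" href="http://10.1.70.164/jwglxt">http://10.1.70.164/jwglxt</a>?</p> -<p>Languages such as 习语言 and 易语言 are currently outside the mainstream.</p> -<p>The simplest class definition:</p> -<blockquote> -<dl class="docutils"> -<dt>class A:</dt> -<dd>pass</dd> -</dl> -</blockquote> -<p>Using the class above as a blueprint, create an instance: a = A(). This a cannot do anything, though.</p> -<p>Running a Python file from the command line: python a.py.</p> -<p>The three elements of a function header: def, the function name, and the parameter list:</p> -<blockquote> -<div class="line-block"> -<div class="line">def add_number(a, b):</div> -<div class="line-block"> -<div class="line">return a + b</div> -</div> -</div> -<div class="line-block"> -<div class="line">def add_lst(a, b):</div> -<div class="line-block"> -<div class="line">if len(a) != len(b):</div> -<div 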
class="line-block"> -<div class="line">return 'ERROR: a and b not in equal length.'</div> -</div> -<div class="line">n = len(a)</div> -<div class="line">result = []</div> -<div class="line">for i in range(n):</div> -<div class="line-block"> -<div class="line">result.append(a[i] + b[i])</div> -</div> -<div class="line">return result</div> -<div class="line"><br /></div> -</div> -<div class="line">print( add_lst([1,2,3],[-1,-2,-3]) )</div> -</div> -</blockquote> -</div> -<div class="section" id="id4"> -<h1><a class="toc-backref" href="#id28">Python Keywords</a></h1> -<div class="line-block"> -<div class="line">def pass</div> -<div class="line">from import</div> -<div class="line">False True</div> -<div class="line">in</div> -<div class="line">None</div> -<div class="line">class</div> -<div class="line">return</div> -<div class="line">while for</div> -<div class="line">continue break</div> -<div class="line">and or not</div> -<div class="line">if else elif</div> -<div class="line">try except finally raise</div> -<div class="line">lambda nonlocal</div> -<div class="line">del global with</div> -<div class="line">yield assert</div> -<div class="line">as is</div> -</div> -<p>Keywords are reserved by the language and cannot be used as variable names.</p> -</div> -<div class="section" id="id5"> -<h1><a class="toc-backref" href="#id29">Types of Values</a></h1> -<p>Every value is an object. a = 5, help(a) a.bit_length()</p> -<p>Numbers. 1, 1.,1.1, .1, 1e1, 1e-1, 1E1, 1E-1</p> -<dl class="docutils"> -<dt>Strings. 'hello', 100 * 'hello', 'hello' * 100, 'Weight is %4.2f kg' % (70.2)</dt> -<dd>f = open('a.html') -s = f.read() -f.close()</dd> -<dt>Lists. ['a', 'b', 'c', 'd']</dt> -<dd>['bob', 170, 'john', '180'] -[1, 2, 3, 4] -range(10) returns a range object, which the list function can turn into a list. -It is equivalent to range(0, 10, 1): start at 0, step by 1, and exclude 10. -A list of lists -A list of tuples -A list of objects</dd> -</dl> -<p>Tuples and dictionaries (dict).</p> -</div> -<div class="section" id="variable"> -<h1><a class="toc-backref" href="#id30">Variables</a></h1> -<p>A variable is a name: a name that points to a value.</p> 
-<p>The value is stored at some address in memory.</p> -<p>Choose short, meaningful names where possible. For example, use n for a count and i, j, k for indices.</p> -<p>Keywords cannot be used as variable names.</p> -<p>A value lives somewhere in memory, and it keeps a count of how many variables point to it.</p> -<p>To save space, when several variables have the same value they sometimes all point to that one value (rather than each variable getting its own separately allocated copy).</p> -<p>This technique is called interning, but it does not always happen.</p> -<div class="line-block"> -<div class="line">a = 10</div> -<div class="line">b = 10</div> -<div class="line">c = 10</div> -<div class="line">id(a), id(b), id(c)</div> -<div class="line">(8791229060416, 8791229060416, 8791229060416)</div> -</div> -<p>The value 10 is stored at address 8791229060416, and all three variables a, b and c point to this address.</p> -<div class="line-block"> -<div class="line">x = 257</div> -<div class="line">y = 257</div> -<div class="line">id(x), id(y)</div> -<div class="line">(46487024, 46487952)</div> -</div> -<p>Here x and y have the same value, yet the two values are stored at different memory addresses.</p> -<div class="line-block"> -<div class="line">s1 = 'hello'</div> -<div class="line">s2 = 'hello'</div> -<div class="line">id(s1), id(s2)</div> -<div class="line">s1 == s2</div> -<div class="line">s1 is s2</div> -</div> -<div class="line-block"> -<div class="line">s1 = 'h' * 100</div> -<div class="line">s2 = 'h' * 100</div> -<div class="line">id(s1), id(s2)</div> -</div> -<div class="line-block"> -<div class="line">s3 = 'hello, world!'</div> -<div class="line">s4 = 'hello, world!'</div> -<div class="line">id(s3), id(s4)</div> -<div class="line">(46703536, 46705136)</div> -</div> -<div class="line-block"> -<div class="line">class A:</div> -<div class="line-block"> -<div class="line">pass</div> -</div> -</div> -<div class="line-block"> -<div class="line">a = A()</div> -<div class="line">b = A()</div> -<div class="line">a</div> -<div class="line"><__main__.A object at 0x0000000002CD92E8></div> -<div class="line">b</div> -<div class="line"><__main__.A object at 0x0000000002CD9240></div> -</div> -<div class="line-block"> -<div class="line">x = [1,2,3]</div> -<div class="line">id(x)</div> -<div class="line">46869512</div> -<div class="line">y = x</div> -<div class="line">id(y)</div> -<div class="line">46869512</div> -<div 
class="line">x.append(4)</div> -<div class="line">x</div> -<div class="line">[1, 2, 3, 4]</div> -<div class="line">y</div> -<div class="line">[1, 2, 3, 4]</div> -</div> -<div class="line-block"> -<div class="line">x = []</div> -<div class="line">id(x)</div> -<div class="line">46869640</div> -</div> -<div class="line-block"> -<div class="line">x = [1,2,3,4]</div> -<div class="line">y = [1,2,3,4]</div> -<div class="line">id(x)</div> -<div class="line">46869768</div> -<div class="line">id(y)</div> -<div class="line">46868808</div> -</div> -<p>A value that no name refers to any longer is removed from memory.</p> -<p>Further reading:</p> -<ul class="simple"> -<li><a class="reference external" href="http://foobarnbaz.com/2012/07/08/understanding-python-variables/">http://foobarnbaz.com/2012/07/08/understanding-python-variables/</a></li> -<li><a class="reference external" href="https://stackoverflow.com/questions/19721002/is-a-variable-the-name-the-value-or-the-memory-location">https://stackoverflow.com/questions/19721002/is-a-variable-the-name-the-value-or-the-memory-location</a></li> -</ul> -</div> -<div class="section" id="mutable"> -<h1><a class="toc-backref" href="#id31">Mutable and Immutable Types</a></h1> -<p>Strings are an immutable type: they cannot be changed in place at their original memory address.</p> -<p>After a = 'hello', the in-place modification a[0] = 'H' is not allowed. To change the value of a, reassign it: a = 'Hello'.</p> -<p>Lists are a mutable type: they can be changed in place at their original memory address.</p> -<p>After a = [1, 2], the in-place modification a[0] = 2 is allowed.</p> -<p>Further reading:</p> -<ul class="simple"> -<li><a class="reference external" href="https://stackoverflow.com/questions/8056130/immutable-vs-mutable-types">https://stackoverflow.com/questions/8056130/immutable-vs-mutable-types</a></li> -</ul> -<p>Expression: a combination of values, variables and operators.</p> -<blockquote> -<div class="line-block"> -<div class="line">17</div> -<div class="line">n + 2</div> -</div> -</blockquote> -<p>Statement: a piece of code that creates a variable or displays information.</p> -<blockquote> -<div class="line-block"> -<div class="line">n = 17</div> -<div class="line">print(n)</div> -</div> -</blockquote> -</div> -<div class="section" id="id6"> -<h1><a class="toc-backref" 
href="#id32">Numbers and Formatted Output</a></h1> -<blockquote> -<div class="line-block"> -<div class="line">x = 3.1415926</div> -</div> -<div class="line-block"> -<div class="line">print('%4.0f' % (x))</div> -<div class="line">print('%4.1f' % (x))</div> -<div class="line">print('%4.2f' % (x))</div> -<div class="line">print('%4.3f' % (x))</div> -<div class="line">print('%4.4f' % (x))</div> -</div> -<div class="line-block"> -<div class="line">print('%6.0f' % (x))</div> -<div class="line">print('%6.1f' % (x))</div> -<div class="line">print('%6.2f' % (x))</div> -<div class="line">print('%6.3f' % (x))</div> -<div class="line">print('%6.4f' % (x))</div> -</div> -<div class="line-block"> -<div class="line">print('%.0f' % (x))</div> -<div class="line">print('%.1f' % (x))</div> -<div class="line">print('%.2f' % (x))</div> -<div class="line">print('%.3f' % (x))</div> -<div class="line">print('%.4f' % (x))</div> -<div class="line">print('%.5f' % (x))</div> -<div class="line">print('%.6f' % (x))</div> -<div class="line">print('%.7f' % (x))</div> -<div class="line">print('%.8f' % (x))</div> -<div class="line">print('%.9f' % (x))</div> -<div class="line">print('%.15f' % (x))</div> -<div class="line">print('%.16f' % (x))</div> -<div class="line">print('%.17f' % (x))</div> -<div class="line">print('%.18f' % (x))</div> -</div> -<div class="line-block"> -<div class="line">print('%4.f' % (x))</div> -<div class="line">print('%5.f' % (x))</div> -<div class="line">print('%6.f' % (x))</div> -<div class="line">print('%7.f' % (x))</div> -<div class="line">print('%8.f' % (x))</div> -</div> -<div class="line-block"> -<div class="line">print('%f' % (x))</div> -</div> -</blockquote> -</div> -<div class="section" id="strings"> -<h1><a class="toc-backref" href="#id33">Strings</a></h1> -<p>A string is made up of characters.</p> -<div class="line-block"> -<div class="line">fruit = 'banana!'</div> -<div class="line">first_letter = fruit[0]</div> -<div class="line">second_letter = fruit[1]</div> -</div> 
-<p>Indices start at 0, so 1 refers to the second character. Only integers may be used.</p> -<p>Negative integers count from the end of the string. For example, fruit[-1] is the last character of fruit.</p> -<div class="line-block"> -<div class="line">i = 1</div> -<div class="line">fruit[i]</div> -<div class="line">fruit[i+1]</div> -</div> -<p>The len() function returns the number of characters in a string: len(fruit).</p> -<div class="line-block"> -<div class="line">L = len(fruit)</div> -<div class="line">fruit[L-1] is the last character, equivalent to fruit[-1].</div> -</div> -<p>Traversing a string.</p> -<blockquote> -<div class="line-block"> -<div class="line">fruit = 'banana'</div> -<div class="line">for c in fruit:</div> -<div class="line-block"> -<div class="line">print(c)</div> -</div> -</div> -</blockquote> -<p>Traversing in reverse.</p> -<blockquote> -<div class="line-block"> -<div class="line">fruit = 'banana'</div> -<div class="line">for i in range(len(fruit)-1,-1,-1):</div> -<div class="line-block"> -<div class="line">print(fruit[i])</div> -</div> -</div> -<div class="line-block"> -<div class="line">fruit = 'banana'</div> -<div class="line">for c in fruit[::-1]: # [start:stop:step]</div> -<div class="line-block"> -<div class="line">print(c)</div> -</div> -</div> -<div class="line-block"> -<div class="line">fruit = 'banana'</div> -<div class="line">for c in ''.join(reversed(fruit)):</div> -<div class="line-block"> -<div class="line">print(c)</div> -</div> -</div> -</blockquote> -<p>Above, <tt class="docutils literal"># [start:stop:step]</tt> is a comment; comments begin with the <tt class="docutils literal">#</tt> sign.</p> -</div> -<div class="section" id="concatenation"> -<h1><a class="toc-backref" href="#id34">String Concatenation</a></h1> -<p>Print Jack, Kack, Lack, Mack, Nack, Ouack, Pack, and Quack.</p> -<div class="line-block"> -<div class="line">prefixes = 'JKLMNOPQ'</div> -<div class="line">suffix = 'ack'</div> -<div class="line">for c in prefixes:</div> -<div class="line-block"> -<div class="line">if c == 'O' or c == 'Q':</div> -<div class="line-block"> -<div class="line">print(c + 'u' + suffix)</div> -</div> -<div class="line">else:</div> -<div class="line-block"> -<div 
class="line">print(c + suffix)</div> -</div> -</div> -</div> -</div> -<div class="section" id="slice"> -<h1><a class="toc-backref" href="#id35">Substrings (Slices)</a></h1> -<p>s[n:m], where n or m may be omitted. -The slice includes the character at index n and excludes the character at index m (indices start at 0).</p> -<div class="line-block"> -<div class="line">s = 'Monty Python'</div> -<div class="line">s[0:5]</div> -<div class="line">s[6:12]</div> -<div class="line">s[:5]</div> -<div class="line">s[6:]</div> -<div class="line">s[:]</div> -</div> -<p>Usually n is less than m. If n is greater than or equal to m, the result is an empty string.</p> -<p>An empty string has length 0.</p> -<p>Strings are immutable: an existing string cannot be changed.</p> -<div class="line-block"> -<div class="line">greeting = 'Hello, world!'</div> -<div class="line">greeting[0] = 'J' # raises TypeError</div> -</div> -<div class="line-block"> -<div class="line">greeting = 'Hello, world!'</div> -<div class="line">new_greeting = 'J' + greeting[1:]</div> -</div> -</div> -<div class="section" id="id7"> -<h1><a class="toc-backref" href="#id36">Searching a String</a></h1> -<div class="line-block"> -<div class="line">def find(word, c):</div> -<div class="line-block"> -<div class="line">i = 0</div> -<div class="line">while i < len(word):</div> -<div class="line-block"> -<div class="line">if word[i] == c:</div> -<div class="line-block"> -<div class="line">return i</div> -</div> -<div class="line">i = i + 1</div> -</div> -<div class="line">return -1</div> -</div> -</div> -<div class="line-block"> -<div class="line">print(find('banana', 'a'))</div> -</div> -<p>Exercise 1: add a third parameter that sets the index at which the search starts.</p> -<p>Exercise 2: add a third parameter that sets the direction in which to search.</p> -<p>String objects have a built-in find method.</p> -<p>Count how many times a given character occurs in a string.</p> -<p>Exercise: do this using the three-parameter find above.</p> -</div> -<div class="section" id="string"> -<h1><a class="toc-backref" href="#id37">Methods of the String Class (Object)</a></h1> -<div class="line-block"> -<div class="line">upper()</div> -<div class="line">lower()</div> -</div> -<p>Calling a method: invocation/call</p> -<div class="line-block"> -<div class="line">word.find('na')</div> -<div class="line">word.find('na', 3)</div> -<div class="line">name.find('b', 1, 2)</div> -</div> -</div> -<div class="section" id="in"> -<h1><a 
class="toc-backref" href="#id38">The in Operator</a></h1> -<p>'a' in 'banana' -'seed' in 'banana'</p> -<p>Exercise: write a function such that -in_both('apples', 'oranges') returns 'aes'.</p> -</div> -<div class="section" id="id8"> -<h1><a class="toc-backref" href="#id39">String Comparison</a></h1> -<p>Alphabetical (lexicographic) order. Uppercase letters sort before lowercase letters.</p> -<p>Strings can be compared with the following operators:</p> -<div class="line-block"> -<div class="line">==</div> -<div class="line">>, >=</div> -<div class="line"><, <=</div> -</div> -<p>Exercise: write a function is_reverse such that is_reverse('god', 'dog') returns True.</p> -<p>Two implementations of the find_from function. Spot the error and earn 1 bonus point.</p> -<p>A string is an object.</p> -<p>The essence of an object - data construct.</p> -<p>Computational complexity.</p> -<p>Define a function on the spot that generates a password of length at least 4.</p> -</div> -<div class="section" id="id9"> -<h1><a class="toc-backref" href="#id40">Lists</a></h1> -<p>A built-in type of the language. Compare with strings: indices also start at 0, and the in operator, taking the length, slicing and traversal all work similarly.</p> -<blockquote> -<div class="line-block"> -<div class="line">[ ]</div> -<div class="line">[10, 20, 30, 40]</div> -<div class="line">['crunchy frog', 'ram bladder', 'lark vomit']</div> -</div> -</blockquote> -<p>The elements of a list need not be of the same type: <tt class="docutils literal">['spam', 2.0, 5, [10, 20]]</tt></p> -<p>The list [10,20] sits inside another list; this is called a nested list.</p> -<p>What is the length of ['spam', 1, ['Brie', 'Roquefort', 'Pol le Veq'], [1, 2, 3]]?</p> -<p>Lists are a mutable type: their values can change in place (note the contrast with strings).</p> -<p>IndexError</p> -<p>Traversal</p> -<dl class="docutils"> -<dt>for cheese in cheeses:</dt> -<dd>print(cheese)</dd> -<dt>for i in range(len(numbers)):</dt> -<dd>numbers[i] = numbers[i] * 2</dd> -<dt>for x in []:</dt> -<dd>print('This never happens.')</dd> -</dl> -<!-- Discuss the data-entry problem in software engineering accreditation. --> -<p>The <tt class="docutils literal">+</tt> operator concatenates; the <tt class="docutils literal">*</tt> operator repeats.</p> -<p>List methods</p> -<blockquote> -<p>append</p> -<p>extend</p> -<p>sort</p> -<p>t = ['d', 'c', 'e', 'b', 'a']</p> -<p>t.sort() # Question: what value does t.sort() return?</p> -<p>t</p> -</blockquote> -<p>sum - a reduce operation: turns several values into one value</p> -<p>map: turns several values into several other values</p> -<dl class="docutils"> -<dt>def f(x):</dt> -<dd>return 2*x</dd> -</dl> -<p>list(map(f, [1, 2]))</p> 
-<p>filter: selects, from several values, those that satisfy a condition.</p> -<blockquote> -<div class="line-block"> -<div class="line">def f(x):</div> -<div class="line-block"> -<div class="line">if x % 2 == 0:</div> -<div class="line-block"> -<div class="line">return True</div> -</div> -<div class="line">return False</div> -</div> -</div> -<div class="line-block"> -<div class="line">list(filter(f, [1,2,3,4]))</div> -</div> -</blockquote> -<p>pop</p> -<blockquote> -<div class="line-block"> -<div class="line">t = ['a', 'b', 'c']</div> -<div class="line">x = t.pop(1) # pop can be called without an argument; which value does it return then?</div> -</div> -</blockquote> -<p>del</p> -<blockquote> -<div class="line-block"> -<div class="line">t = ['a', 'b', 'c']</div> -<div class="line">del t[1]</div> -</div> -<div class="line-block"> -<div class="line">t = ['a', 'b', 'c', 'd', 'e', 'f']</div> -<div class="line">del t[1:5]</div> -</div> -</blockquote> -<p>remove</p> -<blockquote> -<div class="line-block"> -<div class="line">t = ['a', 'b', 'c']</div> -<div class="line">t.remove('b')</div> -</div> -</blockquote> -<p>split</p> -<blockquote> -<div class="line-block"> -<div class="line">list_of_characters = list('spam')</div> -<div class="line">list_of_words = 'spam should be filtered'.split()</div> -<div class="line">list_of_words = 'spam-should-be-filtered'.split('-')</div> -</div> -</blockquote> -<p>The join method</p> -<blockquote> -<div class="line-block"> -<div class="line">','.join(['1','2','3'])</div> -</div> -<div class="line-block"> -<div class="line">a = 'banana'</div> -<div class="line">b = 'banana'</div> -<div class="line">a is b # do a and b point to the same value?</div> -<div class="line">a == b</div> -</div> -<div class="line-block"> -<div class="line">a = [1, 2, 3]</div> -<div class="line">b = [1, 2, 3]</div> -<div class="line">a is b # not identical: a and b are not the same object</div> -<div class="line">a == b # equivalent: they have the same values</div> -</div> -</blockquote> -<p>Aliasing</p> -<p>a = [1, 2, 3] -b = a -b is a</p> -<p>The association of a variable name with an object is called a reference. 
a and b are two references to [1,2,3]. -Because [1,2,3] is mutable, a change made to it through a also affects the value seen through b. -This is error-prone.</p> -</div> -<div class="section" id="id10"> -<h1><a class="toc-backref" href="#id41">Lists as Arguments</a></h1> -<blockquote> -<div class="line-block"> -<div class="line">def delete_head(t):</div> -<div class="line-block"> -<div class="line">del t[0]</div> -</div> -</div> -<div class="line-block"> -<div class="line">letters = ['a', 'b', 'c']</div> -<div class="line">delete_head(letters) # letters and t point to the same list object.</div> -<div class="line">letters</div> -</div> -</blockquote> -</div> -<div class="section" id="append"> -<h1><a class="toc-backref" href="#id42">Note the Difference Between <tt class="docutils literal">append</tt> and the <tt class="docutils literal">+</tt> Operator</a></h1> -<blockquote> -<div class="line-block"> -<div class="line">t1 = [1, 2]</div> -<div class="line">t2 = t1.append(3)</div> -<div class="line">t1</div> -<div class="line">[1, 2, 3]</div> -<div class="line">t2</div> -</div> -<div class="line-block"> -<div class="line">t3 = t1 + [4]</div> -<div class="line">t3</div> -<div class="line">[1, 2, 3, 4]</div> -<div class="line">t1</div> -<div class="line">[1, 2, 3]</div> -</div> -</blockquote> -<p>Note the difference between the following two functions:</p> -<blockquote> -<dl class="docutils"> -<dt>def bad_delete_head(t):</dt> -<dd>t = t[1:] # WRONG!</dd> -<dt>def tail(t):</dt> -<dd>return t[1:]</dd> -</dl> -</blockquote> -<div class="section" id="tdd-test-driven-development"> -<h2><a class="toc-backref" href="#id43">TDD - Test-driven Development</a></h2> -<p>Test-driven development. My favourite. It is exciting and challenging. It helps clarify requirements. It helps you write the code.</p> -<p>pytest is recommended. How do you install it? 
Use the command <tt class="docutils literal">pip install pytest</tt>.</p> -<p>Write test cases for the function below in <tt class="docutils literal">test_cases.py</tt>, then run them from the command line: <tt class="docutils literal">python <span class="pre">-m</span> pytest test_cases.py</tt>.</p> -<pre class="code python literal-block"> -<span class="comment single"># Copyright (c) Hui Lan 2019</span> - -<span class="keyword namespace">import</span> <span class="name namespace">random</span> -<span class="keyword namespace">import</span> <span class="name namespace">string</span> - -<span class="keyword">def</span> <span class="name function">make_password</span><span class="punctuation">(</span><span class="name">n</span><span class="punctuation">):</span> - <span class="literal string doc">''' - Return a string of length n consisting of a combination of - letters, digits and special characters. Note that each password - must have at least one lower case letter, one upper case letter, - one digit and one special character. Return an empty string if n - is less than 4. 
- '''</span> - - <span class="keyword">if</span> <span class="name">n</span> <span class="operator"><</span> <span class="literal number integer">4</span><span class="punctuation">:</span> - <span class="keyword">return</span> <span class="literal string single">''</span> - - <span class="name">password</span> <span class="operator">=</span> <span class="name">random</span><span class="operator">.</span><span class="name">choice</span><span class="punctuation">(</span><span class="name">string</span><span class="operator">.</span><span class="name">ascii_lowercase</span><span class="punctuation">)</span> <span class="operator">+</span> \ - <span class="name">random</span><span class="operator">.</span><span class="name">choice</span><span class="punctuation">(</span><span class="name">string</span><span class="operator">.</span><span class="name">ascii_uppercase</span><span class="punctuation">)</span> <span class="operator">+</span> \ - <span class="name">random</span><span class="operator">.</span><span class="name">choice</span><span class="punctuation">(</span><span class="name">string</span><span class="operator">.</span><span class="name">digits</span><span class="punctuation">)</span> <span class="operator">+</span> \ - <span class="name">random</span><span class="operator">.</span><span class="name">choice</span><span class="punctuation">(</span><span class="name">string</span><span class="operator">.</span><span class="name">punctuation</span><span class="punctuation">)</span> <span class="operator">+</span> \ - <span class="literal string single">''</span><span class="operator">.</span><span class="name">join</span><span class="punctuation">([</span><span class="name">random</span><span class="operator">.</span><span class="name">choice</span><span class="punctuation">(</span><span class="name">string</span><span class="operator">.</span><span class="name">ascii_letters</span> <span class="operator">+</span> <span class="name">string</span><span 
class="operator">.</span><span class="name">digits</span> <span class="operator">+</span> <span class="name">string</span><span class="operator">.</span><span class="name">punctuation</span><span class="punctuation">)</span> <span class="keyword">for</span> <span class="name">i</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="name">n</span><span class="operator">-</span><span class="literal number integer">4</span><span class="punctuation">)])</span> - - <span class="keyword">return</span> <span class="literal string single">''</span><span class="operator">.</span><span class="name">join</span><span class="punctuation">(</span><span class="name">random</span><span class="operator">.</span><span class="name">sample</span><span class="punctuation">(</span><span class="name">password</span><span class="punctuation">,</span> <span class="name">n</span><span class="punctuation">))</span> <span class="comment single"># shuffle password then return</span> - - - - -<span class="keyword">if</span> <span class="name variable magic">__name__</span> <span class="operator">==</span> <span class="literal string single">'__main__'</span><span class="punctuation">:</span> - <span class="keyword">for</span> <span class="name">n</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="literal number integer">0</span><span class="punctuation">,</span><span class="literal number integer">20</span><span class="punctuation">):</span> - <span class="name">pwd</span> <span class="operator">=</span> <span class="name">make_password</span><span class="punctuation">(</span><span class="name">n</span><span class="punctuation">)</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">pwd</span><span class="punctuation">)</span> -</pre> -</div> -<div class="section" id="id11"> -<h2><a 
class="toc-backref" href="#id44">Computational complexity</a></h2> -<p>Complexity is expressed in Big O notation: O(n), O(n^2), O(n^3).</p> -<p>Review of the password lab.</p> -</div> -</div> -<div class="section" id="dictionary"> -<h1><a class="toc-backref" href="#id45">Dictionary</a></h1> -<p>A mutable data type.</p> -<p>Extremely useful in real-world development.</p> -<blockquote> -<div class="line-block"> -<div class="line">d = {} # or, equivalently, d = dict()</div> -</div> -<div class="line-block"> -<div class="line">d = {'hot':'热', 'cool':'凉', 'cold':'冷'}</div> -<div class="line">d['warm'] = '温'</div> -<div class="line">d['warm']</div> -<div class="line">d['freezing'] # KeyError</div> -<div class="line">len(d)</div> -</div> -<div class="line-block"> -<div class="line">'warm' in d</div> -<div class="line">'温' in d.values()</div> -</div> -</blockquote> -<p>key</p> -<p>value</p> -<p>key-value pair (item)</p> -<p>Before Python 3.7 the order of items was unpredictable and not necessarily the order of creation. Since Python 3.7 a dictionary preserves insertion order.</p> -<div class="section" id="incremental-development"> -<h2><a class="toc-backref" href="#id46">Incremental Development</a></h2> -<p>Complete a small piece at a time, working from easy to hard.</p> -<p>Exercise: given a string, count how many times each letter occurs.</p> -<pre class="code python literal-block"> -<span class="keyword">def</span> <span class="name function">histogram</span><span class="punctuation">(</span><span class="name">s</span><span class="punctuation">):</span> - <span class="literal string doc">''' Cannot pass any test cases. '''</span> - <span class="keyword">pass</span> - -<span class="keyword">def</span> <span class="name function">histogram</span><span class="punctuation">(</span><span class="name">s</span><span class="punctuation">):</span> - <span class="literal string doc">''' Can pass the test case in which s is an empty string. 
'''</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="keyword">return</span> <span class="name">d</span> - -<span class="keyword">def</span> <span class="name function">histogram</span><span class="punctuation">(</span><span class="name">s</span><span class="punctuation">):</span> - <span class="literal string doc">''' Can pass the test cases in which all characters in s are unique. '''</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="keyword">for</span> <span class="name">c</span> <span class="operator word">in</span> <span class="name">s</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">c</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="literal number integer">1</span> - <span class="keyword">return</span> <span class="name">d</span> - -<span class="keyword">def</span> <span class="name function">histogram</span><span class="punctuation">(</span><span class="name">s</span><span class="punctuation">):</span> - <span class="literal string doc">''' Can pass all test cases. 
'''</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="keyword">for</span> <span class="name">c</span> <span class="operator word">in</span> <span class="name">s</span><span class="punctuation">:</span> - <span class="keyword">if</span> <span class="name">c</span> <span class="operator word">not</span> <span class="operator word">in</span> <span class="name">d</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">c</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="literal number integer">1</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">c</span><span class="punctuation">]</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="keyword">return</span> <span class="name">d</span> - - -<span class="name">h</span> <span class="operator">=</span> <span class="name">histogram</span><span class="punctuation">(</span><span class="literal string single">'good'</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">h</span><span class="punctuation">)</span> -</pre> -<p>Exercise: given a string, count how many times each word occurs.</p> -<p>Exercise: given the text of a news article, count how many times each word occurs. Two requirements: (1) count only words found in an English dictionary; (2) strip any punctuation surrounding a word before counting it.</p> -<pre class="code python literal-block"> -<span class="comment single"># Copyright (C) 2019 Hui Lan</span> -<span class="comment single"># The following line fixes SyntaxError: Non-UTF-8 code starting with ...</span> -<span class="comment single"># coding=utf8</span> - -<span class="keyword">def</span> <span class="name function">file2lst</span><span class="punctuation">(</span><span class="name">fname</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a list where each element 
is a word from fname. '''</span> - <span class="name">L</span> <span class="operator">=</span> <span class="punctuation">[]</span> - <span class="name">f</span> <span class="operator">=</span> <span class="name builtin">open</span><span class="punctuation">(</span><span class="name">fname</span><span class="punctuation">)</span> - <span class="keyword">for</span> <span class="name">line</span> <span class="operator word">in</span> <span class="name">f</span><span class="punctuation">:</span> - <span class="name">line</span> <span class="operator">=</span> <span class="name">line</span><span class="operator">.</span><span class="name">strip</span><span class="punctuation">()</span> - <span class="name">lst</span> <span class="operator">=</span> <span class="name">line</span><span class="operator">.</span><span class="name">split</span><span class="punctuation">()</span> - <span class="keyword">for</span> <span class="name">x</span> <span class="operator word">in</span> <span class="name">lst</span><span class="punctuation">:</span> - <span class="name">L</span><span class="operator">.</span><span class="name">append</span><span class="punctuation">(</span><span class="name">x</span><span class="punctuation">)</span> - <span class="name">f</span><span class="operator">.</span><span class="name">close</span><span class="punctuation">()</span> - <span class="keyword">return</span> <span class="name">L</span> - - -<span class="keyword">def</span> <span class="name function">lst2dict</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a dictionary given list lst. Each key is an element in the lst. 
- The value is always 1.'''</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="keyword">for</span> <span class="name">w</span> <span class="operator word">in</span> <span class="name">lst</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">w</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="literal number integer">1</span> - <span class="keyword">return</span> <span class="name">d</span> - - -<span class="keyword namespace">import</span> <span class="name namespace">string</span> -<span class="keyword">def</span> <span class="name function">remove_punctuation</span><span class="punctuation">(</span><span class="name">s</span><span class="punctuation">):</span> - <span class="name">p</span> <span class="operator">=</span> <span class="literal string single">',.:’“”'</span> <span class="operator">+</span> <span class="name">string</span><span class="operator">.</span><span class="name">punctuation</span> - <span class="name">t</span> <span class="operator">=</span> <span class="literal string single">''</span> - <span class="keyword">for</span> <span class="name">c</span> <span class="operator word">in</span> <span class="name">s</span><span class="punctuation">:</span> - <span class="keyword">if</span> <span class="operator word">not</span> <span class="name">c</span> <span class="operator word">in</span> <span class="name">p</span><span class="punctuation">:</span> - <span class="name">t</span> <span class="operator">+=</span> <span class="name">c</span> - <span class="keyword">elif</span> <span class="name">c</span> <span class="operator">==</span> <span class="literal string single">'’'</span><span class="punctuation">:</span> <span class="comment single"># handle the case such as May’s</span> - <span class="keyword">return</span> <span class="name">t</span> - <span 
class="keyword">return</span> <span class="name">t</span> - -<span class="keyword">def</span> <span class="name function">word_frequency</span><span class="punctuation">(</span><span class="name">fname</span><span class="punctuation">,</span> <span class="name">english_dictionary</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a dictionary where each key is a word both in the file fname and in - the dictionary english_dictionary, and the corresponding value is the frequency - of that word.'''</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="name">L</span> <span class="operator">=</span> <span class="name">file2lst</span><span class="punctuation">(</span><span class="name">fname</span><span class="punctuation">)</span> - <span class="keyword">for</span> <span class="name">x</span> <span class="operator word">in</span> <span class="name">L</span><span class="punctuation">:</span> - <span class="name">x</span> <span class="operator">=</span> <span class="name">remove_punctuation</span><span class="punctuation">(</span><span class="name">x</span><span class="operator">.</span><span class="name">lower</span><span class="punctuation">())</span> - <span class="keyword">if</span> <span class="name">x</span> <span class="operator word">in</span> <span class="name">english_dictionary</span><span class="punctuation">:</span> - <span class="keyword">if</span> <span class="operator word">not</span> <span class="name">x</span> <span class="operator word">in</span> <span class="name">d</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">x</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="literal number integer">1</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span 
class="name">x</span><span class="punctuation">]</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="keyword">return</span> <span class="name">d</span> - - -<span class="keyword">def</span> <span class="name function">sort_by_value</span><span class="punctuation">(</span><span class="name">d</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a sorted list of tuples, each tuple containing a key and a value. - Note that the tuples are sorted in descending order of the value.'''</span> - <span class="keyword namespace">import</span> <span class="name namespace">operator</span> - <span class="name">lst</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">d</span><span class="operator">.</span><span class="name">items</span><span class="punctuation">(),</span> <span class="name">key</span><span class="operator">=</span><span class="name">operator</span><span class="operator">.</span><span class="name">itemgetter</span><span class="punctuation">(</span><span class="literal number integer">1</span><span class="punctuation">),</span> <span class="name">reverse</span><span class="operator">=</span><span class="name builtin pseudo">True</span><span class="punctuation">)</span> - <span class="keyword">return</span> <span class="name">lst</span> - - -<span class="keyword">if</span> <span class="name variable magic">__name__</span> <span class="operator">==</span> <span class="literal string single">'__main__'</span><span class="punctuation">:</span> - <span class="name">ed</span> <span class="operator">=</span> <span class="name">lst2dict</span><span class="punctuation">(</span><span class="name">file2lst</span><span class="punctuation">(</span><span class="literal string single">'words.txt'</span><span class="punctuation">))</span> <span class="comment single"># from 
http://greenteapress.com/thinkpython2/code/words.txt</span> - <span class="name">d</span> <span class="operator">=</span> <span class="name">word_frequency</span><span class="punctuation">(</span><span class="literal string single">'brexit-news.txt'</span><span class="punctuation">,</span> <span class="name">ed</span><span class="punctuation">)</span> - <span class="name">lst</span> <span class="operator">=</span> <span class="name">sort_by_value</span><span class="punctuation">(</span><span class="name">d</span><span class="punctuation">)</span> - <span class="keyword">for</span> <span class="name">x</span> <span class="operator word">in</span> <span class="name">lst</span><span class="punctuation">:</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'</span><span class="literal string interpol">%s</span><span class="literal string single"> (</span><span class="literal string interpol">%d</span><span class="literal string single">)'</span> <span class="operator">%</span> <span class="punctuation">(</span><span class="name">x</span><span class="punctuation">[</span><span class="literal number integer">0</span><span class="punctuation">],</span> <span class="name">x</span><span class="punctuation">[</span><span class="literal number integer">1</span><span class="punctuation">]))</span> -</pre> -<p>Exercise: rewrite the function <tt class="docutils literal">word_frequency</tt> so that it accepts a third parameter, <tt class="docutils literal">black_lst</tt>. <tt class="docutils literal">black_lst</tt> is a list of words to exclude from the count. For example, <tt class="docutils literal">black_lst</tt> could be <tt class="docutils literal">['the', 'and', 'of', 'to']</tt>.</p> -</div> -<div class="section" id="keyvalue"> -<h2><a class="toc-backref" href="#id47">Swapping keys and values</a></h2> -<p>Note that in the original dictionary one value may correspond to several keys. For example, in <tt class="docutils literal">d = <span class="pre">{'a':1,</span> <span class="pre">'b':2,</span> <span class="pre">'c':2}</span></tt> the value 2 corresponds to two keys, 'b' and 'c'.</p> -<pre class="code 
python literal-block"> -<span class="keyword">def</span> <span class="name function">inverse_dictionary</span><span class="punctuation">(</span><span class="name">d</span><span class="punctuation">):</span> - <span class="name">d2</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="keyword">for</span> <span class="name">k</span> <span class="operator word">in</span> <span class="name">d</span><span class="punctuation">:</span> - <span class="name">v</span> <span class="operator">=</span> <span class="name">d</span><span class="punctuation">[</span><span class="name">k</span><span class="punctuation">]</span> - <span class="keyword">if</span> <span class="operator word">not</span> <span class="name">v</span> <span class="operator word">in</span> <span class="name">d2</span><span class="punctuation">:</span> - <span class="name">d2</span><span class="punctuation">[</span><span class="name">v</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="punctuation">[</span><span class="name">k</span><span class="punctuation">]</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="name">d2</span><span class="punctuation">[</span><span class="name">v</span><span class="punctuation">]</span><span class="operator">.</span><span class="name">append</span><span class="punctuation">(</span><span class="name">k</span><span class="punctuation">)</span> - <span class="keyword">return</span> <span class="name">d2</span> - - - -<span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{</span><span class="literal string single">'a'</span><span class="punctuation">:</span><span class="literal number integer">1</span><span class="punctuation">,</span> <span class="literal string single">'b'</span><span class="punctuation">:</span><span class="literal number integer">2</span><span class="punctuation">,</span> <span class="literal string 
single">'c'</span><span class="punctuation">:</span><span class="literal number integer">2</span><span class="punctuation">}</span> -<span class="name">d2</span> <span class="operator">=</span> <span class="name">inverse_dictionary</span><span class="punctuation">(</span><span class="name">d</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">d2</span><span class="punctuation">)</span> -</pre> -<p>Exercise: use <tt class="docutils literal">inverse_dictionary</tt> to invert the <tt class="docutils literal">d</tt> produced above by <tt class="docutils literal">d = <span class="pre">word_frequency('brexit-news.txt',</span> ed)</tt>. Then display all the words in descending order of frequency, one line per frequency, each line listing every word with that frequency.</p> -<pre class="code python literal-block"> -<span class="name">d2</span> <span class="operator">=</span> <span class="name">inverse_dictionary</span><span class="punctuation">(</span><span class="name">d</span><span class="punctuation">)</span> -<span class="keyword">for</span> <span class="name">k</span> <span class="operator word">in</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">d2</span><span class="operator">.</span><span class="name">keys</span><span class="punctuation">(),</span> <span class="name">reverse</span><span class="operator">=</span><span class="name builtin pseudo">True</span><span class="punctuation">):</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'</span><span class="literal string interpol">%d</span><span class="literal string single"> </span><span class="literal string interpol">%s</span><span class="literal string single">'</span> <span class="operator">%</span> <span class="punctuation">(</span><span class="name">k</span><span class="punctuation">,</span> <span class="literal string single">' '</span><span class="operator">.</span><span class="name">join</span><span class="punctuation">(</span><span 
class="name">d2</span><span class="punctuation">[</span><span class="name">k</span><span class="punctuation">])))</span> -</pre> -<p>Exercise: use the <tt class="docutils literal">setdefault</tt> method to simplify the <tt class="docutils literal">inverse_dictionary</tt> function above (fewer lines).</p> -<pre class="code python literal-block"> -<span class="keyword">def</span> <span class="name function">inverse_dictionary</span><span class="punctuation">(</span><span class="name">d</span><span class="punctuation">):</span> - <span class="name">d2</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="keyword">for</span> <span class="name">k</span> <span class="operator word">in</span> <span class="name">d</span><span class="punctuation">:</span> - <span class="name">v</span> <span class="operator">=</span> <span class="name">d</span><span class="punctuation">[</span><span class="name">k</span><span class="punctuation">]</span> - <span class="name">d2</span><span class="operator">.</span><span class="name">setdefault</span><span class="punctuation">(</span><span class="name">v</span><span class="punctuation">,</span> <span class="punctuation">[])</span><span class="operator">.</span><span class="name">append</span><span class="punctuation">(</span><span class="name">k</span><span class="punctuation">)</span> - - <span class="keyword">return</span> <span class="name">d2</span> -</pre> -</div> -<div class="section" id="id12"> -<h2><a class="toc-backref" href="#id48">A dictionary can contain dictionaries</a></h2> -<pre class="code python literal-block"> -<span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{</span> <span class="literal string single">'john'</span><span class="punctuation">:{</span><span class="literal string single">'dob'</span><span class="punctuation">:</span><span class="literal string single">'1990-10-23'</span><span class="punctuation">,</span> <span class="literal string single">'height'</span><span class="punctuation">:</span><span class="literal 
string single">'6 feet 5 inches'</span><span class="punctuation">}</span> <span class="punctuation">}</span> -</pre> -</div> -</div> -<div class="section" id="id13"> -<h1><a class="toc-backref" href="#id49">Functions</a></h1> -<p>When we find ourselves copying and pasting the same code over and over, it is time to turn that code into a function.</p> -<p>Which of the functions <tt class="docutils literal">unique_words</tt> and <tt class="docutils literal">unique_words2</tt> runs faster?</p> -<pre class="code python literal-block"> -<span class="keyword">def</span> <span class="name function">unique_words</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">):</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{}</span> - <span class="keyword">for</span> <span class="name">x</span> <span class="operator word">in</span> <span class="name">lst</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">x</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="literal number integer">1</span> - <span class="keyword">return</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">d</span><span class="operator">.</span><span class="name">keys</span><span class="punctuation">())</span> - -<span class="keyword">def</span> <span class="name function">unique_words2</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">):</span> - <span class="keyword">return</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name builtin">list</span><span class="punctuation">(</span><span class="name builtin">set</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">)))</span> - - -<span class="name">N</span> <span class="operator">=</span> <span class="literal number integer">10000000</span> -<span class="keyword">print</span><span class="punctuation">(</span><span 
class="name">unique_words</span><span class="punctuation">([</span><span class="literal string single">'hello'</span><span class="punctuation">,</span> <span class="literal string single">'world'</span><span class="punctuation">,</span> <span class="literal string single">'am'</span><span class="punctuation">,</span> <span class="literal string single">'he'</span><span class="punctuation">]</span> <span class="operator">*</span> <span class="name">N</span><span class="punctuation">))</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">unique_words2</span><span class="punctuation">([</span><span class="literal string single">'hello'</span><span class="punctuation">,</span> <span class="literal string single">'world'</span><span class="punctuation">,</span> <span class="literal string single">'am'</span><span class="punctuation">,</span> <span class="literal string single">'he'</span><span class="punctuation">]</span> <span class="operator">*</span> <span class="name">N</span><span class="punctuation">))</span> -</pre> -<div class="section" id="id14"> -<h2><a class="toc-backref" href="#id50">Local variables</a></h2> -<p>A local variable is defined inside a function. When the function finishes executing, its local variables disappear.</p> -</div> -<div class="section" id="id15"> -<h2><a class="toc-backref" href="#id51">Global variables</a></h2> -<p>A global variable is defined outside any function, at module level. It is visible to (readable by) every function in the module. To assign a new value to a global variable inside a function, first declare it with <tt class="docutils literal">global</tt>.</p> -<pre class="code python literal-block"> -<span class="name">verbose</span> <span class="operator">=</span> <span class="name builtin pseudo">True</span> - -<span class="keyword">def</span> <span class="name function">example1</span><span class="punctuation">():</span> - <span class="keyword">if</span> <span class="name">verbose</span><span class="punctuation">:</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'Running example1'</span><span class="punctuation">)</span> - -<span class="keyword">def</span> <span 
class="name function">example2</span><span class="punctuation">():</span> - <span class="name">verbose</span> <span class="operator">=</span> <span class="name builtin pseudo">False</span> <span class="comment single"># a NEW local variable verbose</span> - <span class="keyword">if</span> <span class="name">verbose</span><span class="punctuation">:</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'Running example2'</span><span class="punctuation">)</span> - -<span class="keyword">def</span> <span class="name function">example3</span><span class="punctuation">():</span> - <span class="keyword">global</span> <span class="name">verbose</span> <span class="comment single"># I am actually going to use the global variable verbose; don't create a local one.</span> - <span class="name">verbose</span> <span class="operator">=</span> <span class="name builtin pseudo">False</span> - <span class="keyword">if</span> <span class="name">verbose</span><span class="punctuation">:</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'Running example3'</span><span class="punctuation">)</span> - - - -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">verbose</span><span class="punctuation">)</span> -<span class="name">example1</span><span class="punctuation">()</span> - -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">verbose</span><span class="punctuation">)</span> -<span class="name">example2</span><span class="punctuation">()</span> -<span class="name">example1</span><span class="punctuation">()</span> - -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">verbose</span><span class="punctuation">)</span> -<span class="name">example3</span><span class="punctuation">()</span> -<span class="name">example1</span><span class="punctuation">()</span> - -<span 
class="keyword">print</span><span class="punctuation">(</span><span class="name">verbose</span><span class="punctuation">)</span> -</pre> -<p>For a global list or dictionary, no <tt class="docutils literal">global</tt> declaration is needed if the function only modifies its contents rather than assigning a new object to the name.</p> -<pre class="code python literal-block"> -<span class="name">record</span> <span class="operator">=</span> <span class="punctuation">{</span><span class="literal string single">'s1'</span><span class="punctuation">:</span><span class="literal number integer">65</span><span class="punctuation">,</span> <span class="literal string single">'s2'</span><span class="punctuation">:</span><span class="literal number integer">60</span><span class="punctuation">}</span> - -<span class="keyword">def</span> <span class="name function">add_score</span><span class="punctuation">(</span><span class="name">student</span><span class="punctuation">,</span> <span class="name">score</span><span class="punctuation">):</span> - <span class="name">record</span><span class="punctuation">[</span><span class="name">student</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="name">score</span> - - -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">record</span><span class="punctuation">)</span> -<span class="name">add_score</span><span class="punctuation">(</span><span class="literal string single">'s3'</span><span class="punctuation">,</span> <span class="literal number integer">75</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">record</span><span class="punctuation">)</span> -</pre> -<p>Exercise: define a function <tt class="docutils literal">empty_dict</tt> that empties the dictionary <tt class="docutils literal">record</tt>. Requirement: do not use a <tt class="docutils literal">return</tt> statement. Hint: use the <tt class="docutils literal">pop</tt> method, or assign <tt class="docutils literal">{}</tt> directly to <tt class="docutils literal">record</tt> (the assignment requires a <tt class="docutils literal">global</tt> declaration).</p> -</div> -<div class="section" id="id16"> 
-<h2><a class="toc-backref" href="#id52">Calling functions and passing arguments</a></h2> -<p>Make sure a function has been defined before it is called.</p> -<p>Distinguish <tt class="docutils literal">argument</tt> from <tt class="docutils literal">parameter</tt>. What the caller passes is an <tt class="docutils literal">argument</tt>; the names listed in the function header are <tt class="docutils literal">parameter</tt>s. The value of each <tt class="docutils literal">argument</tt> is assigned to the corresponding <tt class="docutils literal">parameter</tt>, and a <tt class="docutils literal">parameter</tt> is a local variable of the function.</p> -<p>An <tt class="docutils literal">argument</tt> and its <tt class="docutils literal">parameter</tt> may have the same name or different names.</p> -<pre class="code python literal-block"> -<span class="keyword">def</span> <span class="name function">reverse_string</span><span class="punctuation">(</span><span class="name">s</span><span class="punctuation">):</span> - <span class="name">t</span> <span class="operator">=</span> <span class="literal string single">''</span> - <span class="keyword">for</span> <span class="name">i</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="name builtin">len</span><span class="punctuation">(</span><span class="name">s</span><span class="punctuation">)</span><span class="operator">-</span><span class="literal number integer">1</span><span class="punctuation">,</span><span class="operator">-</span><span class="literal number integer">1</span><span class="punctuation">,</span><span class="operator">-</span><span class="literal number integer">1</span><span class="punctuation">):</span> - <span class="name">t</span> <span class="operator">+=</span> <span class="name">s</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">]</span> - <span class="keyword">return</span> <span class="name">t</span> - - - -<span class="name">s</span> <span class="operator">=</span> <span class="literal string single">'put'</span> -<span class="name">t</span> <span class="operator">=</span> <span class="name">reverse_string</span><span 
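class="punctuation">(</span><span class="name">s</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">t</span><span class="punctuation">)</span> -</pre> -<p>In the code above there are two variables named s: the one inside the function is local (the parameter), and the one outside is global.</p> -<p>Likewise there are two variables named t: the one inside the function is local, and the one outside is global.</p> -</div> -<div class="section" id="flow-of-execution"> -<h2><a class="toc-backref" href="#id53">Flow of execution</a></h2> -<p>Defining a function does not execute its body; the body runs only when the function is called.</p> -<p>Statements execute in order. When a function call is reached, execution jumps to the function and runs its body; after the function returns, execution continues with the statement that follows the call.</p> -</div> -</div> -<div class="section" id="id17"> -<h1><a class="toc-backref" href="#id54">Files</a></h1> -<p>Most information is stored in files, so reading and writing files are among the most common operations. This section focuses on plain text files. Files with the following extensions are usually plain text: txt, csv, html, rst, md.</p> -<dl class="docutils"> -<dt>Lab: read the New York City baby name statistics file <a class="reference external" href="https://data.cityofnewyork.us/api/views/25th-nujf/rows.csv?accessType=DOWNLOAD">PopularBabyNames</a>.</dt> -<dd>Write a command-line program lookupname.py. Given a gender and an ethnicity, it prints the most popular names. -Command-line example: <tt class="docutils literal">python lookupname.py girl white top5</tt>. This command prints the 5 most popular names for white girls. -The first argument can be <tt class="docutils literal">girl/boy</tt>, the second can be <tt class="docutils literal">asian/white/black/hispanic</tt>. The third argument starts with <tt class="docutils literal">top</tt>, followed by the number of names to print; the number defaults to 1. The program below accepts these arguments in any order.</dd> -</dl> -<pre class="code python literal-block"> -<span class="comment single"># Copyright (C) 2019 Hui Lan</span> -<span class="comment single"># lanhui AT zjnu.edu.cn</span> -<span class="comment single"># Purpose: 1. Introduce command line argument parsing. 2. 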
Introduce nested dictionaries.</span> -<span class="comment single"># Usage:</span> -<span class="comment single"># python lookupname.py asian boy top10</span> -<span class="comment single"># python lookupname.py white girl top5</span> -<span class="comment single"># python lookupname.py girl white top</span> - - -<span class="keyword">def</span> <span class="name function">map</span><span class="punctuation">(</span><span class="name">x</span><span class="punctuation">):</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{</span><span class="literal string single">'FEMALE'</span><span class="punctuation">:</span><span class="literal string single">'girl'</span><span class="punctuation">,</span> <span class="literal string single">'MALE'</span><span class="punctuation">:</span><span class="literal string single">'boy'</span><span class="punctuation">,</span> <span class="literal string single">'ASIAN AND PACIFIC ISLANDER'</span><span class="punctuation">:</span><span class="literal string single">'asian'</span><span class="punctuation">,</span> <span class="literal string single">'ASIAN AND PACI'</span><span class="punctuation">:</span><span class="literal string single">'asian'</span><span class="punctuation">,</span> - <span class="literal string single">'BLACK NON HISPANIC'</span><span class="punctuation">:</span><span class="literal string single">'black'</span><span class="punctuation">,</span> <span class="literal string single">'BLACK NON HISP'</span><span class="punctuation">:</span><span class="literal string single">'black'</span><span class="punctuation">,</span> <span class="literal string single">'HISPANIC'</span><span class="punctuation">:</span><span class="literal string single">'hispanic'</span><span class="punctuation">,</span> <span class="literal string single">'WHITE NON HISPANIC'</span><span class="punctuation">:</span><span class="literal string single">'white'</span><span 
class="punctuation">,</span> <span class="literal string single">'WHITE NON HISP'</span><span class="punctuation">:</span><span class="literal string single">'white'</span><span class="punctuation">}</span> - <span class="keyword">return</span> <span class="name">d</span><span class="punctuation">[</span><span class="name">x</span><span class="punctuation">]</span> - - -<span class="keyword">def</span> <span class="name function">file2dict</span><span class="punctuation">(</span><span class="name">fname</span><span class="punctuation">):</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{}</span> <span class="comment single"># will be a nested dictionary, e.g., d[gender] = {'asian': {name: count, ...}, 'black': {...}, 'white': {...}, 'hispanic': {...}}</span> - <span class="name">f</span> <span class="operator">=</span> <span class="name builtin">open</span><span class="punctuation">(</span><span class="name">fname</span><span class="punctuation">)</span> - <span class="name">lines</span> <span class="operator">=</span> <span class="name">f</span><span class="operator">.</span><span class="name">readlines</span><span class="punctuation">()</span> - <span class="keyword">for</span> <span class="name">line</span> <span class="operator word">in</span> <span class="name">lines</span><span class="punctuation">[</span><span class="literal number integer">1</span><span class="punctuation">:]:</span> - <span class="name">line</span> <span class="operator">=</span> <span class="name">line</span><span class="operator">.</span><span class="name">strip</span><span class="punctuation">()</span> - <span class="name">lst</span> <span class="operator">=</span> <span class="name">line</span><span class="operator">.</span><span class="name">split</span><span class="punctuation">(</span><span class="literal string single">','</span><span class="punctuation">)</span> - <span class="name">gender</span> <span class="operator">=</span> <span class="name 
builtin">map</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">[</span><span class="literal number integer">1</span><span class="punctuation">])</span> - <span class="name">ethnicity</span> <span class="operator">=</span> <span class="name builtin">map</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">[</span><span class="literal number integer">2</span><span class="punctuation">])</span> - <span class="name">firstname</span> <span class="operator">=</span> <span class="name">lst</span><span class="punctuation">[</span><span class="literal number integer">3</span><span class="punctuation">]</span><span class="operator">.</span><span class="name">title</span><span class="punctuation">()</span> - <span class="name">count</span> <span class="operator">=</span> <span class="name builtin">int</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">[</span><span class="literal number integer">4</span><span class="punctuation">])</span> - <span class="keyword">if</span> <span class="operator word">not</span> <span class="name">gender</span> <span class="operator word">in</span> <span class="name">d</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">gender</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="punctuation">{</span><span class="name">ethnicity</span><span class="punctuation">:</span> <span class="punctuation">{</span><span class="name">firstname</span><span class="punctuation">:</span><span class="name">count</span><span class="punctuation">}}</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="keyword">if</span> <span class="operator word">not</span> <span class="name">ethnicity</span> <span class="operator word">in</span> <span class="name">d</span><span class="punctuation">[</span><span 
class="name">gender</span><span class="punctuation">]:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">gender</span><span class="punctuation">][</span><span class="name">ethnicity</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="punctuation">{</span><span class="name">firstname</span><span class="punctuation">:</span><span class="name">count</span><span class="punctuation">}</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="keyword">if</span> <span class="operator word">not</span> <span class="name">firstname</span> <span class="operator word">in</span> <span class="name">d</span><span class="punctuation">[</span><span class="name">gender</span><span class="punctuation">][</span><span class="name">ethnicity</span><span class="punctuation">]:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">gender</span><span class="punctuation">][</span><span class="name">ethnicity</span><span class="punctuation">][</span><span class="name">firstname</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="name">count</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="name">gender</span><span class="punctuation">][</span><span class="name">ethnicity</span><span class="punctuation">][</span><span class="name">firstname</span><span class="punctuation">]</span> <span class="operator">+=</span> <span class="name">count</span> - <span class="name">f</span><span class="operator">.</span><span class="name">close</span><span class="punctuation">()</span> - <span class="keyword">return</span> <span class="name">d</span> - - -<span class="keyword">def</span> <span class="name function">get_commandline_parameter</span><span class="punctuation">(</span><span class="name">lst</span><span 
class="punctuation">):</span> - <span class="name">d</span> <span class="operator">=</span> <span class="punctuation">{</span><span class="literal string single">'gender'</span><span class="punctuation">:</span><span class="literal string single">''</span><span class="punctuation">,</span> <span class="literal string single">'ethnicity'</span><span class="punctuation">:</span><span class="literal string single">''</span><span class="punctuation">,</span> <span class="literal string single">'top'</span><span class="punctuation">:</span><span class="literal number integer">1</span><span class="punctuation">}</span> - <span class="keyword">for</span> <span class="name">x</span> <span class="operator word">in</span> <span class="name">lst</span><span class="punctuation">:</span> - <span class="name">o</span> <span class="operator">=</span> <span class="name">x</span><span class="operator">.</span><span class="name">lower</span><span class="punctuation">()</span> - <span class="keyword">if</span> <span class="name">o</span> <span class="operator word">in</span> <span class="punctuation">[</span><span class="literal string single">'asian'</span><span class="punctuation">,</span> <span class="literal string single">'black'</span><span class="punctuation">,</span> <span class="literal string single">'white'</span><span class="punctuation">,</span> <span class="literal string single">'hispanic'</span><span class="punctuation">]:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="literal string single">'ethnicity'</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="name">o</span> - <span class="keyword">elif</span> <span class="name">o</span> <span class="operator word">in</span> <span class="punctuation">[</span><span class="literal string single">'girl'</span><span class="punctuation">,</span> <span class="literal string single">'boy'</span><span class="punctuation">]:</span> - <span 
class="name">d</span><span class="punctuation">[</span><span class="literal string single">'gender'</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="name">o</span> - <span class="keyword">elif</span> <span class="name">o</span> <span class="operator">==</span> <span class="literal string single">'top'</span><span class="punctuation">:</span> - <span class="keyword">pass</span> <span class="comment single"># use default value 1</span> - <span class="keyword">elif</span> <span class="literal string single">'top'</span> <span class="operator word">in</span> <span class="name">o</span><span class="punctuation">:</span> - <span class="name">d</span><span class="punctuation">[</span><span class="literal string single">'top'</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="name builtin">int</span><span class="punctuation">(</span><span class="name">o</span><span class="punctuation">[</span><span class="literal number integer">3</span><span class="punctuation">:])</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="keyword">raise</span> <span class="name exception">Exception</span><span class="punctuation">(</span><span class="literal string single">'Not recognised option </span><span class="literal string interpol">%s</span><span class="literal string single">'</span> <span class="operator">%</span> <span class="punctuation">(</span><span class="name">x</span><span class="punctuation">))</span> - <span class="keyword">return</span> <span class="name">d</span> - - -<span class="keyword">def</span> <span class="name function">sort_by_value</span><span class="punctuation">(</span><span class="name">d</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a sorted list of tuples, each tuple containing a key and a value. 
- Note that the tuples are sorted in descending order of the value.'''</span> - <span class="keyword namespace">import</span> <span class="name namespace">operator</span> - <span class="name">lst</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">d</span><span class="operator">.</span><span class="name">items</span><span class="punctuation">(),</span> <span class="name">key</span><span class="operator">=</span><span class="name">operator</span><span class="operator">.</span><span class="name">itemgetter</span><span class="punctuation">(</span><span class="literal number integer">1</span><span class="punctuation">),</span> <span class="name">reverse</span><span class="operator">=</span><span class="name builtin pseudo">True</span><span class="punctuation">)</span> - <span class="keyword">return</span> <span class="name">lst</span> - - -<span class="keyword namespace">import</span> <span class="name namespace">sys</span> -<span class="keyword">if</span> <span class="name variable magic">__name__</span> <span class="operator">==</span> <span class="literal string single">'__main__'</span><span class="punctuation">:</span> - <span class="name">d</span> <span class="operator">=</span> <span class="name">file2dict</span><span class="punctuation">(</span><span class="literal string single">'Popular_Baby_Names.csv'</span><span class="punctuation">)</span> - <span class="name">args</span> <span class="operator">=</span> <span class="name">get_commandline_parameter</span><span class="punctuation">(</span><span class="name">sys</span><span class="operator">.</span><span class="name">argv</span><span class="punctuation">[</span><span class="literal number integer">1</span><span class="punctuation">:])</span> - <span class="name">gender</span> <span class="operator">=</span> <span class="name">args</span><span class="punctuation">[</span><span class="literal string single">'gender'</span><span 
class="punctuation">]</span> - <span class="name">ethnicity</span> <span class="operator">=</span> <span class="name">args</span><span class="punctuation">[</span><span class="literal string single">'ethnicity'</span><span class="punctuation">]</span> - <span class="name">top</span> <span class="operator">=</span> <span class="name">args</span><span class="punctuation">[</span><span class="literal string single">'top'</span><span class="punctuation">]</span> - <span class="name">d2</span> <span class="operator">=</span> <span class="name">d</span><span class="punctuation">[</span><span class="name">gender</span><span class="punctuation">][</span><span class="name">ethnicity</span><span class="punctuation">]</span> - <span class="name">lst</span> <span class="operator">=</span> <span class="name">sort_by_value</span><span class="punctuation">(</span><span class="name">d2</span><span class="punctuation">)</span> - <span class="keyword">for</span> <span class="name">i</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="name builtin">min</span><span class="punctuation">(</span><span class="name">top</span><span class="punctuation">,</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">))):</span> <span class="comment single"># never index past the end of lst</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">][</span><span class="literal number integer">0</span><span class="punctuation">])</span> -</pre> -</div> -<div class="section" id="id18"> -<h1><a class="toc-backref" href="#id55">Sorting</a></h1> -<p>Sorting is a common and important operation: sorting by score, by file name, by file size, or by time.</p> -<p>Python's built-in <tt class="docutils literal">sorted</tt> covers most sorting needs.</p> -<div class="section" id="id19"> -<h2><a class="toc-backref" href="#id56">Sorting a list of numbers or strings</a></h2> -<p>To sort in descending order, pass <tt class="docutils literal">reverse=True</tt>.</p> -<pre class="code python literal-block"> -<span class="comment single"># Sort numbers</span> -<span class="keyword namespace">import</span> <span class="name 
namespace">random</span> -<span class="name">a</span> <span class="operator">=</span> <span class="punctuation">[</span><span class="name">random</span><span class="operator">.</span><span class="name">randint</span><span class="punctuation">(</span><span class="literal number integer">0</span><span class="punctuation">,</span><span class="literal number integer">100</span><span class="punctuation">)</span> <span class="keyword">for</span> <span class="name">i</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="literal number integer">5</span><span class="punctuation">)]</span> <span class="comment single"># a list of 5 random numbers between 0 and 100</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">)</span> - -<span class="name">sa_incr</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">sa_incr</span><span class="punctuation">)</span> - -<span class="name">sa_decr</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">,</span> <span class="name">reverse</span><span class="operator">=</span><span class="name builtin pseudo">True</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">sa_decr</span><span class="punctuation">)</span> - -<span class="comment single"># Sort a list of strings</span> -<span class="name">s</span> <span class="operator">=</span> <span class="literal string single">'D3.js is a JavaScript library for manipulating documents based on data. D3 helps you bring data to life using HTML, SVG, and CSS. 
D3’s emphasis on web standards gives you the full capabilities of modern browsers without tying yourself to a proprietary framework, combining powerful visualization components and a data-driven approach to DOM manipulation. https://d3js.org/'</span> -<span class="name">lst</span> <span class="operator">=</span> <span class="name builtin">list</span><span class="punctuation">(</span><span class="name builtin">set</span><span class="punctuation">(</span><span class="name">s</span><span class="operator">.</span><span class="name">split</span><span class="punctuation">()))</span> - -<span class="name">sa_incr</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">sa_incr</span><span class="punctuation">)</span> - -<span class="name">sa_decr</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="name">reverse</span><span class="operator">=</span><span class="name builtin pseudo">True</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">sa_decr</span><span class="punctuation">)</span> -</pre> -</div> -<div class="section" id="id20"> -<h2><a class="toc-backref" href="#id57">Implementing sorting algorithms</a></h2> -<p>To understand how sorting works, we look at two sorting algorithms.</p> -<div class="section" id="id21"> -<h3><a class="toc-backref" href="#id58">Selection sort</a></h3> -<p>Traverse the list; on each pass, move the smallest remaining element to the leftmost unsorted position.</p> -<pre class="code python literal-block"> -<span class="comment single"># Copyright (C) 2019 Hui Lan</span> -<span class="comment single"># lanhui AT zjnu.edu.cn</span> - -<span class="keyword">def</span> <span class="name function">swap</span><span class="punctuation">(</span><span class="name">L</span><span 
class="punctuation">,</span> <span class="name">i</span><span class="punctuation">,</span> <span class="name">j</span><span class="punctuation">):</span> - <span class="name">L</span><span class="punctuation">[</span><span class="name">j</span><span class="punctuation">],</span> <span class="name">L</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">]</span> <span class="operator">=</span> <span class="name">L</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">],</span> <span class="name">L</span><span class="punctuation">[</span><span class="name">j</span><span class="punctuation">]</span> - - -<span class="keyword">def</span> <span class="name function">selection_sort</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">):</span> - <span class="name">i</span> <span class="operator">=</span> <span class="literal number integer">0</span> - <span class="keyword">while</span> <span class="name">i</span> <span class="operator"><</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">):</span> - <span class="name">min_val</span> <span class="operator">=</span> <span class="name">L</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">]</span> - <span class="name">k</span> <span class="operator">=</span> <span class="name">j</span> <span class="operator">=</span> <span class="name">i</span> - <span class="keyword">while</span> <span class="name">j</span> <span class="operator"><</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">):</span> - <span class="keyword">if</span> <span class="name">L</span><span class="punctuation">[</span><span class="name">j</span><span class="punctuation">]</span> <span class="operator"><</span> <span 
class="name">min_val</span><span class="punctuation">:</span> - <span class="name">min_val</span> <span class="operator">=</span> <span class="name">L</span><span class="punctuation">[</span><span class="name">j</span><span class="punctuation">]</span> - <span class="name">k</span> <span class="operator">=</span> <span class="name">j</span> - <span class="name">j</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="name">swap</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">,</span> <span class="name">i</span><span class="punctuation">,</span> <span class="name">k</span><span class="punctuation">)</span> <span class="comment single"># will change L</span> - <span class="name">i</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="keyword">return</span> <span class="name">L</span> - -<span class="keyword">if</span> <span class="name variable magic">__name__</span> <span class="operator">==</span> <span class="literal string single">'__main__'</span><span class="punctuation">:</span> - - <span class="keyword namespace">import</span> <span class="name namespace">random</span> - <span class="keyword">for</span> <span class="name">n</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="literal number integer">10</span><span class="punctuation">):</span> - <span class="name">a</span> <span class="operator">=</span> <span class="punctuation">[</span><span class="name">random</span><span class="operator">.</span><span class="name">randint</span><span class="punctuation">(</span><span class="literal number integer">0</span><span class="punctuation">,</span><span class="literal number integer">100</span><span class="punctuation">)</span> <span class="keyword">for</span> <span class="name">i</span> <span class="operator word">in</span> <span class="name 
builtin">range</span><span class="punctuation">(</span><span class="name">n</span><span class="punctuation">)]</span> - <span class="name">sa</span> <span class="operator">=</span> <span class="name">selection_sort</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">)</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">sa</span><span class="punctuation">)</span> - <span class="keyword">assert</span> <span class="name">sa</span> <span class="operator">==</span> <span class="name">a</span> - <span class="keyword">assert</span> <span class="name">sa</span> <span class="operator">==</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">)</span> -</pre> -</div> -<div class="section" id="merge-sort"> -<h3><a class="toc-backref" href="#id59">Merge sort</a></h3> -<p>Split the list in two, sort each half, then merge the two sorted halves so that the merged result is also sorted. Note that the implementation below is recursive.</p> -<pre class="code python literal-block"> -<span class="comment single"># Copyright (C) 2019 Hui Lan</span> -<span class="comment single"># lanhui AT zjnu.edu.cn</span> - -<span class="keyword">def</span> <span class="name function">_merge</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">,</span> <span class="name">R</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a sorted list that combines the sorted list L and sorted list R.'''</span> - <span class="name">nL</span> <span class="operator">=</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">)</span> - <span class="name">nR</span> <span class="operator">=</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">R</span><span class="punctuation">)</span> - <span class="name">result</span> <span class="operator">=</span> <span 
class="punctuation">[]</span> - <span class="name">i</span> <span class="operator">=</span> <span class="name">j</span> <span class="operator">=</span> <span class="name">count</span> <span class="operator">=</span> <span class="literal number integer">0</span> - <span class="keyword">while</span> <span class="name">count</span> <span class="operator"><</span> <span class="name">nL</span> <span class="operator">+</span> <span class="name">nR</span><span class="punctuation">:</span> - <span class="keyword">if</span> <span class="name">i</span> <span class="operator">>=</span> <span class="name">nL</span> <span class="operator word">and</span> <span class="name">j</span> <span class="operator"><</span> <span class="name">nR</span><span class="punctuation">:</span> - <span class="name">result</span><span class="operator">.</span><span class="name">append</span><span class="punctuation">(</span><span class="name">R</span><span class="punctuation">[</span><span class="name">j</span><span class="punctuation">])</span> - <span class="name">j</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="keyword">elif</span> <span class="name">j</span> <span class="operator">>=</span> <span class="name">nR</span> <span class="operator word">and</span> <span class="name">i</span> <span class="operator"><</span> <span class="name">nL</span><span class="punctuation">:</span> - <span class="name">result</span><span class="operator">.</span><span class="name">append</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">])</span> - <span class="name">i</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="keyword">elif</span> <span class="name">L</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">]</span> <span class="operator"><</span> <span 
class="name">R</span><span class="punctuation">[</span><span class="name">j</span><span class="punctuation">]:</span> - <span class="name">result</span><span class="operator">.</span><span class="name">append</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">])</span> - <span class="name">i</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="name">result</span><span class="operator">.</span><span class="name">append</span><span class="punctuation">(</span><span class="name">R</span><span class="punctuation">[</span><span class="name">j</span><span class="punctuation">])</span> - <span class="name">j</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="name">count</span> <span class="operator">+=</span> <span class="literal number integer">1</span> - <span class="keyword">return</span> <span class="name">result</span> - - -<span class="keyword">def</span> <span class="name function">merge_sort</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">):</span> - <span class="keyword">if</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">)</span> <span class="operator"><=</span> <span class="literal number integer">1</span><span class="punctuation">:</span> - <span class="keyword">return</span> <span class="name">L</span> - <span class="keyword">else</span><span class="punctuation">:</span> - <span class="name">i</span> <span class="operator">=</span> <span class="name builtin">int</span><span class="punctuation">(</span><span class="name builtin">len</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">)</span><span class="operator">/</span><span 
class="literal number integer">2</span><span class="punctuation">)</span> - <span class="name">l</span> <span class="operator">=</span> <span class="name">merge_sort</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">[:</span><span class="name">i</span><span class="punctuation">])</span> - <span class="name">r</span> <span class="operator">=</span> <span class="name">merge_sort</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">[</span><span class="name">i</span><span class="punctuation">:])</span> - <span class="keyword">return</span> <span class="name">_merge</span><span class="punctuation">(</span><span class="name">l</span><span class="punctuation">,</span> <span class="name">r</span><span class="punctuation">)</span> - -<span class="keyword">if</span> <span class="name variable magic">__name__</span> <span class="operator">==</span> <span class="literal string single">'__main__'</span><span class="punctuation">:</span> - - <span class="keyword namespace">import</span> <span class="name namespace">random</span> - <span class="keyword">for</span> <span class="name">n</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="literal number integer">100</span><span class="punctuation">):</span> - <span class="name">a</span> <span class="operator">=</span> <span class="punctuation">[</span><span class="name">random</span><span class="operator">.</span><span class="name">randint</span><span class="punctuation">(</span><span class="literal number integer">0</span><span class="punctuation">,</span><span class="literal number integer">100</span><span class="punctuation">)</span> <span class="keyword">for</span> <span class="name">i</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="name">n</span><span class="punctuation">)]</span> - 
<span class="name">sa</span> <span class="operator">=</span> <span class="name">merge_sort</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">)</span> - <span class="keyword">assert</span> <span class="name">sa</span> <span class="operator">==</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">)</span> -</pre> -</div> -</div> -<div class="section" id="id22"> -<h2><a class="toc-backref" href="#id60">Comparing sorting speed</a></h2> -<p>Sorting is a core algorithm in Python, so the built-in implementation has been optimized again and again.</p> -<p>Python's built-in sort is the fastest of the three, and <tt class="docutils literal">selection_sort</tt> is the slowest.</p> -<pre class="code python literal-block"> -<span class="keyword namespace">from</span> <span class="name namespace">merge_sort</span> <span class="keyword namespace">import</span> <span class="name">merge_sort</span> -<span class="keyword namespace">from</span> <span class="name namespace">selection_sort</span> <span class="keyword namespace">import</span> <span class="name">selection_sort</span> - -<span class="keyword namespace">import</span> <span class="name namespace">random</span><span class="operator">,</span> <span class="name namespace">time</span> -<span class="name">L</span> <span class="operator">=</span> <span class="punctuation">[</span><span class="name">random</span><span class="operator">.</span><span class="name">randint</span><span class="punctuation">(</span><span class="literal number integer">0</span><span class="punctuation">,</span><span class="literal number integer">10000</span><span class="punctuation">)</span> <span class="keyword">for</span> <span class="name">i</span> <span class="operator word">in</span> <span class="name builtin">range</span><span class="punctuation">(</span><span class="literal number integer">10000</span><span class="punctuation">)]</span> - -<span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'Python sort 
...'</span><span class="punctuation">)</span> -<span class="name">now</span> <span class="operator">=</span> <span class="name">time</span><span class="operator">.</span><span class="name">time</span><span class="punctuation">()</span> -<span class="name">result0</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">time</span><span class="operator">.</span><span class="name">time</span><span class="punctuation">()</span> <span class="operator">-</span> <span class="name">now</span><span class="punctuation">)</span> - - -<span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'Merge sort ...'</span><span class="punctuation">)</span> -<span class="name">now</span> <span class="operator">=</span> <span class="name">time</span><span class="operator">.</span><span class="name">time</span><span class="punctuation">()</span> -<span class="name">result1</span> <span class="operator">=</span> <span class="name">merge_sort</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">time</span><span class="operator">.</span><span class="name">time</span><span class="punctuation">()</span> <span class="operator">-</span> <span class="name">now</span><span class="punctuation">)</span> - -<span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'Selection sort ...'</span><span class="punctuation">)</span> -<span class="name">now</span> <span class="operator">=</span> <span class="name">time</span><span class="operator">.</span><span class="name">time</span><span class="punctuation">()</span> -<span class="name">result2</span> <span 
class="operator">=</span> <span class="name">selection_sort</span><span class="punctuation">(</span><span class="name">L</span><span class="punctuation">)</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="name">time</span><span class="operator">.</span><span class="name">time</span><span class="punctuation">()</span> <span class="operator">-</span> <span class="name">now</span><span class="punctuation">)</span> - -<span class="keyword">assert</span> <span class="name">result0</span> <span class="operator">==</span> <span class="name">result1</span> -<span class="keyword">assert</span> <span class="name">result1</span> <span class="operator">==</span> <span class="name">result2</span> -</pre> -<p>Running the program above from the command line on the author's computer gives the following results (elapsed time in seconds).</p> -<div class="line-block"> -<div class="line">Python sort ...</div> -<div class="line">0.002</div> -<div class="line">Merge sort ...</div> -<div class="line">0.083</div> -<div class="line">Selection sort ...</div> -<div class="line">11.57</div> -</div> -</div> -<div class="section" id="id23"> -<h2><a class="toc-backref" href="#id61">Sorting a list of tuples</a></h2> -<p>A tuple consists of several elements, and several tuples make up a list of tuples. How can such a list be sorted by one particular element?</p> -<p>There are two approaches: one uses the <tt class="docutils literal">operator</tt> module, the other uses a <tt class="docutils literal">lambda</tt> function.</p> -<pre class="code python literal-block"> -<span class="keyword">def</span> <span class="name function">sort_by_nth_element</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="name">n</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a new list of the tuples in lst, sorted by the element at index n (0-based) of each tuple.'''</span> - <span class="keyword namespace">import</span> <span class="name namespace">operator</span> - <span class="name">result</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">lst</span><span 
class="punctuation">,</span> <span class="name">key</span><span class="operator">=</span><span class="name">operator</span><span class="operator">.</span><span class="name">itemgetter</span><span class="punctuation">(</span><span class="name">n</span><span class="punctuation">))</span> - <span class="keyword">return</span> <span class="name">result</span> - - -<span class="keyword">def</span> <span class="name function">sort_by_nth_element2</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="name">n</span><span class="punctuation">):</span> - <span class="literal string doc">''' Return a new list of the tuples in lst, sorted by the element at index n (0-based) of each tuple.'''</span> - <span class="name">result</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="name">key</span><span class="operator">=</span><span class="keyword">lambda</span> <span class="name">x</span><span class="punctuation">:</span> <span class="name">x</span><span class="punctuation">[</span><span class="name">n</span><span class="punctuation">])</span> <span class="comment single"># https://stackoverflow.com/questions/8966538/syntax-behind-sortedkey-lambda</span> - <span class="keyword">return</span> <span class="name">result</span> - - -<span class="keyword">if</span> <span class="name variable magic">__name__</span> <span class="operator">==</span> <span class="literal string single">'__main__'</span><span class="punctuation">:</span> - <span class="name">lst</span> <span class="operator">=</span> <span class="punctuation">[(</span><span class="literal number integer">1</span><span class="punctuation">,</span> <span class="literal string single">'xxx'</span><span class="punctuation">,</span> <span class="literal 
number integer">2</span><span class="punctuation">),</span> <span class="punctuation">(</span><span class="literal number integer">2</span><span class="punctuation">,</span> <span class="literal string single">'aaa'</span><span class="punctuation">,</span> <span class="literal number integer">1</span><span class="punctuation">)]</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">sort_by_nth_element</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="literal number integer">0</span><span class="punctuation">))</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">sort_by_nth_element</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="literal number integer">1</span><span class="punctuation">))</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">sort_by_nth_element</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="literal number integer">2</span><span class="punctuation">))</span> - - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">sort_by_nth_element2</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="literal number integer">0</span><span class="punctuation">))</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">sort_by_nth_element2</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="literal number integer">1</span><span class="punctuation">))</span> - <span class="keyword">print</span><span class="punctuation">(</span><span class="name">sort_by_nth_element2</span><span class="punctuation">(</span><span class="name">lst</span><span 
class="punctuation">,</span> <span class="literal number integer">2</span><span class="punctuation">))</span> -</pre> -</div> -<div class="section" id="lambda"> -<h2><a class="toc-backref" href="#id62">Flexible sorting with lambda functions</a></h2> -<p>How can a list of strings be sorted by the length of each string? Pass a key function that returns the length; negating the length sorts from longest to shortest.</p> -<pre class="code python literal-block"> -<span class="name">lst</span> <span class="operator">=</span> <span class="punctuation">[</span><span class="literal string single">'this'</span><span class="punctuation">,</span> <span class="literal string single">'is'</span><span class="punctuation">,</span> <span class="literal string single">'a'</span><span class="punctuation">,</span> <span class="literal string single">'example'</span><span class="punctuation">]</span> -<span class="name">a</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="name">key</span><span class="operator">=</span><span class="keyword">lambda</span> <span class="name">x</span><span class="punctuation">:</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">x</span><span class="punctuation">))</span> -<span class="name">b</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="name">key</span><span class="operator">=</span><span class="keyword">lambda</span> <span class="name">x</span><span class="punctuation">:</span> <span class="operator">-</span><span class="name builtin">len</span><span class="punctuation">(</span><span class="name">x</span><span class="punctuation">))</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'</span><span class="literal string escape">\n</span><span class="literal string single">'</span><span class="operator">.</span><span 
class="name">join</span><span class="punctuation">(</span><span class="name">a</span><span class="punctuation">))</span> - -<span class="name">s</span> <span class="operator">=</span> <span class="literal string single">'''https://genius.com/William-shakespeare-romeo-and-juliet-act-1-prologue-annotated#note-2756596 -Romeo and Juliet -PROLOGUE -Two households, both alike in dignity, -In fair Verona, where we lay our scene, -From ancient grudge break to new mutiny, -Where civil blood makes civil hands unclean. -From forth the fatal loins of these two foes -A pair of star-cross'd lovers take their life; -Whose misadventured piteous overthrows -Doth with their death bury their parents' strife. -The fearful passage of their death-mark'd love, -And the continuance of their parents' rage, -Which, but their children's end, nought could remove, -Is now the two hours' traffic of our stage; -The which if you with patient ears attend, -What here shall miss, our toil shall strive to mend.'''</span> - -<span class="name">lst</span> <span class="operator">=</span> <span class="name">s</span><span class="operator">.</span><span class="name">split</span><span class="punctuation">(</span><span class="literal string single">'</span><span class="literal string escape">\n</span><span class="literal string single">'</span><span class="punctuation">)</span> -<span class="name">c</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span class="name">lst</span><span class="punctuation">,</span> <span class="name">key</span><span class="operator">=</span><span class="keyword">lambda</span> <span class="name">x</span><span class="punctuation">:</span> <span class="name builtin">len</span><span class="punctuation">(</span><span class="name">x</span><span class="punctuation">))</span> -<span class="name">d</span> <span class="operator">=</span> <span class="name builtin">sorted</span><span class="punctuation">(</span><span 
class="name">lst</span><span class="punctuation">,</span> <span class="name">key</span><span class="operator">=</span><span class="keyword">lambda</span> <span class="name">x</span><span class="punctuation">:</span> <span class="operator">-</span><span class="name builtin">len</span><span class="punctuation">(</span><span class="name">x</span><span class="punctuation">))</span> -<span class="keyword">print</span><span class="punctuation">(</span><span class="literal string single">'</span><span class="literal string escape">\n</span><span class="literal string single">'</span><span class="operator">.</span><span class="name">join</span><span class="punctuation">(</span><span class="name">c</span><span class="punctuation">))</span> -</pre> -<p>Running the program above prints the following output (only the ascending-order lists <tt class="docutils literal">a</tt> and <tt class="docutils literal">c</tt> are printed).</p> -<pre class="literal-block"> -a -is -this -example -PROLOGUE -Romeo and Juliet -Two households, both alike in dignity, -Whose misadventured piteous overthrows -In fair Verona, where we lay our scene, -From ancient grudge break to new mutiny, -The which if you with patient ears attend, -And the continuance of their parents' rage, -Is now the two hours' traffic of our stage; -Where civil blood makes civil hands unclean. -From forth the fatal loins of these two foes -A pair of star-cross'd lovers take their life; -The fearful passage of their death-mark'd love, -Doth with their death bury their parents' strife. -What here shall miss, our toil shall strive to mend. -Which, but their children's end, nought could remove, -https://genius.com/William-shakespeare-romeo-and-juliet-act-1-prologue-annotated#note-2756596 -</pre> -</div> -</div> -<div class="section" id="id24"> -<h1><a class="toc-backref" href="#id63">References</a></h1> -<ul class="simple"> -<li>Think Python 2e – Green Tea Press. <a class="reference external" href="http://greenteapress.com/thinkpython2/thinkpython2.pdf">http://greenteapress.com/thinkpython2/thinkpython2.pdf</a>.</li> -</ul> -<!-- Make an HTML page from this file. 
Issue the following command: -pip install docutils && rst2html.py LectureNotesOnPython.rst LectureNotesOnPython.html --> -</div> -</div> -</body> -</html> |