DBA必须掌握的工具之一,Linux系统环境下的监控命令 top。
top命令是Linux下常用的性能分析工具,能够实时显示系统中各个进程的资源占用状况,类似于Windows的任务管理器。
	
	下面详细介绍它的使用方法。
	top - 01:06:48 up 1:22, 1 user, load average: 0.06, 0.60, 0.48
	Tasks: 29 total, 1 running, 28 sleeping, 0 stopped, 0 zombie
	Cpu(s): 0.3% us, 1.0% sy, 0.0% ni, 98.7% id, 0.0% wa, 0.0% hi, 0.0% si
	Mem: 191272k total, 173656k used, 17616k free, 22052k buffers
	Swap: 192772k total, 0k used, 192772k free, 123988k cached
	PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
	1379 root 16 0 7976 2456 1980 S 0.7 1.3 0:11.03 sshd
	14704 root 16 0 2128 980 796 R 0.7 0.5 0:02.72 top
	1 root 16 0 1992 632 544 S 0.0 0.3 0:00.90 init
	2 root 34 19 0 0 0 S 0.0 0.0 0:00.00 ksoftirqd/0
	3 root RT 0 0 0 0 S 0.0 0.0 0:00.00 watchdog/0
	
	统计信息区前五行是系统整体的统计信息。第一行是任务队列信息,同 uptime 命令的执行结果。
	
	其内容如下:
	01:06:48 当前时间
	up 1:22 系统运行时间,格式为时:分
	1 user 当前登录用户数
	load average: 0.06, 0.60, 0.48 系统负载,即任务队列的平均长度。三个数值分别为 1分钟、5分钟、15分钟前到现在的平均值。
第二、三行为进程和CPU的信息。当有多个CPU时,这些内容可能会超过两行。
	
	内容如下:
	Tasks: 29 total 进程总数
	1 running 正在运行的进程数
	28 sleeping 睡眠的进程数
	0 stopped 停止的进程数
	0 zombie 僵尸进程数
	Cpu(s): 0.3% us 用户空间占用CPU百分比
	1.0% sy 内核空间占用CPU百分比
	0.0% ni 用户进程空间内改变过优先级的进程占用CPU百分比
	98.7% id 空闲CPU百分比
	0.0% wa 等待输入输出的CPU时间百分比
	0.0% hi
	0.0% si
	
	最后两行为内存信息。内容如下:
	Mem: 191272k total 物理内存总量
	173656k used 使用的物理内存总量
	17616k free 空闲内存总量
	22052k buffers 用作内核缓存的内存量
	Swap: 192772k total 交换区总量
	0k used 使用的交换区总量
	192772k free 空闲交换区总量
	123988k cached 缓冲的交换区总量。内存中的内容被换出到交换区,而后又被换入到内存,但使用过的交换区尚未被覆盖,该数值即为这些内容已存在于内存中的交换区的大小。相应的内存再次被换出时可不必再对交换区写入。
	
	进程信息区统计信息区域的下方显示了各个进程的详细信息。首先来认识一下各列的含义。
	序号列名含义
	a PID 进程id
	b PPID 父进程id
	c RUSER Real user name
	d UID 进程所有者的用户id
	e USER 进程所有者的用户名
	f GROUP 进程所有者的组名
	g TTY 启动进程的终端名。不是从终端启动的进程则显示为 ?
	h PR 优先级
	i NI nice值。负值表示高优先级,正值表示低优先级
	j P 最后使用的CPU,仅在多CPU环境下有意义
	k %CPU 上次更新到现在的CPU时间占用百分比
	l TIME 进程使用的CPU时间总计,单位秒
	m TIME+ 进程使用的CPU时间总计,单位1/100秒
	n %MEM 进程使用的物理内存百分比
	o VIRT 进程使用的虚拟内存总量,单位kb。VIRT=SWAP+RES
	p SWAP 进程使用的虚拟内存中,被换出的大小,单位kb。
	q RES 进程使用的、未被换出的物理内存大小,单位kb。RES=CODE+DATA
	r CODE 可执行代码占用的物理内存大小,单位kb
	s DATA 可执行代码以外的部分(数据段+栈)占用的物理内存大小,单位kb
	t SHR 共享内存大小,单位kb
	u nFLT 页面错误次数
	v nDRT 最后一次写入到现在,被修改过的页面数。
	w S 进程状态。
	D=不可中断的睡眠状态
	R=运行
	S=睡眠
	T=跟踪/停止
	Z=僵尸进程
	x COMMAND 命令名/命令行
	y WCHAN 若该进程在睡眠,则显示睡眠中的系统函数名
	z Flags 任务标志,参考 sched.h
默认情况下仅显示比较重要的 PID、USER、PR、NI、VIRT、RES、SHR、S、%CPU、%MEM、TIME+、COMMAND 列。可以通过下面的快捷键来更改显示内容。
更改显示内容通过 f 键可以选择显示的内容。按 f 键之后会显示列的列表,按 a-z 即可显示或隐藏对应的列,最后按回车键确定。
按 o 键可以改变列的显示顺序。按小写的 a-z 可以将相应的列向右移动,而大写的 A-Z 可以将相应的列向左移动。最后按回车键确定。
按大写的 F 或 O 键,然后按 a-z 可以将进程按照相应的列进行排序。而大写的 R 键可以将当前的排序倒转。
	
	命令使用
	1.工具(命令)名称
	top
	2.工具(命令)作用显示系统当前的进程和其他状况; top是一个动态显示过程,即可以通过用户按键来不断刷新当前状态.如果在前台执行该命令,它将独占前台,直到用户终止该程序为止. 比较准确的说,top命令提供了实时的对系统处理器的状态监视.它将显示系统中CPU最“敏感”的任务列表.该命令可以按CPU使用.内存使用和执行时间对任务进行排序;而且该命令的很多特性都可以通过交互式命令或者在个人定制文件中进行设定.
	3.环境设置在Linux下使用。
	4.使用方法
	4.1使用格式
	top [-] [d] [p] [q] [c] [C] [S] [s] [n]
	4.2参数说明
	d 指定每两次屏幕信息刷新之间的时间间隔。当然用户可以使用s交互命令来改变之。
	p 通过指定监控进程ID来仅仅监控某个进程的状态。
	q该选项将使top没有任何延迟的进行刷新。如果调用程序有超级用户权限,那么top将以尽可能高的优先级运行。
	S 指定累计模式
	s 使top命令在安全模式中运行。这将去除交互命令所带来的潜在危险。
	i 使top不显示任何闲置或者僵死进程。
	c 显示整个命令行而不只是显示命令名
	4.3其他
	下面介绍在top命令执行过程中可以使用的一些交互命令。从使用角度来看,熟练的掌握这些命令比掌握选项还重要一些。这些命令都是单字母的,如果在命令行选项中使用了s选项,则可能其中一些命令会被屏蔽掉。
	Ctrl+L 擦除并且重写屏幕。
	h或者? 显示帮助画面,给出一些简短的命令总结说明。
	k 终止一个进程。系统将提示用户输入需要终止的进程PID,以及需要发送给该进程什么样的信号。一般的终止进程可以使用15信号;如果不能正常结束那就使用信号9强制结束该进程。默认值是信号15。在安全模式中此命令被屏蔽。
	i 忽略闲置和僵死进程。这是一个开关式命令。
	q 退出程序。
	r 重新安排一个进程的优先级别。系统提示用户输入需要改变的进程PID以及需要设置的进程优先级值。输入一个正值将使优先级降低,反之则可以使该进程拥有更高的优先权。默认值是10。
	S 切换到累计模式。
	s 改变两次刷新之间的延迟时间。系统将提示用户输入新的时间,单位为s。如果有小数,就换算成m s。输入0值则系统将不断刷新,默认值是5 s。需要注意的是如果设置太小的时间,很可能会引起不断刷新,从而根本来不及看清显示的情况,而且系统负载也会大大增加。
	f或者F 从当前显示中添加或者删除项目。
	o或者O 改变显示项目的顺序。
	l 切换显示平均负载和启动时间信息。
	m 切换显示内存信息。
	t 切换显示进程和CPU状态信息。
	c 切换显示命令名称和完整命令行。
	M 根据驻留内存大小进行排序。
	P 根据CPU使用百分比大小进行排序。
	T 根据时间/累计时间进行排序。
	W 将当前设置写入~/.toprc文件中。这是写top配置文件的推荐方法。
	
	top 的man 命令解释如下:
Listed below are top's available fields. They are always associated with the letter shown, regardless of the position you may have established for them with the 'o' (Order fields) interactive command.Any field is selectable as the sort field, and you control whether they are sorted high-to-low or low-to-high. For additional information on sort provisions see topic 3c. TASK Area Commands.
a: PID -- Process Id
The task's unique process ID, which periodically wraps, though never restarting at zero.
b: PPID -- Parent Process Pid
The process ID of a task's parent.
c: RUSER -- Real User Name
The real user name of the task's owner.
d: UID -- User Id
The effective user ID of the task's owner.
e: USER -- User Name
The effective user name of the task's owner.
f: GROUP -- Group Name
The effective group name of the task's owner.
g: TTY -- Controlling Tty
The name of the controlling terminal. This is usually the device (serial port, pty, etc.) from which the process was started, and which it uses for input oroutput. However, a task need not be associated with a terminal, in which case you'll see '?' displayed.
h: PR -- Priority
The priority of the task.
i: NI -- Nice value
The nice value of the task. A negative nice value means higher priority, whereas a positive nice value means lower priority. Zero in this field simply means priority will not be adjusted in determining a task's dispatchability.
j: P -- Last used CPU (SMP)
A number representing the last used processor. In a true SMP environment this will likely change frequently since the kernel intentionally uses weak affinity. Also, the very act of running top may break this weak affinity and cause more processes to change CPUs more often (because of the extra demand for cpu time).
k: %CPU -- CPU usage
The task's share of the elapsed CPU time since the last screen update, expressed as a percentage of total CPU time. In a true SMP environment, if 'Irix mode' is Off, top will operate in 'Solaris mode' where a task's cpu usage will be divided by the total number of CPUs. You toggle 'Irix/Solaris' modes with the 'I' interactive command.
l: TIME -- CPU Time
Total CPU time the task has used since it started. When 'Cumulative mode' is On, each process is listed with the cpu time that it and its dead children has used. You toggle 'Cumulative mode' with 'S', which is a command-line option and an interactive command. See the 'S' interactive command for additional information regarding this mode.
m: TIME+ -- CPU Time, hundredths
The same as 'TIME', but reflecting more granularity through hundredths of a sec ond.
n: %MEM -- Memory usage (RES)
A task's currently used share of available physical memory.
o: VIRT -- Virtual Image (kb)
The total amount of virtual memory used by the task. It includes all code, data and shared libraries plus pages that have been swapped out. (Note: you can define the STATSIZE=1 environment variable and the VIRT will be calculated from the /proc/#/state VmSize field.)
VIRT = SWAP + RES.
p: SWAP -- Swapped size (kb)
The swapped out portion of a task's total virtual memory image.
q: RES -- Resident size (kb)
The non-swapped physical memory a task has used.
RES = CODE + DATA.
r: CODE -- Code size (kb)
The amount of physical memory devoted to executable code, also known as the'text resident set' size or TRS.
s: DATA -- Data+Stack size (kb)
The amount of physical memory devoted to other than executable code, also known the 'data resident set' size or DRS.
t: SHR -- Shared Mem size (kb)
The amount of shared memory used by a task. It simply reflects memory that could be potentially shared with other processes.
u: nFLT -- Page Fault count
The number of major page faults that have occurred for a task. A page fault occurs when a process attempts to read from or write to a virtual page that is not currently present in its address space. A major page fault is when disk access is involved in making that page available.
v: nDRT -- Dirty Pages count
The number of pages that have been modified since they were last written to disk. Dirty pages must be written to disk before the corresponding physical memory location can be used for some other virtual page.
w: S -- Process Status
The status of the task which can be one of:
'D' = uninterruptible sleep
'R' = running
'S' = sleeping
'T' = traced or stopped
'Z' = zombie
Tasks shown as running should be more properly thought of as 'ready to run' --their task_struct is simply represented on the Linux run-queue. Even without a true SMP machine, you may see numerous tasks in this state depending on top's delay interval and nice value.
x: Command -- Command line or Program name
Display the command line used to start a task or the name of the associated program. You toggle between command line and name with 'c', which is both a command-line option and an interactive command. When you've chosen to display command lines, processes without a command line (like kernel threads) will be shown with only the program name in parentheses, as in this example: ( mdrecoveryd ) Either form of display is subject to potential truncation if it's too long to fit in this field's current width. That width depends upon other fields selected, their order and the current screen width.
Note: The 'Command' field/column is unique, in that it is not fixed-width. When displayed, this column will be allocated all remaining screen width (up to the maximum 512 characters) to provide for the potential growth of program names into command lines.
y: WCHAN -- Sleeping in Function
Depending on the availability of the kernel link map ('System.map'), this field will show the name or the address of the kernel function in which the task is currently sleeping. Running tasks will display a dash ('-') in this column.
Note: By displaying this field, top's own working set will be increased by over 700Kb. Your only means of reducing that overhead will be to stop and restart top.
z: Flags -- Task Flags
This column represents the task's current scheduling flags which are expressed in hexadecimal notation and with zeros suppressed. These flags are officially documented in <linux/sched.h>. Less formal documentation can also be found on the 'Fields select' and 'Order fields' screens.

