要得到一組數(shù)據(jù)的中位數(shù)(例如某個(gè)地區(qū)或某家公司的收入中位數(shù)),我們一般要將這一任務(wù)細(xì)分為 3 個(gè)小任務(wù):
1.將數(shù)據(jù)排序,并給每一行數(shù)據(jù)給出其在所有數(shù)據(jù)中的排名;
2.找出中位數(shù)的排名數(shù)字;
3.找出中間排名對(duì)應(yīng)的值;
下面以某公司員工月收入為例,示例 MySQL 的一些復(fù)雜語(yǔ)句的使用。
方法一
創(chuàng)建測(cè)試表
首先創(chuàng)建一個(gè)收入表,建表語(yǔ)句為:
CREATE TABLE IF NOT EXISTS `employee` ( `id` INT AUTO_INCREMENT PRIMARY KEY, `name` VARCHAR(10) NOT NULL DEFAULT '', `income` INT NOT NULL DEFAULT '0' ) ENGINE = InnoDB DEFAULT CHARSET = utf8; INSERT INTO `employee` (`name`, `income`) VALUES ('麻子', 20000); INSERT INTO `employee` (`name`, `income`) VALUES ('李四', 12000); INSERT INTO `employee` (`name`, `income`) VALUES ('張三', 10000); INSERT INTO `employee` (`name`, `income`) VALUES ('王二', 16000); INSERT INTO `employee` (`name`, `income`) VALUES ('土豪', 40000);
完成任務(wù) 1
將數(shù)據(jù)排序,并給每一行數(shù)據(jù)給出其在所有數(shù)據(jù)中的排名:
SELECT t1.name, t1.income, COUNT(*) AS rank FROM employee AS t1, employee AS t2 WHERE t1.income < t2.income OR (t1.income = t2.income AND t1.name <= t2.name) GROUP BY t1.name, t1.income ORDER BY rank;
查詢(xún)結(jié)果為:
完成小任務(wù) 2
找出中位數(shù)的排名數(shù)字:
SELECT (COUNT(*) + 1) DIV 2 as rank FROM employee;
查詢(xún)結(jié)果為:
完成小任務(wù) 3
SELECT income AS median FROM (SELECT t1.name, t1.income, COUNT(*) AS rank FROM employee AS t1, employee AS t2 WHERE t1.income < t2.income OR (t1.income = t2.income AND t1.name <= t2.name) GROUP BY t1.name, t1.income ORDER BY rank) t3 WHERE rank = (SELECT (COUNT(*) + 1) DIV 2 FROM employee)
查詢(xún)結(jié)果為:
至此,我們就找到了如何從一組數(shù)據(jù)中獲得中位數(shù)的方法。
方法二
下面,來(lái)介紹另外一種優(yōu)化排名語(yǔ)句的方法。
我們都知道如何給一組數(shù)據(jù)做排序操作,在本例中,實(shí)現(xiàn)方法如下:
SELECT name, income FROM employee ORDER BY income DESC
查詢(xún)結(jié)果為:
那我們可不可以更進(jìn)一步,對(duì)查詢(xún)出的結(jié)果加一列,這一列的數(shù)據(jù)為排名呢?
我們可以通過(guò) 3 個(gè)自定義變量的方法來(lái)實(shí)現(xiàn)這一目標(biāo):
第一個(gè)變量用來(lái)記錄當(dāng)前行數(shù)據(jù)的收入
第二個(gè)變量用來(lái)記錄上一行數(shù)據(jù)的收入
第三個(gè)變量用來(lái)記錄當(dāng)前行數(shù)據(jù)的排名
SET @curr_income := 0; SET @prev_income := 0; SET @rank := 0; SELECT `name`, @curr_income := income AS income, @rank := if(@prev_income != @curr_income, @rank + 1, @rank) AS rank, @prev_income := @curr_income AS dummy FROM employee ORDER BY income DESC
查詢(xún)結(jié)果如下:
然后再找出中位數(shù)的排名數(shù)字,進(jìn)一步找出收入的中位數(shù):
SET @curr_income := 0; SET @prev_income := 0; SET @rank := 0; SELECT income AS median FROM (SELECT `name`, @curr_income := income AS income, @rank := if(@prev_income != @curr_income, @rank + 1, @rank) AS rank, @prev_income := @curr_income AS dummy FROM employee ORDER BY income DESC) AS t1 WHERE t1.rank = (SELECT (COUNT(*) + 1) DIV 2 FROM employee)
查詢(xún)結(jié)果為:
至此,我們找了兩種方法來(lái)解決中位數(shù)的問(wèn)題。撒花。
推薦:《mysql教程》