仅对连续的行进行SQL分组

21

假设我有以下表格:

MyTable
---------
| 1 | A |
| 2 | A |
| 3 | A |
| 4 | B |
| 5 | B |
| 6 | B |
| 7 | A |
| 8 | A |
---------

我需要 SQL 查询输出以下内容:
---------
| 3 | A |
| 3 | B |
| 2 | A |
---------

我基本上正在做一个group by,但只针对在序列中在一起的行。有什么想法吗?

请注意,数据库是在SQL Server 2008上。关于这个主题有一篇文章,但它使用了Oracle的lag()函数。


嗨,使用Oracle的lag函数的帖子在哪里? - Mohsin
3个回答

30
这被称为“岛屿”问题。使用Itzik Ben Gan的方法:
;WITH YourTable AS
(
SELECT 1 AS N, 'A' AS C UNION ALL
SELECT 2 AS N, 'A' AS C UNION ALL
SELECT 3 AS N, 'A' AS C UNION ALL
SELECT 4 AS N, 'B' AS C UNION ALL
SELECT 5 AS N, 'B' AS C UNION ALL
SELECT 6 AS N, 'B' AS C UNION ALL
SELECT 7 AS N, 'A' AS C UNION ALL
SELECT 8 AS N, 'A' AS C
),
     T
     AS (SELECT N,
                C,
                DENSE_RANK() OVER (ORDER BY N) - 
                DENSE_RANK() OVER (PARTITION BY C ORDER BY N) AS Grp
         FROM   YourTable)
SELECT COUNT(*),
       C
FROM   T
GROUP  BY C,
          Grp 
ORDER BY MIN(N)

1
太棒了!这个解决方案将被放入工具箱中。 - Steve Homer

0

这将适用于您...

SELECT 
  Total=COUNT(*), C 
FROM 
(
 SELECT 
 NGroup = ROW_NUMBER() OVER (ORDER BY N) - ROW_NUMBER() OVER (PARTITION BY C ORDER BY N),
 N,
 C
 FROM MyTable 
)RegroupedTable
GROUP BY C,NGroup

0

仅仅是为了好玩,没有使用任何特定于SQL的函数,也不假设ID列单调递增:

WITH starters(name, minid, maxid) AS (
    SELECT
        a.name, MIN(a.id), MAX(a.id)
    FROM
        mytable a RIGHT JOIN
        mytable b ON
            (a.name <> b.name AND a.id < b.id) 
    WHERE 
        a.id IS NOT NULL
    GROUP BY 
        a.name
),
both(name, minid, maxid) AS (
    SELECT
        name, minid, maxid
    FROM
        starters
    UNION ALL
    SELECT
        name, MIN(id), MAX(id)
    FROM
        mytable
    WHERE
        id > (SELECT MAX(maxid) from starters)
    GROUP BY
        name
)
SELECT
    COUNT(*), m.name, minid
FROM 
    both INNER JOIN 
    mytable m ON
        id BETWEEN minid AND maxid
GROUP BY
    m.name, minid

结果(忽略midid列):

(No column name)    name    minid
3   A   1
3   B   4
2   A   7

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接