从 coordinate 值中获取行和列 numbers 在 openpyxl 中。

Question

从 coordinate 值中获取行和列 numbers 在 openpyxl 中。

63

我试图将Excel中的坐标值转换为openpyxl中的行列号。

例如，如果我的单元格坐标是D4，则希望找到相应的行和列号以便于后续操作，此时行=3，列=4。我可以轻松地使用代码ws.cell('D4').row获取行号，它返回4，然后只需减去1。但是类似的参数ws.cell('D4').column返回D，我不知道如何将其轻松转换为int类型以供后续操作。因此，我求助于stackoverflow上聪明的人们，请帮帮我吧！

- A Alstone

9个回答

85

openpyxl有一个名为get_column_letter的函数，可以将数字转换为对应的列字母。

from openpyxl.utils import get_column_letter
print(get_column_letter(1))

1 --> A

50 --> AX

1234-- AUL

I have been using it like:

from openpyxl import Workbook
from openpyxl.utils import get_column_letter

#create excel type item
wb = Workbook()
# select the active worksheet
ws = wb.active

counter = 0
for column in range(1,6):
    column_letter = get_column_letter(column)
    for row in range(1,11):
        counter = counter +1
        ws[column_letter + str(row)] = counter

wb.save("sample.xlsx")

- Ohm

4

楼主想把列字母转换成索引数字，但反过来做则更加困难且不够明显。这个解决方案很好地解决了这个问题，并且非常有效和简洁。 - user2097818

这很棒，可以在 Excel 列的范围内设置列宽和其他列维度属性，因为 openpyxl 的 column_dimensions[x].width 似乎只接受列的字符串字母值，而不是整数。 - Will Croxford

6

在openpyxl.utils.cell模块中有一个方法可以满足所需功能。该方法名为openpyxl.utils.cell.coordinate_to_tuple()，它将字母数字格式的Excel坐标作为字符串输入，并将这些坐标作为整数元组返回。

openpyxl.utils.cell.coordinate_to_tuple('B1')
>> (1, 2)

这提供了一个更加简洁的解决方案，只需使用指定的库即可实现。

- graingerous

我需要非常简短的答案。 - Mounesh

3

这是基于Nathan的回答。基本上，当行和/或列超过一个字符宽度时，他的答案无法正常工作。抱歉-我有点过了。以下是完整的脚本：

def main():
    from sys import argv, stderr

    cells = None

    if len(argv) == 1:
        cells = ['Ab102', 'C10', 'AFHE3920']
    else:
        cells = argv[1:]

    from re import match as rematch

    for cell in cells:
        cell = cell.lower()

        # generate matched object via regex (groups grouped by parentheses)
        m = rematch('([a-z]+)([0-9]+)', cell)

        if m is None:
            from sys import stderr
            print('Invalid cell: {}'.format(cell), file=stderr)
        else:
            row = 0
            for ch in m.group(1):
                # ord('a') == 97, so ord(ch) - 96 == 1
                row += ord(ch) - 96
            col = int(m.group(2))

            print('Cell: [{},{}] '.format(row, col))

if __name__ == '__main__':
    main()

简而言之，附有一堆注释...

# make cells with multiple characters in length for row/column
# feel free to change these values
cells = ['Ab102', 'C10', 'AFHE3920']

# import regex
from re import match as rematch

# run through all the cells we made
for cell in cells:
    # make sure the cells are lower-case ... just easier
    cell = cell.lower()

    # generate matched object via regex (groups grouped by parentheses)
    ############################################################################
    # [a-z] matches a character that is a lower-case letter
    # [0-9] matches a character that is a number
    # The + means there must be at least one and repeats for the character it matches
    # the parentheses group the objects (useful with .group())
    m = rematch('([a-z]+)([0-9]+)', cell)

    # if m is None, then there was no match
    if m is None:
        # let's tell the user that there was no match because it was an invalid cell
        from sys import stderr
        print('Invalid cell: {}'.format(cell), file=stderr)
    else:
        # we have a valid cell!
        # let's grab the row and column from it

        row = 0

        # run through all of the characters in m.group(1) (the letter part)
        for ch in m.group(1):
            # ord('a') == 97, so ord(ch) - 96 == 1
            row += ord(ch) - 96
        col = int(m.group(2))

        # phew! that was a lot of work for one cell ;)
        print('Cell: [{},{}] '.format(row, col))

print('I hope that helps :) ... of course, you could have just used Adam\'s answer,\
but that isn\'t as fun, now is it? ;)')

- dylnmc

有点过度，但使用 rematch 来切分坐标是一种干净的方式。这绝对是将坐标转换为其他问题的基础。 - user2097818

0

老话题了，但答案不正确！

dylnmc 方法是个好方法，但有一些错误。计算单元格坐标如 "AA1" 或者 "AAB1" 的结果是不正确的。

以下是修正版本作为一个函数。

注意：这个函数返回真实的坐标。如果你想在 ExcelWriter 中使用它，ROW 和 COL 都应该减去一。所以用return(row-1,col-1)替换最后一行。

例如 'AA1' 是 [1,27]，'AAA1' 是 [1,703]；但在 Python 中它们必须是 [0,26] 和 [0,702]。

import re

def coord2num(coord):
    cell = coord.lower()

    # generate matched object via regex (groups grouped by parentheses)
    m = re.match('([a-z]+)([0-9]+)', cell)

    if m is None:
        print('Invalid cell: {}'.format(cell))
        return [None,None]
    else:
        col = 0
        for i,ch in enumerate(m.group(1)[::-1]):
            n = ord(ch)-96
            col+=(26**i)*(n)

        row = int(m.group(2))

    return[row,col]

- Mehdi

0

这将给出列号

col = "BHF"
num = 0
for i in range(len(col)):
    num = num + (ord(col[i]) - 64) * pow(26, len(col) - 1 - i)
print(num)

- Deo

0

如果您想在不使用任何库的情况下完成它：

def col_row(s):
    """ 'AA13' -> (27, 13) """

    def col_num(col):
        """ 'AA' -> 27 """
        s = 0
        for i, char in enumerate(reversed(col)):
            p = (26 ** i)
            s += (1 + ord(char.upper()) - ord('A')) * p
        return s

    def split_A1(s):
        """ 'AA13' -> (AA, 13) """
        for i, char in enumerate(s):
            if char.isdigit():
                return s[:i], int(s[i:])

    col, row = split_A1(s)
    return col_num(col), row

col_row('ABC13')
# Out[124]: (731, 13)

- Andreas

0

看起来较新版本的openpyxl支持cell.col_idx，该属性提供所涉及单元格的基于1的列编号。

因此，ws.cell('D4').col_idx应该返回4，而不是D。

- jwcoder

-4

你可以直接使用纯Python：

cell = "D4"
col = ord(cell[0]) - 65
row = int(cell[1:]) - 1

这使用了 ord 函数，它接受一个字符并返回其 ASCII 码。在 ASCII 中，字母 A 是 65，B 是 66，依此类推。

- Nathan Villaescusa

4

谢谢您的回复，不幸的是，在列大于Z的情况下（例如AD），这种方法行不通。 - A Alstone

网页内容由stack overflow 提供, 点击上面的

可以查看英文原文，
原文链接

- Adam Morris · Accepted Answer

91

你需要的是 openpyxl.utils.coordinate_from_string() 和 openpyxl.utils.column_index_from_string()

from openpyxl.utils.cell import coordinate_from_string, column_index_from_string
xy = coordinate_from_string('A4') # returns ('A',4)
col = column_index_from_string(xy[0]) # returns 1
row = xy[1]

- Adam Morris

1

亚当，还有一个问题，我在哪里可以找到这些不同命令的信息？我能够找到的在线文档http://packages.python.org/openpyxl/index.html非常简略，也没有包含你列出的函数。 - A Alstone

2

文档不多 - 它是基于phpexcel构建的 - http://phpexcel.codeplex.com，并且存在显著重叠。对于一个Python项目，代码结构相当良好 - 只需查看源代码中的单元测试，就可以找到很多东西。 - Adam Morris

8

对于使用最新版本的openpyxl（版本为2.3.1，约于2015年11月20日），请注意一些函数定义已被移动到软件包的不同部分。 "coordinate_from_string"和"column_index_from_string"现在必须从"openpyxl.utils"中导入。查看此处的文档。 - Mackie Messer

3

不确定这是从什么时候开始的，但至少在openpyxl-utilities==0.5（2019年）之后，您可以通过使用from openpyxl.utils.cell import coordinate_from_string, column_index_from_string进行导入。 - mwag