使用LINQ在DataTable上进行数据透视的逻辑

7

我看到了一些描述这个问题的帖子,但是对于我的情况还无法完全理解。我曾经有一个使用PIVOT命令来排序我的表的SQL查询,现在我正在尝试通过LINQ将这个逻辑移入我们的应用程序中。该表存储在一个DataTable中,看起来像这样。

ObjectName | ColumnName  |  Property  |  Value
----------------------------------------------
foo        | bar         |  a         | w
foo        | bar         |  b         | x
foo        | bar         |  c         | y
foo        | bar         |  d         | z
foo        | test        |  a         | i
foo        | test        |  b         | j
foo        | test        |  c         | k
foo        | test        |  d         | l

我希望将其转换为一个类似于下面这样的数据表格。
ObjectName   |  ColumnName  |  a  |  b  |  c  |  d 
---------------------------------------------------
foo          |  bar         |  w  |  x  |  y  |  z
foo          |  test        |  i  |  j  |  k  |  l

所以我尝试了这样的操作...

var query = dt.AsEnumerable()
    .GroupBy(row => row.Field<string>("ColumnName"))
    .Select(g => new {
        ColumnName = g.Key,
        a = g.Where(row => row.Field<string>("Property") == "a").Select(c => c.Field<string>("Value")),
        b = g.Where(row => row.Field<string>("Property") == "b").Select(c => c.Field<string>("Value")),
        c = g.Where(row => row.Field<string>("Property") == "c").Select(c => c.Field<string>("Value")),
        d = g.Where(row => row.Field<string>("Property") == "d").Select(c => c.Field<string>("Value"))
    });

对于某些原因,它没有包括ObjectName(尝试添加它会导致编译错误?)。 在调试器中查看ColumnName是正确的,但其余大部分都是无意义的。 抱歉我的LINQ技能非常差,我正在努力学习,但很容易混淆。

我猜测我的数据类型不正确,以便使用那个扩展方法,但我有点超纲了。 有什么建议吗?

编辑后仍然出现一些错误,我在与这行代码作斗争

DataTable newDT = query.CopyToDataTable();

但是我收到了错误信息

类型“AnonymousType#1”不能用作类型参数“T”,因为在泛型类型或方法“System.Data.DataTableExtensions.CopyToDataTable(System.Collections.Generic.IEnumerable)”中不存在从“AnonymousType#1”到“System.Data.DataRow”的隐式引用转换。


能否发布你所得到的输出结果? - Justin Pihony
1个回答

4

试试这个:

class Program
{
//Helper method to make the Select cleaner:
private static string GetProperty(IEnumerable<DataRow> rows, string propertyName)
{
    return rows
        .Where(row => row.Field<string>("Property") == propertyName)
        .Select(c => c.Field<string>("Value"))
        .FirstOrDefault();
}

//helper method for populating the datatable
private static void addRow(DataTable dt, string objectName, string columnName
    , string property, string value)
{
    var row = dt.NewRow();
    row["ObjectName"] = objectName;
    row["ColumnName"] = columnName;
    row["Property"] = property;
    row["Value"] = value;
    dt.Rows.Add(row);
}

public static void Main(string[] args)
{

    DataTable dt = new DataTable();
    dt.Columns.Add("ObjectName");
    dt.Columns.Add("ColumnName");
    dt.Columns.Add("Property");
    dt.Columns.Add("Value");

    addRow(dt, "foo", "bar", "a", "w");
    addRow(dt, "foo", "bar", "b", "x");
    addRow(dt, "foo", "bar", "c", "y");
    addRow(dt, "foo", "bar", "d", "z");
    addRow(dt, "foo", "test", "a", "i");
    addRow(dt, "foo", "test", "b", "j");
    addRow(dt, "foo", "test", "c", "k");
    addRow(dt, "foo", "test", "d", "l");

    var query = dt.AsEnumerable()
        .GroupBy(row => new
        {
            ObjectName = row.Field<string>("ObjectName"),
            ColumnName = row.Field<string>("ColumnName")
        })
        .Select(g => new
        {
            ObjectName = g.Key.ObjectName,
            ColumnName = g.Key.ColumnName,
            a = GetProperty(g, "a"),
            b = GetProperty(g, "b"),
            c = GetProperty(g, "c"),
            d = GetProperty(g, "d"),
        })
        .CopyToDataTable();

    foreach (DataRow row in query.Rows)
    {
        foreach (DataColumn column in query.Columns)
        {
            System.Console.Write(row[column] + "\t");
        }
        System.Console.WriteLine();
    }


    Console.WriteLine("Press any key to exit. . .");
    Console.ReadKey(true);
}
}

以下是我用于复制到数据表的代码,由于您没有说明您正在使用什么:

using System;
using System.Data;
using System.Collections.Generic;
using System.Reflection;


/// <summary>
/// Code copied directly from http://msdn.microsoft.com/en-us/library/bb669096.aspx
/// </summary>
/// <typeparam name="T"></typeparam>
public class ObjectShredder<T>
{
    private System.Reflection.FieldInfo[] _fi;
    private System.Reflection.PropertyInfo[] _pi;
    private System.Collections.Generic.Dictionary<string, int> _ordinalMap;
    private System.Type _type;

    // ObjectShredder constructor.
    public ObjectShredder()
    {
        _type = typeof(T);
        _fi = _type.GetFields();
        _pi = _type.GetProperties();
        _ordinalMap = new Dictionary<string, int>();
    }

    /// <summary>
    /// Loads a DataTable from a sequence of objects.
    /// </summary>
    /// <param name="source">The sequence of objects to load into the DataTable.</param>
    /// <param name="table">The input table. The schema of the table must match that 
    /// the type T.  If the table is null, a new table is created with a schema 
    /// created from the public properties and fields of the type T.</param>
    /// <param name="options">Specifies how values from the source sequence will be applied to 
    /// existing rows in the table.</param>
    /// <returns>A DataTable created from the source sequence.</returns>
    public DataTable Shred(IEnumerable<T> source, DataTable table, LoadOption? options)
    {
        // Load the table from the scalar sequence if T is a primitive type.
        if (typeof(T).IsPrimitive)
        {
            return ShredPrimitive(source, table, options);
        }

        // Create a new table if the input table is null.
        if (table == null)
        {
            table = new DataTable(typeof(T).Name);
        }

        // Initialize the ordinal map and extend the table schema based on type T.
        table = ExtendTable(table, typeof(T));

        // Enumerate the source sequence and load the object values into rows.
        table.BeginLoadData();
        using (IEnumerator<T> e = source.GetEnumerator())
        {
            while (e.MoveNext())
            {
                if (options != null)
                {
                    table.LoadDataRow(ShredObject(table, e.Current), (LoadOption)options);
                }
                else
                {
                    table.LoadDataRow(ShredObject(table, e.Current), true);
                }
            }
        }
        table.EndLoadData();

        // Return the table.
        return table;
    }

    public DataTable ShredPrimitive(IEnumerable<T> source, DataTable table, LoadOption? options)
    {
        // Create a new table if the input table is null.
        if (table == null)
        {
            table = new DataTable(typeof(T).Name);
        }

        if (!table.Columns.Contains("Value"))
        {
            table.Columns.Add("Value", typeof(T));
        }

        // Enumerate the source sequence and load the scalar values into rows.
        table.BeginLoadData();
        using (IEnumerator<T> e = source.GetEnumerator())
        {
            Object[] values = new object[table.Columns.Count];
            while (e.MoveNext())
            {
                values[table.Columns["Value"].Ordinal] = e.Current;

                if (options != null)
                {
                    table.LoadDataRow(values, (LoadOption)options);
                }
                else
                {
                    table.LoadDataRow(values, true);
                }
            }
        }
        table.EndLoadData();

        // Return the table.
        return table;
    }

    public object[] ShredObject(DataTable table, T instance)
    {

        FieldInfo[] fi = _fi;
        PropertyInfo[] pi = _pi;

        if (instance.GetType() != typeof(T))
        {
            // If the instance is derived from T, extend the table schema
            // and get the properties and fields.
            ExtendTable(table, instance.GetType());
            fi = instance.GetType().GetFields();
            pi = instance.GetType().GetProperties();
        }

        // Add the property and field values of the instance to an array.
        Object[] values = new object[table.Columns.Count];
        foreach (FieldInfo f in fi)
        {
            values[_ordinalMap[f.Name]] = f.GetValue(instance);
        }

        foreach (PropertyInfo p in pi)
        {
            values[_ordinalMap[p.Name]] = p.GetValue(instance, null);
        }

        // Return the property and field values of the instance.
        return values;
    }

    public DataTable ExtendTable(DataTable table, Type type)
    {
        // Extend the table schema if the input table was null or if the value 
        // in the sequence is derived from type T.            
        foreach (FieldInfo f in type.GetFields())
        {
            if (!_ordinalMap.ContainsKey(f.Name))
            {
                // Add the field as a column in the table if it doesn't exist
                // already.
                DataColumn dc = table.Columns.Contains(f.Name) ? table.Columns[f.Name]
                    : table.Columns.Add(f.Name, f.FieldType);

                // Add the field to the ordinal map.
                _ordinalMap.Add(f.Name, dc.Ordinal);
            }
        }
        foreach (PropertyInfo p in type.GetProperties())
        {
            if (!_ordinalMap.ContainsKey(p.Name))
            {
                // Add the property as a column in the table if it doesn't exist
                // already.
                DataColumn dc = table.Columns.Contains(p.Name) ? table.Columns[p.Name]
                    : table.Columns.Add(p.Name, p.PropertyType);

                // Add the property to the ordinal map.
                _ordinalMap.Add(p.Name, dc.Ordinal);
            }
        }

        // Return the table.
        return table;
    }
}

/// <summary>
/// Code copied directly from http://msdn.microsoft.com/en-us/library/bb669096.aspx
/// </summary>
public static class CustomLINQtoDataSetMethods
{
    public static DataTable CopyToDataTable<T>(this IEnumerable<T> source)
    {
        return new ObjectShredder<T>().Shred(source, null, null);
    }

    public static DataTable CopyToDataTable<T>(this IEnumerable<T> source,
                                                DataTable table, LoadOption? options)
    {
        return new ObjectShredder<T>().Shred(source, table, options);
    }

}

所以我运行了这段代码,它执行并打印了预期的结果。我已经修改了答案,使其成为一个完全可编译和可执行的程序。 - Servy
哇,谢谢!我本来还想避免那篇MSDN文章中的那个庞大的方法,因为我完全不理解它,但现在这个方法可以用了,直到我找到更好的方法为止。 - Kevin DiTraglia
这是一个很棒的解决方案。我想处理的一个测试是如果缺少一列。例如,下面的数据: addRow(dt, "foo", "bar", "a", "w"); addRow(dt, "foo", "bar", "c", "y"); addRow(dt, "foo", "bar", "d", "z"); addRow(dt, "foo", "test", "a", "i"); addRow(dt, "foo", "test", "c", "k"); addRow(dt, "foo", "test", "d", "l"); - mpora

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接