Posted to jira@arrow.apache.org by "helmi (Jira)" <ji...@apache.org> on 2022/09/07 15:01:00 UTC

[jira] [Updated] (ARROW-17644) Exception when reading binary arrow file (Value cannot be null. (Parameter 'name'))

     [ https://issues.apache.org/jira/browse/ARROW-17644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

helmi updated ARROW-17644:
--------------------------
    Description: 
Hi everyone,

I'm trying to read a binary Arrow file using the C# Apache Arrow library v9.0.0, and I'm hitting this exception:
{code:java}
Unhandled exception. System.AggregateException: One or more errors occurred. (Value cannot be null. (Parameter 'name'))
 ---> System.ArgumentNullException: Value cannot be null. (Parameter 'name')
   at Apache.Arrow.Field..ctor(String name, IArrowType dataType, Boolean nullable)
   at Apache.Arrow.Ipc.MessageSerializer.FieldFromFlatbuffer(Field flatbufField, DictionaryMemo& dictionaryMemo)
   at Apache.Arrow.Ipc.MessageSerializer.FieldFromFlatbuffer(Field flatbufField, DictionaryMemo& dictionaryMemo)
   at Apache.Arrow.Ipc.MessageSerializer.GetSchema(Schema schema, DictionaryMemo& dictionaryMemo)
   at Apache.Arrow.Ipc.ArrowStreamReaderImplementation.<ReadSchemaAsync>b__10_0(Memory`1 buff)
   at Apache.Arrow.ArrayPoolExtensions.RentReturnAsync(ArrayPool`1 pool, Int32 length, Func`2 action)
   at Apache.Arrow.Ipc.ArrowStreamReaderImplementation.ReadSchemaAsync()
   at Apache.Arrow.Ipc.ArrowStreamReaderImplementation.ReadRecordBatchAsync(CancellationToken cancellationToken)
   at Apache.Arrow.Ipc.ArrowStreamReaderImplementation.ReadNextRecordBatchAsync(CancellationToken cancellationToken) {code}

As far as I understand, the library is complaining about a field name being null. I'm not sure that is really the case, since I read the same file with the Apache Arrow Go library and it seems to work without issue.

Please find the `sample.arrow` file attached.

Below is the sample code I'm using to read this Arrow file:
* C# sample
{code:java}
using System;
using System.IO;
using System.Threading.Tasks;
using Apache.Arrow;
using Apache.Arrow.Ipc;

namespace arrow_csharp_issue
{
    class Program
    {
        static async Task AsyncMain()
        {
            byte[] bytes = File.ReadAllBytes("./inputs/sample.arrow");
            using (var memoryStream = new MemoryStream(bytes))
            using (var reader = new ArrowStreamReader(memoryStream))
            {
                RecordBatch record = await reader.ReadNextRecordBatchAsync();
                Console.WriteLine(record);
            }
        }

        static void Main(string[] args)
        {
            AsyncMain().Wait();
        }
    }
}
{code}
* Golang sample
{code:java}
package main

import (
	"bytes"
	"fmt"
	"os"

	"github.com/apache/arrow/go/v9/arrow/ipc"
)

func main() {
	data, err := os.ReadFile("./inputs/sample.arrow")
	if err != nil {
		panic(err)
	}

	reader, err := ipc.NewReader(bytes.NewReader(data))
	if err != nil {
		panic(err)
	}
	defer reader.Release()

	reader.Next()
	record := reader.Record()
	fmt.Println(record)
}
{code}

Thank you


> Exception when reading binary arrow file (Value cannot be null. (Parameter 'name'))
> -----------------------------------------------------------------------------------
>
>                 Key: ARROW-17644
>                 URL: https://issues.apache.org/jira/browse/ARROW-17644
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C#
>            Reporter: helmi
>            Priority: Major
>         Attachments: sample.arrow


