You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@druid.apache.org by GitBox <gi...@apache.org> on 2020/04/09 22:41:09 UTC

[GitHub] [druid] vogievetsky opened a new issue #9658: Sampler with Inline input source swallows up columns that are all `null`

vogievetsky opened a new issue #9658: Sampler with Inline input source swallows up columns that are all `null`
URL: https://github.com/apache/druid/issues/9658
 
 
   ### Affected Version
   
   0.17.0
   
   ### Description
   
   When posting to the sampler with an inline input source that has a column that is all `null` that column would not be returned as part of the parsed fields without explicit dimensions being set.
   
   Try sending:
   
   ```json
   {
     "type": "index",
     "spec": {
       "ioConfig": {
         "type": "index",
         "inputSource": {
           "type": "inline",
           "data": "{\"make\":\"Honda\",\"model\":\"Odyssey\",\"a\":null}\n{\"make\":\"Tesla\",\"model\":\"Cybertruck\",\"a\":null}"
         },
         "inputFormat": {
           "type": "json"
         }
       },
       "dataSchema": {
         "dataSource": "sample",
         "timestampSpec": {
           "column": "!!!_no_such_column_!!!",
           "missingValue": "1970-01-01T00:00:00Z"
         },
         "dimensionsSpec": {}
       },
       "type": "index",
       "tuningConfig": {
         "type": "index"
       }
     },
     "samplerConfig": {
       "numRows": 500,
       "timeoutMs": 15000
     }
   }
   ```
   
   to:
   
   `/druid/indexer/v1/sampler`
   
   You get back:
   
   ```json
   {
     "numRowsRead": 2,
     "numRowsIndexed": 2,
     "data": [
       {
         "input": {
           "make": "Honda",
           "model": "Odyssey",
           "a": null
         },
         "parsed": {
           "__time": 0,
           "model": "Odyssey",
           "make": "Honda"
         }
       },
       {
         "input": {
           "make": "Tesla",
           "model": "Cybertruck",
           "a": null
         },
         "parsed": {
           "__time": 0,
           "model": "Cybertruck",
           "make": "Tesla"
         }
       }
     ]
   }
   ```
   
   Notice how the `a` column is in not returned as part of a the parsed section.
   
   This creates issues with columns that happen to be all `null` in the sample (of 20) that is used by the console.
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@druid.apache.org
For additional commands, e-mail: commits-help@druid.apache.org