You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@zeppelin.apache.org by "Mathew (JIRA)" <ji...@apache.org> on 2018/05/15 21:51:00 UTC
[jira] [Created] (ZEPPELIN-3462) DataFrames with tabs get corrupted
in SQL interpreter.
Mathew created ZEPPELIN-3462:
--------------------------------
Summary: DataFrames with tabs get corrupted in SQL interpreter.
Key: ZEPPELIN-3462
URL: https://issues.apache.org/jira/browse/ZEPPELIN-3462
Project: Zeppelin
Issue Type: Bug
Components: Interpreters
Affects Versions: 0.7.3
Reporter: Mathew
Fix For: 0.8.0
Attachments: image-2018-05-16-09-49-44-647.png
If there is a tab in a dataframe, the SQL interpreter will interpret this as a new column, causing the table display to chomp of some of the following columns.
*Steps to Reproduce:*
Create dataframe with tab:
{code:java}
%spark.pyspark
from pyspark.sql import Row
# Create dataframe with 3 cols
df = sc.parallelize([
Row(u'First col, \u0009 still first col.', 'Second col', 'Third col')
]
).toDF()
# Display table
df.show()
# Register table for SQL
df.registerTempTable("df"){code}
Query in SQL interpreter:
{code:java}
%sql
SELECT * FROM df
{code}
Output:
!image-2018-05-16-09-49-44-647.png!
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)