You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@zeppelin.apache.org by "Khalid Huseynov (JIRA)" <ji...@apache.org> on 2018/07/15 08:45:00 UTC

[jira] [Created] (ZEPPELIN-3625) Pandasql interpreter fails to query over python interpreter dataframe

Khalid Huseynov created ZEPPELIN-3625:
-----------------------------------------

             Summary: Pandasql interpreter fails to query over python interpreter dataframe
                 Key: ZEPPELIN-3625
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3625
             Project: Zeppelin
          Issue Type: Bug
          Components: Interpreters
    Affects Versions: 0.8.0, 0.9.0
            Reporter: Khalid Huseynov


run below paragraphs to reproduce
{code:java}
%python

import pandas as pd
import numpy as np
df = pd.DataFrame(
np.array([
np.arange(0, 200),
np.random.randn(200).cumsum(),
np.random.randn(200).cumsum(),
np.random.randint(1, 3, 200)
]).transpose(),
columns=["X", "Y", "Z", "Group"])

z.show(df){code}
{code:java}
%python.sql
select * from df{code}
Note that master and branch 0.8 both fail to query but errors in each case are different



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)