You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@vxquery.apache.org by "sagarsharma (JIRA)" <ji...@apache.org> on 2015/03/04 22:30:38 UTC

[jira] [Commented] (VXQUERY-131) Supporting Hadoop data and cluster management

    [ https://issues.apache.org/jira/browse/VXQUERY-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347597#comment-14347597 ] 

sagarsharma commented on VXQUERY-131:
-------------------------------------

i know some kind of Big-Data techniques like Hadoop , H-Base , Cassendra , Hive and i really want to do this project so can anybody help me please ..... 


> Supporting Hadoop data and cluster management
> ---------------------------------------------
>
>                 Key: VXQUERY-131
>                 URL: https://issues.apache.org/jira/browse/VXQUERY-131
>             Project: VXQuery
>          Issue Type: Improvement
>            Reporter: Preston Carman
>            Assignee: Preston Carman
>              Labels: gsoc, gsoc2015, hadoop, java, mentor, xml
>
> Many organizations support Hadoop. It would be nice to be able to read data from this source. The project will include creating a strategy (with the mentor's guidance) for reading XML data from HDFS and implementing it. When connecting VXQuery to HDFS, the strategy may need to consider how to read sections of an XML file. 
> In addition, we could use Yarn as our cluster manager. The Apache Hadoop YARN (Yet Another Resource Negotiator) would be a good cluster management tool for VXQuery. If VXQuery can read data from HDFS, then why not also manage the cluster with a tool provided by Hadoop. The solution would replace the current custom python scripts for cluster management.
> Goal
> - Read XML from HDFS
> - Manage the VXQuery cluster with Yarn



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Re: [jira] [Commented] (VXQUERY-131) Supporting Hadoop data and cluster management

Posted by Eldon Carman <ec...@ucr.edu>.
Thanks for you interest in the project. We (or just me) are here to help.
Do you have questions about the project?

On Wed, Mar 4, 2015 at 1:30 PM, sagarsharma (JIRA) <ji...@apache.org> wrote:

>
>     [
> https://issues.apache.org/jira/browse/VXQUERY-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347597#comment-14347597
> ]
>
> sagarsharma commented on VXQUERY-131:
> -------------------------------------
>
> i know some kind of Big-Data techniques like Hadoop , H-Base , Cassendra ,
> Hive and i really want to do this project so can anybody help me please
> .....
>
>
> > Supporting Hadoop data and cluster management
> > ---------------------------------------------
> >
> >                 Key: VXQUERY-131
> >                 URL: https://issues.apache.org/jira/browse/VXQUERY-131
> >             Project: VXQuery
> >          Issue Type: Improvement
> >            Reporter: Preston Carman
> >            Assignee: Preston Carman
> >              Labels: gsoc, gsoc2015, hadoop, java, mentor, xml
> >
> > Many organizations support Hadoop. It would be nice to be able to read
> data from this source. The project will include creating a strategy (with
> the mentor's guidance) for reading XML data from HDFS and implementing it.
> When connecting VXQuery to HDFS, the strategy may need to consider how to
> read sections of an XML file.
> > In addition, we could use Yarn as our cluster manager. The Apache Hadoop
> YARN (Yet Another Resource Negotiator) would be a good cluster management
> tool for VXQuery. If VXQuery can read data from HDFS, then why not also
> manage the cluster with a tool provided by Hadoop. The solution would
> replace the current custom python scripts for cluster management.
> > Goal
> > - Read XML from HDFS
> > - Manage the VXQuery cluster with Yarn
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)
>