You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Veena Basavaraj (JIRA)" <ji...@apache.org> on 2015/02/04 00:25:36 UTC
[jira] [Updated] (SQOOP-1549) Simplifying the Configuration class
concept in Connector api
[ https://issues.apache.org/jira/browse/SQOOP-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Veena Basavaraj updated SQOOP-1549:
-----------------------------------
Fix Version/s: 1.99.5
> Simplifying the Configuration class concept in Connector api
> ------------------------------------------------------------
>
> Key: SQOOP-1549
> URL: https://issues.apache.org/jira/browse/SQOOP-1549
> Project: Sqoop
> Issue Type: Sub-task
> Reporter: Veena Basavaraj
> Assignee: Veena Basavaraj
> Fix For: 1.99.5
>
>
> Here is what happens today ( SQOOP-1367 ) when someone needs to write a connector.
> First they start looking at the connector api and sees that they need to implement configuration classes. Well after some thinking they realize, they need 3 classes. Why they wonder? But they continue on and implement 3 classes. In some cases there is really nothing for Link Configuration, but they still have to create this dummy class for a Configuration Class and then another dummy one for config class, which if it were me would find it absurd.
> Then after creating 3 configuration classes, they need to then create atleast 3 config classes. Note the use of word atleast. The api is not at all obvious in telling them that they infact can create more than 3 config classes. It seems like a hidden feature unless until someone sees some sample code where there is more than one config class per configuration class. !!
> The naming "getJobConfigurationClass" tells them nothing. You may say javadoc could explain it, But I wonder why we need to even support 3 configuration classes and more than 3 config classes.
> {code}
> /**
> * @return Get link configuration class
> */
> public abstract Class getLinkConfigurationClass();
> /**
> * @return Get job configuration group per direction type or null if not supported
> */
> public abstract Class getJobConfigurationClass(Direction jobType);
> {code}
> Here is my proposal ( if at all you want to support groups of configs, they atleast name the class to "ConfiguratioGroup"
> Here is how the apis makes it obvious, that this class can contain a group of link configs
> {code}
> /**
> * @return Get link configuration group class
> */
> public abstract Class getLinkConfigurationGroupClass();
> /**
> * @return Get job configuration group class per direction type or null if not supported
> */
> public abstract Class getJobConfigurationGroupClass(Direction jobType);
> {code}
> [~abec] seems to need some validation from the group on why it should be called "Group". I have explained my reasoning for this change in https://reviews.apache.org/r/26295/
> Alternatively I think the current design/ implementation to support config parameters grouping is overkill ( over designed)
> I prefer simple apis, less things for a developer to code and intuitive names to everything they represent
> 1. Remove the ConfigList and support grouping of configs by the "group" attribute on inputs
> 2. Have one configuration class annotation that will mandate 3 classes with specific annotations attributes on it FromConfig, ToConfig and LinkConfig to be filled.
> So having one class, gives a complete picture of all configs this connector uses/ provides. There is one resource bundle we require, so it maps to one configuration class as well.
> In code this is how it will look
> {code}
> */
> @ConfigurationGroupClass
> public class HdfsCongifuration {
> @LinkConfig public LinkConfig linkConfig;
> @FromConfig public FromJobConfig fromJobConfig;
> @ToConfig public ToJobConfig toJobConfig;
>
> ....
> }
> {code}
> or
> {code}
> @ConfigurationGroupClass(link="LinkConfig.class", from="fromConfig.class", to="toConfig.class")
> public class HdfsCongifuration {
> ....
> }
> {code}
> {code}
> */
> @ConfigClass(validators = {@Validator(ToJobConfig.ConfigValidator.class)})
> public class ToJobConfig {
> @Input(size = 50, group="foo") public String schemaName;
> @Input(size = 2000, group="bar") public String tableName;
> @Input(size = 50) public String sql;
> @Input(size = 50) public String columns;
> @Input(size = 2000) public String stageTableName;
> @Input public Boolean clearStageTable;
>
> }
> {code}
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)