Patent application number | Description | Published |
20130191404 | USING VIEWS OF SUBSETS OF NODES OF A SCHEMA TO GENERATE DATA TRANSFORMATION JOBS TO TRANSFORM INPUT FILES IN FIRST DATA FORMATS TO OUTPUT FILES IN SECOND DATA FORMATS - Provided are a computer program product, system, and method for processing input data in a storage system and in communication with a repository. Views are generated that comprise a tree of nodes selected from a subset of nodes in a hierarchical representation of a schema. The views are saved to the repository. At least one of the views are used to create a job comprising a sequence of data transformation steps to transform the input data described by input schemas to the output data described by output schemas. | 07-25-2013 |
20130191419 | USING VIEWS OF SUBSETS OF NODES OF A SCHEMA TO GENERATE DATA TRANSFORMATION JOBS TO TRANSFORM INPUT FILES IN FIRST DATA FORMATS TO OUTPUT FILES IN SECOND DATA FORMATS - Provided is a method for processing input data in a storage system and in communication with a repository. Views are generated that comprise a tree of nodes selected from a subset of nodes in a hierarchical representation of a schema. The views are saved to the repository. At least one of the views are used to create a job comprising a sequence of data transformation steps to transform the input data described by input schemas to the output data described by output schemas. | 07-25-2013 |
20130191421 | GENERATING VIEWS OF SUBSETS OF NODES OF A SCHEMA - Provided are a computer program product, system, and method for processing schemas in a storage system. A presentation of a schema in a graphical user interface (GUI) is comprised of multiple type nodes in a tree structure. Each type node comprises a hierarchical arrangement of a plurality of nodes including group nodes including a plurality of nodes and content nodes providing values. First user input selects one of the type nodes in the schema for a view. Second user input selects one of the nodes in the selected type node in the schema for a view. Third user input selects a node in the schema to indicate a root node of the schema for the view. The view includes the root node and at least one sub node of the root node and is added as a child to the selected type node. | 07-25-2013 |
20130191780 | GENERATING VIEWS OF SUBSETS OF NODES OF A SCHEMA - Provided is a method for processing schemas in a storage system. A presentation of a schema in a graphical user interface (GUI) is comprised of multiple type nodes in a tree structure. Each type node comprises a hierarchical arrangement of a plurality of nodes including group nodes including a plurality of nodes and content nodes providing values. First user input selects one of the type nodes in the schema for a view. Second user input selects one of the nodes in the selected type node in the schema for a view. Third user input selects a node in the schema to indicate a root node of the schema for the view. The view includes the root node and at least one sub node of the root node and is added as a child to the selected type node. | 07-25-2013 |
20140059064 | USING VIEWS OF SUBSETS OF NODES OF A SCHEMA TO GENERATE DATA TRANSFORMATION JOBS TO TRANSFORM INPUT FILES IN FIRST DATA FORMATS TO OUTPUT FILES IN SECOND DATA FORMATS - Provided is a method for processing input data in a storage system and in communication with a repository. Views are generated that comprise a tree of nodes selected from a subset of nodes in a hierarchical representation of a schema. The views are saved to the repository. At least one of the views are used to create a job comprising a sequence of data transformation steps to transform the input data described by input schemas to the output data described by output schemas. | 02-27-2014 |
20140207826 | GENERATING XML SCHEMA FROM JSON DATA - A computer receives a first JSON data that includes at least one JSON array or JSON object value. The computer parses a stream of JSON data, wherein the stream of JSON data includes at least a part of the first JSON data. The computer determines the logical structure of the first JSON data using the parsed stream of JSON data. The computer generates an XML schema based on the logical structure of the first JSON data. | 07-24-2014 |
20140207828 | DECOMPOSING XML SCHEMA DOCUMENTS INTO SUBSETS - According to one embodiment of the present invention, a system decomposes a set of schema files. The system receives a set of schema files and automatically identifies a plurality of root schema files in the set, where a root schema file is determined based on remaining schema files in the set lacking a reference to that schema file. For each root schema file, the system creates a subset of the original set of schema files. The subset contains the root schema file, and at least one subset further includes one or more schema files that provide information for that root schema file. Embodiments of the present invention further include a method and computer program product for decomposing a set of schema files in substantially the same manners described above. | 07-24-2014 |
20140279828 | CONTROL DATA DRIVEN MODIFICATIONS AND GENERATION OF NEW SCHEMA DURING RUNTIME OPERATIONS - A computational device receives input data and control data, where the control data includes instructions to modify one or more operations performed during a runtime execution associated with the input data. The control data is processed to modify the one or more operations during the runtime execution associated of the input data. | 09-18-2014 |
20140279835 | SELF-ANALYZING DATA PROCESSING JOB TO DETERMINE DATA QUALITY ISSUES - Techniques are disclosed to determine data quality issues in data processing jobs. The data processing job is received, the data processing job specifying one or more processing steps designed based on one or more data schemas and further specifies one or more desired quality metrics to measure at the one or more processing steps. One or more state machines are provided, that are generated based on the quality metrics and on the data schemas. Input data to the data process job are processed using the one or more state machines, in order to generate output data and a set of data quality records characterizing a set of data quality issues identified during the execution of the data processing job. | 09-18-2014 |
20140279934 | SELF-ANALYZING DATA PROCESSING JOB TO DETERMINE DATA QUALITY ISSUES - Techniques are disclosed to determine data quality issues in data processing jobs. The data processing job is received, the data processing job specifying one or more processing steps designed based on one or more data schemas and further specifies one or more desired quality metrics to measure at the one or more processing steps. One or more state machines are provided, that are generated based on the quality metrics and on the data schemas. Input data to the data process job are processed using the one or more state machines, in order to generate output data and a set of data quality records characterizing a set of data quality issues identified during the execution of the data processing job. | 09-18-2014 |
20140280366 | OUTPUT DRIVEN GENERATION OF A COMBINED SCHEMA FROM A PLURALITY OF INPUT DATA SCHEMAS - A computational device receives a plurality of versions of an input data schema. At least one element is selected from the plurality of versions of the input data schema based on an expected result. A combined schema is generated based on the at least one selected element. The input data is processed according to the combined schema. | 09-18-2014 |
20150019477 | OUTPUT DRIVEN GENERATION OF A COMBINED SCHEMA FROM A PLURALITY OF INPUT DATA SCHEMAS - A computational device receives a plurality of versions of an input data schema. At least one element is selected from the plurality of versions of the input data schema based on an expected result. A combined schema is generated based on the at least one selected element. The input data is processed according to the combined schema. | 01-15-2015 |
20150058279 | CONTROL DATA DRIVEN MODIFICATIONS AND GENERATION OF NEW SCHEMA DURING RUNTIME OPERATIONS - A computational device receives input data and control data, where the control data includes instructions to modify one or more operations performed during a runtime execution associated with the input data. The control data is processed to modify the one or more operations during the runtime execution associated of the input data. | 02-26-2015 |