IBM Unveils New Data Prep Tool Designed to Help Speed DataOps


IBM has announced a new data preparation solution designed to help clients improve their DataOps processes and get their data ready for AI quickly and efficiently.

Data preparation is an integral step in building machine learning and predictive models, but it is also one of the most cumbersome and time-consuming, with many data scientists devoting up to 80 percent of their time to it. Data quality remains a critical factor in producing accurate models and insights, but the time-intensive process can stall AI projects.

To ease this process, IBM introduced InfoSphere Advanced Data Preparation, a new solution designed to help clients transform raw datasets by formatting, structuring and enriching them for analytic processing and standard reporting. Jointly developed with data prep software provider Trifacta, the new InfoSphere solution is engineered to work in conjunction with clients' existing data environments, including data lakes.

Among its features, the new InfoSphere solution includes an intuitive dashboard for visualizing the data prep process, including tracking data quality and lineage (where the data originated and where it has been). Clients can then move the resulting cleaned datasets into the business analytics tool of their choice.

Advanced Data Preparation sits on top of a client's data lake or data warehouse and provides automated transformation capabilities. Through the solution's self-service user interface, business users and data scientists alike can access, explore, prepare and enrich datasets for analytics. Beyond data prep, the tool is designed to empower users at all levels of technical expertise to generate business-ready data insights.