Читать книгу Artificial Intelligence for Business - Jason L. Anderson - Страница 14

3. Data Curation and Governance

Оглавление

Data is paramount to every AI system. A system can only be as good as the data that is used to build it. Therefore, it is important to take stock of all the possible data sources at your disposal. This is true whether it is data being collected and stored internally or data that you externally license.

After you have identified your data, it is time to leverage technology to further improve the data's quality and prepare it to train an AI system. Crowdsourcing can be a valuable tool to enhance existing data, and data platforms such as Apache Hadoop can help consolidate data from multiple sources. Data scientists will be key in orchestrating this process and ensuring success. The quality of your data will determine the success of your project in a huge way. It is therefore essential to choose the best available data on hand. The old saying about “garbage in, garbage out” applies to AI as well.

Artificial Intelligence for Business

Подняться наверх