How to update data in the data pipeline

Ensure the file to be uploaded has been checked for the following:

  • file is in CSV format

  • headings exactly match those in the original uploaded file, including capitalisation

  • no additional spaces in the headings

  • no blank rows or incomplete data

  • no extra spaces in cell data
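
The checks above can be run before upload with a short script. The following is a minimal sketch in Python, not part of the data pipeline itself. The function name check_csv and the example headings are hypothetical, and treating "unfinished data" as a row with the wrong number of cells is an assumption.

```python
import csv
import io


def check_csv(text, expected_headings):
    """Return a list of problems found in CSV text (empty list = none found).

    expected_headings is the heading row of the originally uploaded file;
    it must be supplied by the caller (assumption: it is known in advance).
    """
    problems = []
    rows = list(csv.reader(io.StringIO(text, newline="")))
    if not rows:
        return ["file is empty"]

    headings = rows[0]
    # Headings must not carry leading or trailing spaces.
    for heading in headings:
        if heading != heading.strip():
            problems.append(f"heading {heading!r} has extra spaces")
    # Headings must match the original exactly, including capitals.
    if headings != expected_headings:
        problems.append("headings do not exactly match the original file")

    # Data rows start at row 2 (row 1 is the heading row).
    for row_no, row in enumerate(rows[1:], start=2):
        if not any(cell.strip() for cell in row):
            problems.append(f"row {row_no}: blank row")
            continue
        # Assumption: "unfinished data" means a wrong number of cells.
        if len(row) != len(headings):
            problems.append(
                f"row {row_no}: expected {len(headings)} cells, found {len(row)}"
            )
        for cell in row:
            if cell != cell.strip():
                problems.append(f"row {row_no}: cell {cell!r} has extra spaces")
    return problems
```

To check a file on disk, read it as text first, for example check_csv(open("new_data.csv", encoding="utf-8", newline="").read(), ["Name", "Value"]), where the file name and headings are placeholders. An empty list means none of the listed problems were detected; it does not guarantee that the import will succeed.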

Log in to the Drupal Content Management System (CMS) as a Data importer.

Navigate to Content, then select Datasets.

Select the dataset to edit by selecting the Edit button at the end of the summary row.

Check the following fields:

  • Name: matches original name

  • Machine name: matches original machine name

  • Pipeline: matches original pipeline

  • Destinations: SDP ElasticSearch

  • Batch size: 1,000

  • Invalid values handling: remove records that are now invalid

  • Delimiter: comma separated

To update the data in the table, select Remove to delete the existing CSV file.

Add the new CSV file by selecting Choose File.

Browse to the new file to be uploaded on your local computer.

Select Save.

Note: the Save and process button on the Edit dataset page does not work. Use Save, then process the dataset separately as described in the following steps.

Select the Edit button again under the Operations column on the dataset summary page.

Select the Process tab.

Select the Confirm button to process the data.

The data will be validated and indexed.

Once completed, you will be returned to the dataset summary page.

Do not select Process from the dropdown arrow next to Edit under the Operations column on the dataset summary page. This will not process the dataset.