Insertion of foreign keys by the kettle

Question

Insertion of foreign keys by the kettle

Asked 6 years, 10 months ago

Viewed 287 times

0

I need to load some data that are in spreadsheets in a relational database, but I have been facing some challenges regarding the insertion of foreign keys.

In these images you can see that I first insert the project data, the auto increment of the database generates a key, I recover it in the table input and then turn this step with Excel Input of the relief data sheet.

I have only one project and several reliefs related to it.

The project data is in a spreadsheet and the relief data is in a different spreadsheet.

When trying to enter no error is returned, but the relief data is not included. It’s a seemingly simple problem, but I’m starting to understand Kettle now.

Thank you in advance!

EDIT2:

1 answer

Browser other questions tagged foreign-key pentaho pentaho-kettle

You are not signed in. Login or sign up in order to post.

by Cristian Curti • **186** points · Answer 1 · 2018-08-27T20:17:09+00:00

You are using multiple input sequences:

Excel input - Table input - Table input - Excel input - Table input

Each time an input step is used in the flow, the resulting table of this input is reset, that is, in the next step you only have what was added in the last input. What you need is for these 5 inputs to be made in separate streams on the same KTR, and unified by a common key, using the Join Teps (I recommend Multiway Merge Join, since there are more than 2 streams).

I also see that you are using the "Accept filenames from Previous step" option in your Excel input. This way Excel input will receive the absolute paths of the files by the table that is arriving at the input step, and will not use the desired path in the list of files/directories.

EDIT:

If no other update parameter is required, you can use the "Run SQL statements" step. In this step you can perform queries with variable substitution, that is, table rows that are fed to this step, will be part of the Query. In the query you will use, you should put a question mark in the query attributes, these "?" will be replaced in the same order as the parameter list.

Ex. If a table this way reaches the step:

With a query this way:

The query will be executed 2 times, changing the "?" in the id_project order, name and location.