Optimization in SQL

Asked 7 years, 4 months ago

Viewed 95 times

1

Considering these two tables in the database:

Product Table:

| id  | nome      |
|-----|-----------|
| aaa | Produto A |
| bbb | Produto B |
| ccc | Produto C |

Attributes Table:

| id_produto | atributo | valor   |
|------------|----------|---------|
| aaa        | cor      | azul    |
| aaa        | tamanho  | M       |
| bbb        | cor      | preto   |
| bbb        | tamanho  | P       |
| ccc        | cor      | amarelo |
| ccc        | tamanho  | G       |

and the following SQL query:

select
    p.nome,
    c.valor,
    t.valor
from
    Produto p,
    Atributos c,
    Atributos t
where
    p.id = c.id_produto and
    p.id = t.id_produto and
    c.atributo = 'cor' and
    t.atributo = 'tamanho'

Is there any way to write this SELECT without joining the Atributos table twice?

Edit #1

Result:

| nome      | cor     | Tamanho |
|-----------|---------|---------|
| Produto A | azul    | M       |
| Produto B | preto   | P       |
| Produto C | amarelo | G       |

Note: I cannot change the table structure, the real database has several different attribute types (they are generated dynamically) and thousands of records.

Are you using letters in the primary key?

– rbz

2018/03/21 at 19:44
Is it running slowly?

– gabrielfalieri

2018/03/21 at 19:45
All data is received via integration, and yes, the primary key can contain letters. The point is not the database structure, but how the SELECT is written.

– Pilati

2018/03/21 at 19:53
I believe what you want is to transpose the rows of the atributos table into columns for each produto. Look into PIVOT.

– Diego Rafael Souza

2018/03/21 at 20:33
How would I get this result using pivot?

– Pilati

2018/03/21 at 20:38
I posted an answer based on your example model. Apply it to your real scenario and check whether the performance improvement carries over. I believe it will, because with the current approach every additional attribute you want in the result requires yet another cross join with the atributos table.

– Diego Rafael Souza

2018/03/21 at 21:31

4 answers

2

You should check whether this pays off in terms of performance for your real case, but for the example presented it would be the following:

SELECT NOME,
       COR,
       TAMANHO
FROM (
    SELECT
        P.NOME AS NOME,
        A.ATRIBUTO AS ATRIBUTO,
        A.VALOR AS VALOR
    FROM PRODUTO P
        JOIN ATRIBUTOS A ON A.ID_PRODUTO = P.ID  -- PRODUTO's key column is ID
) AS FONTE
PIVOT (MIN(VALOR) FOR ATRIBUTO IN (COR, TAMANHO)) AS PVT

For your example snippet, when both queries are run together, the original approach (joining the table to itself) accounts for 71% of the estimated cost and the PIVOT form for 29%, that is, well under half the effort. I believe the same will hold in your real case, but be sure to measure.
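As a rough illustration of what "be sure to measure" could look like, here is a minimal sketch that times both query shapes on synthetic data. It assumes SQLite through Python's standard `sqlite3` module rather than the asker's database, and since SQLite has no PIVOT operator, a `MIN(CASE ...)` aggregation stands in for it. The table size, the index, and the generated values are all made up for the example, and the timings it prints say nothing about how another engine would behave.

```python
import sqlite3
import time

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE produto (id TEXT PRIMARY KEY, nome TEXT);
    CREATE TABLE atributos (id_produto TEXT, atributo TEXT, valor TEXT);
    CREATE INDEX idx_atributos ON atributos (id_produto, atributo);
""")

# Synthetic data (an assumption): 5,000 products with two attributes each.
N = 5000
conn.executemany("INSERT INTO produto VALUES (?, ?)",
                 [(f"p{i}", f"Produto {i}") for i in range(N)])
conn.executemany("INSERT INTO atributos VALUES (?, ?, ?)",
                 [(f"p{i}", attr, f"{attr}-{i}")
                  for i in range(N) for attr in ("cor", "tamanho")])

# Shape 1: the question's query, with atributos joined once per attribute.
SELF_JOIN = """
    SELECT p.nome, c.valor, t.valor
    FROM produto p
    JOIN atributos c ON c.id_produto = p.id AND c.atributo = 'cor'
    JOIN atributos t ON t.id_produto = p.id AND t.atributo = 'tamanho'
    ORDER BY p.id
"""

# Shape 2: atributos joined once, then folded into columns per product.
SINGLE_PASS = """
    SELECT p.nome,
           MIN(CASE WHEN a.atributo = 'cor' THEN a.valor END),
           MIN(CASE WHEN a.atributo = 'tamanho' THEN a.valor END)
    FROM produto p
    JOIN atributos a ON a.id_produto = p.id
    GROUP BY p.id, p.nome
    ORDER BY p.id
"""

def timed(sql):
    """Run a query and return (rows, elapsed seconds)."""
    start = time.perf_counter()
    rows = conn.execute(sql).fetchall()
    return rows, time.perf_counter() - start

join_rows, join_secs = timed(SELF_JOIN)
pass_rows, pass_secs = timed(SINGLE_PASS)
print(f"self-join: {join_secs:.4f}s, single pass: {pass_secs:.4f}s")
```

Before comparing the timings, it is worth confirming that `join_rows` and `pass_rows` are identical, so that the two queries being measured really do return the same result.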

by Adriano Martins • **454** points · Answer 1 · 2018-03-21T19:47:51+00:00

No, because two different rows are being looked up.
The best solution here would be to restructure the tables so that each attribute becomes its own column, avoiding string comparisons on attribute names (which, even with indexed strings, would be slower than declaring each attribute directly as a typed column):

Product Table:

| id  | nome      |
|-----|-----------|
| aaa | Produto A |
| bbb | Produto B |
| ccc | Produto C |

Attributes Table:

| id_produto | tamanho | cor     |
|------------|---------|---------|
| aaa        | M       | azul    |
| bbb        | P       | preto   |
| ccc        | G       | amarelo |

by Miguel Vidigal • 1 point · Answer 2 · 2018-03-21T20:38:09+00:00

You can build a WITH clause over the attributes table, grouped by id, with a CASE expression for each attribute, and then join it to the product table. Use the "MATERIALIZE" hint to optimize performance. You can enable parallelism as well.

That way you don't duplicate the table. If needed, I can try to explain it better.
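One possible reading of the WITH clause described above is conditional aggregation: read the attributes table once, group by product id, and use one CASE expression per attribute. The sketch below is only an interpretation of that description, run against SQLite through Python's `sqlite3` module so it is easy to reproduce. The CTE name `attrs` is made up for the example, and the MATERIALIZE hint and parallelism are Oracle-specific and left out.

```python
import sqlite3

# In-memory SQLite database loaded with the sample data from the question.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE produto (id TEXT PRIMARY KEY, nome TEXT);
    CREATE TABLE atributos (id_produto TEXT, atributo TEXT, valor TEXT);
    INSERT INTO produto VALUES
        ('aaa', 'Produto A'), ('bbb', 'Produto B'), ('ccc', 'Produto C');
    INSERT INTO atributos VALUES
        ('aaa', 'cor', 'azul'),    ('aaa', 'tamanho', 'M'),
        ('bbb', 'cor', 'preto'),   ('bbb', 'tamanho', 'P'),
        ('ccc', 'cor', 'amarelo'), ('ccc', 'tamanho', 'G');
""")

# The WITH clause reads atributos once and folds each product's rows
# into a single row, using one CASE expression per attribute.
# "attrs" is a hypothetical name chosen for this example.
QUERY = """
    WITH attrs AS (
        SELECT id_produto,
               MAX(CASE WHEN atributo = 'cor'     THEN valor END) AS cor,
               MAX(CASE WHEN atributo = 'tamanho' THEN valor END) AS tamanho
        FROM atributos
        GROUP BY id_produto
    )
    SELECT p.nome, a.cor, a.tamanho
    FROM produto p
    JOIN attrs a ON a.id_produto = p.id
    ORDER BY p.nome
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)
```

The aggregate ignores the NULLs produced by the non-matching CASE branches, so each product ends up with one row holding both values, and the attributes table appears only once in the statement.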

by DiegoSantos • **1,004** points · Answer 3 · 2018-03-21T21:01:02+00:00

Here’s an example with Pivot, see if it fits your case:

Pivot script:

SELECT
    pvt.nome,
    pvt.cor,
    pvt.tamanho
FROM
    (
    select
        p.nome,
        a.atributo,
        a.valor
    from #produto p
    inner join #atributos a on p.id = a.id_produto
    group by p.nome, a.atributo, a.valor
    ) as x
PIVOT (max(valor) FOR atributo IN ([cor], [tamanho])) pvt
ORDER BY 1;

Result:

| nome      | cor     | tamanho |
|-----------|---------|---------|
| Produto A | azul    | M       |
| Produto B | preto   | P       |
| Produto C | amarelo | G       |

Script for database creation (SQL Server):

create table #produto(
    id varchar(50) primary key,
    nome varchar(50)
);

create table #atributos(
    id_produto varchar(50) references #produto(id),
    atributo varchar(50) not null,
    valor varchar(50) not null
);

insert into #produto values
('aaa', 'Produto A'),
('bbb', 'Produto B'),
('ccc', 'Produto C');

insert into #atributos values
('aaa', 'cor', 'azul'),
('aaa', 'tamanho', 'M'),
('bbb', 'cor', 'preto'),
('bbb', 'tamanho', 'P'),
('ccc', 'cor', 'amarelo'),
('ccc', 'tamanho', 'G');
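The question notes that attribute types are generated dynamically, while the pivot above lists `[cor]` and `[tamanho]` by hand. One way to cope with that, sketched here under assumptions, is to read the distinct attribute names first and build the column list at run time. The example uses SQLite through Python's `sqlite3` module with `MAX(CASE ...)` in place of PIVOT (which SQLite does not have). `pivot_query` is a hypothetical helper written for this illustration, and on SQL Server the same idea would need to be adapted, for instance by building the IN list as dynamic SQL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE produto (id TEXT PRIMARY KEY, nome TEXT);
    CREATE TABLE atributos (id_produto TEXT, atributo TEXT, valor TEXT);
    INSERT INTO produto VALUES
        ('aaa', 'Produto A'), ('bbb', 'Produto B'), ('ccc', 'Produto C');
    INSERT INTO atributos VALUES
        ('aaa', 'cor', 'azul'),    ('aaa', 'tamanho', 'M'),
        ('bbb', 'cor', 'preto'),   ('bbb', 'tamanho', 'P'),
        ('ccc', 'cor', 'amarelo'), ('ccc', 'tamanho', 'G');
""")

def pivot_query(attributes):
    """Build a query with one MAX(CASE ...) column per attribute name.

    Hypothetical helper: attribute values are bound as parameters (?),
    and each column alias is quoted as an identifier, with any embedded
    double quote doubled.
    """
    columns = ",\n               ".join(
        'MAX(CASE WHEN a.atributo = ? THEN a.valor END) AS "{}"'.format(
            name.replace('"', '""'))
        for name in attributes
    )
    return f"""
        SELECT p.nome AS nome,
               {columns}
        FROM produto p
        JOIN atributos a ON a.id_produto = p.id
        GROUP BY p.id, p.nome
        ORDER BY p.nome
    """

# Discover the attribute names that currently exist in the data.
attributes = [r[0] for r in conn.execute(
    "SELECT DISTINCT atributo FROM atributos ORDER BY atributo")]

# One bound parameter per generated CASE expression, in the same order.
cursor = conn.execute(pivot_query(attributes), attributes)
header = [d[0] for d in cursor.description]
rows = cursor.fetchall()

print(header)
for row in rows:
    print(row)
```

A product that lacks one of the attributes would get NULL in that column, which may or may not be what the real scenario calls for.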