Timestamp difference and its average by date

balaji81_k · Active User Joined: 29 Jun 2005 Posts: 155

Hi Team,

I am working on the call log table where we have connection timestamp between customer(A) and agent (B) at two different TS .Need is to calculate the difference between two connection ts and take the average by date .

Table Data looks like :-

Rohit Umarjikar · Posted: Fri May 05, 2023 11:51 am

I don’t find you are referencing column names consistently right in all your queries from the input table data .
Second , you don’t need spaces in last column so you can filter that as well .. you may represent the column names as correct as used in query.

sergeyken · Posted: Fri May 05, 2023 2:06 pm

From you last SELECT code one cannot understand your intentions…

Definitely, one single SELECT is not able to handle several different grouping and distinct fields; you may need to use extra sub-SELECT, or VIEWS, or WITH, or some other tricks.

Please, give a clear sample of your input, and desired output.

balaji81_k · Active User Joined: 29 Jun 2005 Posts: 155

Hi Sergeyken,
I tried with inner join but still i am not getting the expected output.

Table data :-

balaji81_k · Active User Joined: 29 Jun 2005 Posts: 155

Basically, i need to find the difference of connected_at which is timestamp and then group by call_id and take the average of difference by date .
Please correct me if i need to use call_id alone. I feel strongly by call_id and then find TS difference and take average by call_id and then calculate the average = TS(Difference) by call_id/ total call_id of the day

sergeyken · Posted: Sat May 06, 2023 2:10 pm

As I’ve mentioned above, you need the construction of two SELECTs, the outer one, and the inner one.

balaji81_k · Active User Joined: 29 Jun 2005 Posts: 155

Hi Sergeyken, I am trying the build SQL with subselect (inner and outer SQL's) but the outer SQL fails with reference column (connect_at) is not found and it is not part of inner query.
Can you please help me on how to get aggregation by date(conected_at).

sergeyken · Posted: Mon May 08, 2023 4:50 pm

I deliberately don’t want to give you a ready-to-copy-and-paste solution, but insist you to construct it by yourself. Otherwise you’ll never be able to complete a similar task for other projects.

Please, try to do it step by step, not throwing all together in one huge pile.

Step 1. Write and debug separately the SELECT supposed to become the inner part of your future combined query.

Step 2. Do it only after step 1 is 100% done. Write the outer SELECT, using the SQL code from step 1 within the brackets after the outer FROM keyword.

The SQL for step 1 may be like this one (though I hate to give out the finished code…)

balaji81_k · Active User Joined: 29 Jun 2005 Posts: 155

Thankyou Sergeyken, its working . I understand on taking the distinct values and calculate the average as we have two entries for each call_id. I calculated randomly for 5 days and it is matching .

sergeyken · Posted: Mon May 08, 2023 7:25 pm

The extra DISTINCT keyword is wrong. It may eliminate required values when the same time_difference amounts do appear more than once on the same date.

This operation must be done ONLY via GROUP BY t.connected_date clause!

This situation remains unnoticed during a primitive test with limited input data, but it may produce wrong results in production run.

balaji81_k · Active User Joined: 29 Jun 2005 Posts: 155

Hi Sergeyken,
I have removed the DISTINCT Clause. Thankyou .

sergeyken · Posted: Wed May 10, 2023 11:49 pm

Just to make this tricky thing more clear. Let's say we have the following input data: