Saturday, February 25, 2012

Question on data mining report performance optimization

Hi, all experts here,

Would any of you give me any ideas for how could we optimize the report on data mining models? (as we know, for the data mining report, we have to select the mining model and the case table)

Hope it is clear for your advices and help.

Thanks a lot in advance and I am looking forward to hearing from you shortly.

With best regards,

Yours sincerely,

What kind of report is this? An accuracy report, such as lift chart or scatter plot?

Also, what kind of algorithm is involved?

|||

Yes, accuracy report and lift chart report. All SQL Server 2005 built-in algorithms there.

Any suggestions?

With best regards,

Yours sincerely,

|||

Service Pack 2 added some performance improvements in generating accuracy reports for the Naive Bayes algorithm.

Besides this, there is not much you can do to improve the performance of these reports.

You could try to use the tip here: http://sqlserverdatamining.com/DMCommunity/TipsNTricks/4566.aspx and implement your own accuracy chart using reporting services

What kind of problems are you running into? Could you please let us know how many test cases are you using and how much time does it take to populate the report?

|||The performance of the lift chart/etc. is directly related to the number of cases you are testing. I.e. if you cut the number of cases in half, you will cut the time in half.|||

Hi, all,

Thanks a lot for your very kind advices and help.

With best regards,

Yours sincerely,

No comments:

Post a Comment