Good news: I use nsight today, bad news: nisght cannot watch the performance of each kernel in the code.