Your problem is not caused by the shared memory configuration but by the number of threads you are launching.
Devices of compute capability 2.0 and higher have 64 KB of on-chip memory per SM. This is configurable as 16 KB L1 and 48 KB shared memory, or as 48 KB L1 and 16 KB shared memory (also 32 KB/32 KB on compute capability 3.x).
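As a rough sketch of how to check both things, the program below queries the device limits, requests the 48 KB shared memory split for a kernel, and clamps the block size to what the device allows. The kernel `scaleKernel` and the helper `blockSizeFits` are made-up names for illustration, not taken from your code, and the value 2048 is only an assumed example of an oversized launch.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel, present only so there is something to configure and launch.
__global__ void scaleKernel(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Hypothetical helper: true when the requested block size is within the device limit.
bool blockSizeFits(int threadsPerBlock, int maxThreadsPerBlock) {
    return threadsPerBlock > 0 && threadsPerBlock <= maxThreadsPerBlock;
}

int main() {
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, 0) != cudaSuccess) {
        fprintf(stderr, "no usable CUDA device\n");
        return 1;
    }
    printf("max threads per block: %d\n", prop.maxThreadsPerBlock);
    printf("shared memory per block: %zu bytes\n", prop.sharedMemPerBlock);

    // Ask for the larger shared memory split. This is a preference, not a guarantee.
    cudaFuncSetCacheConfig(scaleKernel, cudaFuncCachePreferShared);

    const int n = 1 << 20;
    float *d_data = nullptr;
    if (cudaMalloc(&d_data, n * sizeof(float)) != cudaSuccess) return 1;
    cudaMemset(d_data, 0, n * sizeof(float));

    int threads = 2048;  // assumed example of a block size that is too large
    if (!blockSizeFits(threads, prop.maxThreadsPerBlock)) {
        threads = prop.maxThreadsPerBlock;  // clamp to the device limit
    }
    int blocks = (n + threads - 1) / threads;

    scaleKernel<<<blocks, threads>>>(d_data, n);

    // A launch with too many threads per block fails here, not at compile time.
    cudaError_t err = cudaGetLastError();
    if (err != cudaSuccess) {
        fprintf(stderr, "launch failed: %s\n", cudaGetErrorString(err));
    }
    cudaDeviceSynchronize();
    cudaFree(d_data);
    return err == cudaSuccess ? 0 : 1;
}
```

Checking `cudaGetLastError()` right after the launch is what usually reveals this kind of problem, since an invalid launch configuration otherwise fails silently and the kernel simply never runs.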
I can't really give you an answer, but what I can give you is a way to a solution: you have to find the angle that you relate to or that piques your interest. A good paper is one that people get drawn into because it reaches them in some way. As for me, when I think of WWII, I think of the Holocaust and the effect it had on the survivors, their families, and those who stood by and did nothing until it was too late.