Here are the prompts and images from my current testing:
Japanese Token
style: No Style
prompt: (3d model in top down view:1.1), (from above:1.3), from behind, plain background, person with a katana, edo style
n_prompt: soft shadows, hard shadows, reflections
res: 512x512
seed: random
Thief in White Clothing
style: No Style
prompt: (3d model in top down view:1.1), (from above:1.3), from behind, plain background BREAK a male thief with white clothes knife in hand, fantasy d&d style
n_prompt: soft shadows, hard shadows, reflections, disc
res: 512x512
seed: random
We can probably put more emphasis on "from above" or "3d model in top down view", but with any weight higher than 1.5 the result gets muddy.
Essentially, adding:
(3d model in top down view:1.1), (from above:1.3), from behind, plain background
before your actual prompt will make the image look like it is seen from the top down.
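
If you are generating a lot of tokens, the prefix can be assembled with a small script instead of pasting it each time. This is only a rough sketch: the function names (weighted, top_down_prompt) and the MAX_WEIGHT constant are made up for illustration and are not part of any UI or library. The 1.5 cap just encodes the "muddy above 1.5" observation from my tests, and the (term:weight) and BREAK syntax is copied from the prompts above.

```python
# Hypothetical helpers for building the top-down prompt prefix.
# Nothing here calls a real image-generation API; it only builds strings.

MAX_WEIGHT = 1.5  # assumption taken from the tests above: higher gets muddy


def weighted(term: str, weight: float) -> str:
    """Format a term in the (term:weight) emphasis syntax used above."""
    if weight > MAX_WEIGHT:
        raise ValueError(f"weight {weight} is above {MAX_WEIGHT}")
    return f"({term}:{weight})"


def top_down_prompt(
    subject: str,
    model_weight: float = 1.1,
    above_weight: float = 1.3,
    use_break: bool = False,
) -> str:
    """Put the top-down prefix in front of the actual subject prompt."""
    prefix = ", ".join(
        [
            weighted("3d model in top down view", model_weight),
            weighted("from above", above_weight),
            "from behind",
            "plain background",
        ]
    )
    # The thief example separates prefix and subject with BREAK,
    # the katana example with a plain comma.
    separator = " BREAK " if use_break else ", "
    return prefix + separator + subject


if __name__ == "__main__":
    print(top_down_prompt("person with a katana, edo style"))
```

Passing use_break=True reproduces the layout of the thief prompt, and raising above_weight lets you try stronger emphasis without accidentally going past the cap.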