Supporting only a single GPU per instance is a huge issue. Also, do you plan on adding an API, or at least a remote web page?
OK, I'm planning to support multiple GPUs per instance. Personally, I have found running one instance per GPU to be very stable and more flexible, but there seem to be some reports that it does influence the other GPUs. It should be no big deal to implement.
Which kind of API would you suggest? Is there a standard?