I ran an uncensored model on a CPU server. As expected, it's dead slow (a minute or two per query).
What kind of hardware (GPU) do I need to serve 1k RPS?
I couldn't find any APIs for uncensored models, which kind of forced me to run it locally.