@Antix_ This isn't the place for seeking advice, but the introduction section - only for introducing yourself to the community. Next time, please direct any questions you may have here:
http://utaforum.net/boards/questions-support/
To answer your question: the output of your voicebank is dependent on the following factors:
- mic quality: In order from least to greatest: built in laptop mic, phone, headset, blue brand name usb mics, berhinger c1u usb, low-end studio xlr [without sophisticated interface], low-end studio xlr [with more sophisticated interface], high end studio xlr with high end interface. Anything beyond your laptop or phone is going to involve money - $150 at the very least for low end xlr mics (cheap inferface included). That said, don't let it dissuade you if your only just starting out with your first bank.
Just make the best out of what you have and can afford.
- oto: While the raw output is very important (mic/voice clarity, same stable pitch for all samples, and lack of noise), the configuration (oto) is more important. You said that everything is 'set to 0,' so I'm going to assume that the bank isn't oto'd at all. No matter the quality of the raw samples, they'll never sound good if they aren't oto'd properly. If you're unable to oto, there are people here who are willing to help (Adlez, JeremyB796, myself).
- resampler: not every resampler plays nice with every voice. If you're using UTAU for PC (Windows), there are plenty of resampler options that may give a better result than the default resampler.exe (which is more true to the voice, but can be a little harsh).