I used RAG with LLAMA3 for AI bot. I find RAG with chromadb is much slower than call LLM itself. Following the test result, with just one simple web page about 1000 words, it takes more than 2 seconds
问题 I have access to a 12 core machine and some matlab code that relies heavily on fftn. I would like to speed up my code. Since the fft can be parallelized I would think that more cores would help bu