W
walala
Guest
Dear all,
I guess this is a ray-tracing problem... But I need to do this task in as
high as possible speed/throughput. Here is my problem:
Suppose I am given 25 rays and I am given a 3D cube and all parameters of
these rays and cube are given...
I need to compute the length of the intersecting segment of the rays with
this cube as fast as possible. If some rays completely fall outside of the
cube, then it outputs 0, otherwise gives the length.
I heard there are some very good graphic card with accelerator... and I
heard about the bus bandwidth to be as high as 500MHz... I am not sure if
they have good accelaration function for doing my task?
I also think of doing this using an FPGA which is hooked onto a Intel PC
with Linux... I don't know the details, but I guess it uses PCI or other bus
to interact with the CPU and serve as an coprocessor...
I want to know which method is better?
Considering that after solving this throughput problem, the next bottleneck
will be a 1GB memory that I need... I wonder if the graphic card has 1GB
cache/memory inside it? Since a lot time it needs to do triple-buffling, I
guess... it should have a high speed huge memory, right?
I also don't know what is the maximum processing speed of a high-end
graphical card comparing with a high end FPGA implementation?
Can anybody give me some comments/suggestions/advice/hints/pointers on this?
Thanks a lot,
-Walalal
I guess this is a ray-tracing problem... But I need to do this task in as
high as possible speed/throughput. Here is my problem:
Suppose I am given 25 rays and I am given a 3D cube and all parameters of
these rays and cube are given...
I need to compute the length of the intersecting segment of the rays with
this cube as fast as possible. If some rays completely fall outside of the
cube, then it outputs 0, otherwise gives the length.
I heard there are some very good graphic card with accelerator... and I
heard about the bus bandwidth to be as high as 500MHz... I am not sure if
they have good accelaration function for doing my task?
I also think of doing this using an FPGA which is hooked onto a Intel PC
with Linux... I don't know the details, but I guess it uses PCI or other bus
to interact with the CPU and serve as an coprocessor...
I want to know which method is better?
Considering that after solving this throughput problem, the next bottleneck
will be a 1GB memory that I need... I wonder if the graphic card has 1GB
cache/memory inside it? Since a lot time it needs to do triple-buffling, I
guess... it should have a high speed huge memory, right?
I also don't know what is the maximum processing speed of a high-end
graphical card comparing with a high end FPGA implementation?
Can anybody give me some comments/suggestions/advice/hints/pointers on this?
Thanks a lot,
-Walalal