Quote: "it uses modified DBProBasic3DDebug.dll"
Exactly

Thus it is a recompilation of a base library rather than an external one. But yeah, I see your point.
As for the vertex data speed-up, have you tried using
get vertexdata ptr? That will give you as much of a speedup as you can get in DBPro since all you'll be doing is writing directly to the buffer without any extra function calling overhead or safety checks (unfortunately you will still get the function call overhead from the memory manipulation functions unless you only push 32-bit data, which may also be implemented internally as a function call, I'm not quite sure).
Also, what kind of speed increase to do with instances do you propose?
"Why do programmers get Halloween and Christmas mixed up?" Because Oct(31) = Dec(25)