help needed with Perlin noise generator

Green Gandalf

VIP Member

19

Years of Service

User Offline

Joined: 3rd Jan 2005

Playing: Malevolence:Sword of Ahkranox, Skyrim, Civ6.

Posted: 17th Feb 2008 21:09

I've been having one of those really frustrating days.

I've written a Perlin noise generator to replace my earlier cloud image generating program which used the "diamond-square algorithm".

The resulting image was produced quickly and efficiently and looked like a green cloudy black sky (before you ask, it was supposed to look like a green cloudy black sky) - but wasn't quite right (see attached image). At first glance it seems OK, but on closer inspection there are visible lines or seams along the "power of two pixel" boundaries.

After checking for all sorts of rounding errors, misaligned byte boundaries and such, it dawned on me that I was using simple bilinear interpolation instead of "Perlin" interpolation. This explained everything.

I therefore changed the interpolation to Perlin interpolation (so I believe) and expected a perfect image to pop out. It didn't. And it's much worse - and I just can't see why.

Here's the code I'm using (it's not optimised yet, so I know it can be speeded up in places):

` Green Gandalf's Perlin style noise function - Version 3
` Created 25 June 2007, modified 17 February 2008.

`   Uses suggestions from following website
`      http://www.mandelbrot-dazibao.com/Perlin/Perlin1.htm

`   basic idea works with simple bilinear interpolation
`    - but has visible interior seams
`   doesn't work well at all with Perlin style interpolation

sync on: sync rate 0: sync
set display mode 800, 600, 32

randomize 140549 ` arbitrary fixed number for reproducibility

autocam off
position camera 0, 50, -300
point camera 0, 0, 0

create bitmap 1, 512, 512

global nOct = 8
global nGrid
global twoPiInv as float
twoPiInv = 0.159154943

nGrid = 2^nOct

dim weight#(nOct)
dim interpWeight(3) as float
dim tangent(3) as float

dim rawNoise(511, 511) as float

p# = 2.0
for i=0 to nOct
  weight#(i) = 1.0/p#^i
next i

dim a(nGrid, nGrid, nOct) as float
dim b(nGrid, nGrid, nOct) as float

for oct = 0 to nOct
  for i = 0 to nGrid
    for j = 0 to nGrid
      a(i, j, oct) = rnd(100) * 0.01  ` random value in range 0 to 1
      b(i, j, oct) = rnd(100) * 0.01  ` random value in range 0 to 1
                                             ` not seamless yet
    next j
  next i
next oct

` calculate raw noise and find scale factors for
` rescaling noise values to byte range
minNoise# = 1000  ` arbitrary large number out of range
maxNoise# = -1000 ` arbitrary small number out of range
for x = 0 to 511
  for y = 0 to 511
    u# = x * 0.001953125 ` value in range 0 to 1
    v# = y * 0.001953125
    rawNoise(x, y) = noise(u#, v#)
    if minNoise# > rawNoise(x, y)
      minNoise# = rawNoise(x, y)
    else
      if maxNoise# < rawNoise(x, y) then maxNoise# = rawNoise(x, y)
    endif
  next y
next x

f1# = 255.0/(maxNoise# - minNoise#)
f2# = minNoise# * 255.0/(maxNoise# - minNoise#)

lock pixels
  for x = 0 to 511
    for y = 0 to 511
      ` convert raw noise to byte range 0 - 255
      c = rawNoise(x, y) * f1# - f2#
      ` just in case
      if c<0 then c=0
      if c>255 then c=255
      dot x, y, rgb(0, c, 0) ` a nice green colour :)
    next y
  next x
unlock pixels

copy bitmap 1, 0, 0, 511, 511,  0, 144, 44, 655, 555

set current bitmap 0

repeat
  text 20, 20, "All done!"
  sync
until spacekey()

set current bitmap 1
get image 1, 0, 0, 512, 512
save image "test v3.png", 1
end

function noiseBase(oct, x as float, y as float)
  ` x and y both in range 0 to 1
  ` find correct "tile" to use
  tGrid = 2^oct
  x# = x * tGrid
  y# = y * tGrid
  i = floor(x#)
  if i >= tGrid then i = tGrid - 1 ` just in case
  j = floor(y#)
  if j >= tGrid then j = tGrid - 1 ` just in case
  x# = x# - i  ` number in range 0 to 1
  y# = y# - j
  ` calculate interpolation weights
  ` Perlin weights - image not right at all WHY?
  remstart
  ix1# = x# * x# * (3.0 - 2.0 * x#)
  ix0# = 1.0 - ix1#
  iy1# = y# * y# * (3.0 - 2.0 * y#)
  iy0# = 1.0 - iy1#
  remend
  ` simple bilinear weights - should show slight seams
  ` because derivatives not matched correctly
`  remstart
  ix0# = 1.0 - x#
  ix1# = x#
  iy0# = 1.0 - y#
  iy1# = y#
`  remend
  ` simple sine weights (a variation of the Perlin idea)
  remstart
  ix1# = x# - sin(360.0 * x#) * twoPiInv
  ix0# = 1.0 - ix1#
  iy1# = y# - sin(360.0 * y#) * twoPiInv
  iy0# = 1.0 - iy1#
  remend
  interpWeight(0) = ix0# * iy0#
  interpWeight(1) = ix0# * iy1#
  interpWeight(2) = ix1# * iy0#
  interpWeight(3) = ix1# * iy1#
  ` calculate tangents at four corners of tile
  tangent(0) = a(i, j, oct) * x# + b(i, j, oct) * y#
  tangent(1) = a(i, j + 1, oct) * x# + b(i, j + 1, oct) * (y# - 1.0)
  tangent(2) = a(i + 1 , j, oct) * (x# - 1.0) + b(i + 1, j, oct) * y#
  tangent(3) = a(i + 1, j + 1, oct) * (x# - 1.0) + b(i + 1, j + 1, oct) * (y# - 1.0)
  ` calculate interpolated noise value
  result# = 0.0
  for corner = 0 to 3
    result# = result# + interpWeight(corner) * tangent(corner)
  next corner
endfunction result#

function noise(x as float, y as float)
  result# = 0.0
`  result# = noiseBase(1, x, y)
  for oct = 0 to nOct
    result# = noiseBase(oct, x, y) * weight#(oct) + result#
  next oct
endfunction result#

The next post contains the very faulty image produced using what I believe is the correct Perlin interpolation.

The only changes in the code used for the two images are in the following lines where I just comment/uncomment the appropriate sections:

  ` calculate interpolation weights
  ` Perlin weights - image not right at all WHY?
  remstart
  ix1# = x# * x# * (3.0 - 2.0 * x#)
  ix0# = 1.0 - ix1#
  iy1# = y# * y# * (3.0 - 2.0 * y#)
  iy0# = 1.0 - iy1#
  remend
  ` simple bilinear weights - should show slight seams
  ` because derivatives not matched correctly
`  remstart
  ix0# = 1.0 - x#
  ix1# = x#
  iy0# = 1.0 - y#
  iy1# = y#
`  remend

The above gives the bilinear filtering which is almost right.
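For comparison, here is a minimal C++ sketch of the two weight functions being swapped in the snippet above. The function names are hypothetical and not taken from the posted code; only the formulas match (`t` for the bilinear weight, `t * t * (3 - 2t)` for the Perlin weight). Both weights run from 0 to 1, but only the cubic one has zero slope at both ends, which is the property that should let neighbouring tiles join without a crease.

```cpp
// Linear weight: passes through 0 and 1, but its slope is 1 everywhere,
// so the blended result can change slope abruptly at a tile boundary.
inline float linearWeight(float t) { return t; }

// Perlin's original cubic weight 3t^2 - 2t^3 (same formula as ix1# above).
inline float cubicWeight(float t) { return t * t * (3.0f - 2.0f * t); }

// Slope of the cubic weight, 6t - 6t^2: zero at both t = 0 and t = 1.
inline float cubicWeightSlope(float t) { return 6.0f * t * (1.0f - t); }

// Blend two values a and b with a weight w in the range 0 to 1.
inline float blend(float a, float b, float w) { return a * (1.0f - w) + b * w; }
```

Calling `blend(a, b, cubicWeight(t))` instead of `blend(a, b, linearWeight(t))` is the whole difference between the two variants in this sketch; nothing else about the tile lookup changes.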

I'm probably missing something very obvious but just can't see it.

Green Gandalf

Posted: 17th Feb 2008 21:10

Here's the second image.

El Goorf

17

Years of Service

User Offline

Joined: 17th Sep 2006

Location: Uni: Manchester, Home: Dunstable

Posted: 18th Feb 2008 01:52

Ah yeah, I noticed this issue with diamond-square. It should kind of be expected from the fact that you're breaking the image down into squares. I couldn't really find much to do to fix this, though I didn't think it would be an issue with Perlin noise; I guess you just proved me wrong.

If you find a solution, feel free to email it to me ^_^

http://notmybase.com
All my base are not belong to anyone.

Omen

17

Years of Service

User Offline

Joined: 7th Nov 2006

Location: Maple Grove, MN US

Posted: 18th Feb 2008 02:20 Edited at: 18th Feb 2008 02:23

@GG,

Take a look at this page, and especially the "putting it all together" section at the bottom:

http://freespace.virgin.net/hugo.elias/models/m_perlin.htm

or you could go straight to the source:

http://mrl.nyu.edu/~perlin/

...hope that helps.

Lakehome Games

Green Gandalf

Posted: 18th Feb 2008 12:15

Quote: "though I didn't think it would be an issue with Perlin noise"

Neither did I - and it shouldn't be according to my maths. But I've obviously gone wrong somewhere.

Quote: "or you could go straight to the source:"

I've already seen that - but it's quite possible I've slipped up somehow. Just can't see where. Might be worth another look at the source now that I've got the basic idea working. So, thanks for the nudge.

Quote: "If you find a solution, feel free to email it to me"

Will do - but don't expect an instant solution. No time for thinking till the weekend now.

Mr Kohlenstoff

17

Years of Service

User Offline

Joined: 7th Jun 2006

Location: Germany

Posted: 18th Feb 2008 12:36

I used cosine interpolation for my perlin noise-algorithms and it works quite well. You could also use this cubic interpolation to make it more natural, but because it's able to grow out of the amplitude you should be careful with converting your noise-values into colors.
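To make the two options concrete, here is a rough C++ sketch. `cosineInterpolate` is a hypothetical helper written for illustration; `cubicInterpolate` is assumed to be the four-point cubic that dark coder posts further down this thread as `CubicInterpolate`, since the post above does not say exactly which cubic it means.

```cpp
#include <cmath>

// Cosine interpolation between a and b for x in the range 0 to 1.
// The weight (1 - cos(pi * x)) / 2 stays inside 0..1, so the result
// never leaves the range spanned by a and b.
inline float cosineInterpolate(float a, float b, float x) {
    const float pi = 3.14159265358979f;
    float w = (1.0f - std::cos(x * pi)) * 0.5f;
    return a * (1.0f - w) + b * w;
}

// Four-point cubic through v1 (x = 0) and v2 (x = 1), using the
// neighbours v0 and v3 to shape the curve. Assumed to match the
// CubicInterpolate routine posted later in the thread.
inline float cubicInterpolate(float v0, float v1, float v2, float v3, float x) {
    float P = (v3 - v2) - (v0 - v1);
    float Q = (v0 - v1) - P;
    float R = v2 - v0;
    float S = v1;
    return ((P * x + Q) * x + R) * x + S;
}
```

With samples 0, 1, 1, 0 the cubic returns 1.25 halfway between the two middle points, above both of them. That is the overshoot to watch for when mapping noise values to the 0 to 255 colour range: clamp, or rescale from the measured minimum and maximum.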

By the way, aren't diamond-square and Perlin noise actually the same, more or less? Both are fractal noise algorithms and work in more or less the same way, at least that's what I thought.

However, good luck with your green clouds and black sky.

dark coder

21

Years of Service

User Offline

Joined: 6th Oct 2002

Location: Japan

Posted: 18th Feb 2008 13:38

Didn't really look at your code, but I just made my own Perlin noise gen which I find easier to follow. It has both bicubic and bilinear filtering (you can comment/uncomment them). Hopefully it will help.

Load DLL "user32.dll"   , 1
displayX = Call DLL( 1 , "GetSystemMetrics" , 0 )
displayY = Call DLL( 1 , "GetSystemMetrics" , 1 )
Delete DLL 1

Set Display Mode displayX, displayY, 32, 1
Sync On

// Seed
Randomize 1337

// You can manually specify the resolution for higher detail noise
`startingRes     = 512
`totalOctaves    = 9

totalOctaves    = 8
startingRes     = 2 ^ totalOctaves

benchmark1Start = Timer()

// Generate noise for all octaves
for octaveID = 1 to totalOctaves
    memblockID  = octaveID
    octaveRes   = startingRes / 2 ^ (octaveID - 1)
    Make Memblock memblockID, octaveRes * octaveRes * 4 // A single float
    for x = 0 to octaveRes - 1
        for y = 0 to octaveRes - 1
            Write Memblock Float memblockID, ( x + y * octaveRes ) * 4, ( Rnd(2000) - 1000 ) * 0.001
        next
    next
next

benchmark1elapsed   = Timer() - benchmark1Start

benchmark2Start = Timer()

// add all layers together
memblockID  = totalOctaves + 1

Make Memblock memblockID, 12 + startingRes * startingRes * 4

Write Memblock DWord memblockID, 0, startingRes
Write Memblock DWord memblockID, 4, startingRes
Write Memblock DWord memblockID, 8, 32

minHeight#      = -1.9
maxHeight#      =  1.9
minInv#         = -minHeight#
heightRange#    =  maxHeight# - minHeight#
_255DivRange#   =  255.0 / heightRange#
sResReciprocal# =    1.0 / (startingRes)

for x = 0 to startingRes - 1
    for y = 0 to startingRes - 1
        x#          = x * sResReciprocal#
        y#          = y * sResReciprocal#
        height#     = 0.0
        Strength#   = 1.0
        `height# = Memblock Float( 1, ( x + y * startingRes ) * 4 )

        for OctaveID = totalOctaves to 1 step -1
        `for OctaveID = 7 to 7 step -1
            octaveRes = startingRes / 2 ^ (octaveID - 1)
            // Store the float pixel we are over on the current octave
            memblockX# = x# * octaveRes
            memblockY# = y# * octaveRes
            // Store the int version for pixel sampling
            memblockX  = memblockX#
            memblockY  = memblockY#
            // Get the local offset
            memblockX# = memblockX# mod 1.0
            memblockY# = memblockY# mod 1.0

            // BICUBIC ///////////
            `REMSTART
            sample1#  = Sample( OctaveID, memblockX - 1, memblockY - 1, octaveRes )
            sample2#  = Sample( OctaveID, memblockX    , memblockY - 1, octaveRes )
            sample3#  = Sample( OctaveID, memblockX + 1, memblockY - 1, octaveRes )
            sample4#  = Sample( OctaveID, memblockX + 2, memblockY - 1, octaveRes )
            mHeight1# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            sample1#  = Sample( OctaveID, memblockX - 1, memblockY    , octaveRes )
            sample2#  = Sample( OctaveID, memblockX    , memblockY    , octaveRes )
            sample3#  = Sample( OctaveID, memblockX + 1, memblockY    , octaveRes )
            sample4#  = Sample( OctaveID, memblockX + 2, memblockY    , octaveRes )
            mHeight2# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            sample1#  = Sample( OctaveID, memblockX - 1, memblockY + 1, octaveRes )
            sample2#  = Sample( OctaveID, memblockX    , memblockY + 1, octaveRes )
            sample3#  = Sample( OctaveID, memblockX + 1, memblockY + 1, octaveRes )
            sample4#  = Sample( OctaveID, memblockX + 2, memblockY + 1, octaveRes )
            mHeight3# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            sample1#  = Sample( OctaveID, memblockX - 1, memblockY + 2, octaveRes )
            sample2#  = Sample( OctaveID, memblockX    , memblockY + 2, octaveRes )
            sample3#  = Sample( OctaveID, memblockX + 1, memblockY + 2, octaveRes )
            sample4#  = Sample( OctaveID, memblockX + 2, memblockY + 2, octaveRes )
            mHeight4# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            mHeight#  = CubicInterpolate( mHeight1#, mHeight2#, mHeight3#, mHeight4#, memblockY# )
            `REMEND
            //////////////////////

            // BILINEAR //////////
            REMSTART
            sample1#  = Sample( OctaveID, memblockX    , memblockY, octaveRes )
            sample2#  = Sample( OctaveID, memblockX + 1, memblockY, octaveRes )
            mHeightX# = Linear_Interpolate( sample1#, sample2#, memblockX# )

            sample1#  = Sample( OctaveID, memblockX    , memblockY + 1, octaveRes)
            sample2#  = Sample( OctaveID, memblockX + 1, memblockY + 1, octaveRes)
            mHeightY# = Linear_Interpolate( sample1#, sample2#, memblockX# )

            mHeight#  = Linear_Interpolate( mHeightX#, mHeightY#, memblockY# )
            REMEND
            //////////////////////

            height#   = height# + mHeight# * Strength#
            // Make the next octave affect the output by half as much as this one
            Strength# = Strength# * 0.5
        next

        height  = (height# + minInv#) * _255DivRange#
        if height > 255 then height = 255
        if height <   0 then height = 0
        Write Memblock Byte memblockID, 12 + ( x + y * startingRes ) * 4    , height // B
        Write Memblock Byte memblockID, 12 + ( x + y * startingRes ) * 4 + 1, height // G
        Write Memblock Byte memblockID, 12 + ( x + y * startingRes ) * 4 + 2, height // R
        Write Memblock Byte memblockID, 12 + ( x + y * startingRes ) * 4 + 3, 255    // A
    next
next

benchmark2elapsed   = Timer() - benchmark2Start

// Create image
Make Image From Memblock 1, memblockID

// Loop to output result
do
    CLS

    Paste Image 1, 64, 64

    Text 5,  5, "Time to generate noise: " + Str$(benchmark1elapsed) + "ms"
    Text 5, 15, "Time to add octaves: "    + Str$(benchmark2elapsed) + "ms"

    Sync
loop

END

Function Sample( memblockID, x, y, memblockRes )

    // Wrap
    if x < 0 then x = x + memblockRes
    if y < 0 then y = y + memblockRes

    if x > memblockRes-1 then x = x - memblockRes
    if y > memblockRes-1 then y = y - memblockRes

    returnValue# = Memblock Float( memblockID, ( x + y * memblockRes ) * 4 )

Endfunction returnValue#

Function Linear_Interpolate( a as float, b as float, x as float )

    returnValue# =  a * (1.0 - x) + b * x

Endfunction returnValue#


Function CubicInterpolate( aMinOne as float, a as float, b as float, bAddOne as float, acrossAB as float )

    P as float
    Q as float
    R as float
    S as float

    P = (bAddOne - b) - (aMinOne - a)
    Q = (aMinOne - a) - P
    R = b - aMinOne
    S = a

    returnValue# =  P * acrossAB ^ 3 + Q * acrossAB ^ 2 + R * acrossAB + S

EndFunction returnValue#

Green Gandalf

Posted: 18th Feb 2008 22:57 Edited at: 19th Feb 2008 00:23

Quote: "Didn't really look at your code but I just made my own Perlin noise gen which I find easier to follow, it has both Bicubic and Bilinear filtering(you can comment, uncomment them). Hopefully it will help "

Thanks for that. It works nicely.

However, how would you extend it to 3D? The changes to my code are "obvious". How would you change yours?

I'm not yet sure how your cubic interpolation differs from the interpolation I'm trying to use - but whatever it is yours obviously works.

Omen

I've already seen that first reference you gave and decided not to use it because it looked unnecessarily complicated. The reference I gave made more sense to me:

http://www.mandelbrot-dazibao.com/Perlin/Perlin1.htm

and seemed closer to Perlin's suggestions to me. However, it's a while since I checked his website so perhaps my memory is playing tricks. I'll check it again and post back.

I notice that your first reference seems to use the same cubic interpolation that Dark Coder is using. It is also obvious from that first reference that the method extends to 3D (I found it hard to unravel what was going on in Dark Coder's memblock arrays - no doubt he thought the same about mine). At least his works.

Edit

I've just checked Perlin's website from Omen's second link and it confirms that I'm using (or trying to use) Perlin's original interpolators (you need to follow the links to his 2002 paper). Perlin has suggested a revised version which is supposed to be better. However, in my code it's still worse than bilinear filtering, so I must be mis-applying both of them in some way. Still no idea where the problem lies - or why simple bilinear filtering is so good in comparison in my code. There has to be a simple bug staring me in the face somewhere in my code ...
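For reference, a small C++ sketch of the two interpolators side by side. It assumes the "revised version" is the quintic 6t^5 - 15t^4 + 10t^3 from Perlin's 2002 improved-noise work; the function names are hypothetical and chosen only for this illustration.

```cpp
// Original interpolator: 3t^2 - 2t^3.
inline double fadeCubic(double t) { return t * t * (3.0 - 2.0 * t); }

// Revised interpolator (assumed): 6t^5 - 15t^4 + 10t^3.
inline double fadeQuintic(double t) {
    return t * t * t * (t * (t * 6.0 - 15.0) + 10.0);
}

// Second derivative of the cubic, 6 - 12t: non-zero at t = 0 and t = 1.
inline double fadeCubicSecondDerivative(double t) { return 6.0 - 12.0 * t; }

// Second derivative of the quintic, 120t^3 - 180t^2 + 60t,
// factored as 60t(2t - 1)(t - 1): zero at t = 0 and t = 1.
inline double fadeQuinticSecondDerivative(double t) {
    return 60.0 * t * (2.0 * t - 1.0) * (t - 1.0);
}
```

Both curves go from 0 to 1 with zero slope at each end, so either should remove first-derivative creases at tile boundaries. The quintic additionally has zero second derivative at the ends. If both look worse than bilinear, the interpolator formula itself is probably not where the fault lies.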

Gamer Making

17

Years of Service

User Offline

Joined: 20th Sep 2006

Location: sitting at the comp programming

Posted: 19th Feb 2008 03:09

What is a perlin Noise generator?

Bach Tran

Benjamin

21

Years of Service

User Offline

Joined: 24th Nov 2002

Location: France

Posted: 19th Feb 2008 03:12

I think it generates Perlin noise.

Multisync - TCP Server/Client Multiplayer Plugin (DBP/DBCe)

dark coder

21

Years of Service

User Offline

Joined: 6th Oct 2002

Location: Japan

Posted: 19th Feb 2008 11:01

Quote: "However, how would you extend it to 3D? The changes to my code are "obvious". How would you change yours?"

I haven't tried it, but I'd imagine that for linear interpolation at least you'd just do the standard 4 samples (2 per axis) on a 2D (X/Z) plane (while reading the current height), then do the same for the plane above, and then linearly interpolate between the two results based on the distance from the lower Y value.
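An untested-in-DBP sketch of that idea in C++, assuming the eight surrounding samples have already been fetched into a small array indexed as `c[y][z][x]`. The array layout and function names are hypothetical, chosen only for this illustration.

```cpp
// Linear blend between a and b for t in the range 0 to 1.
inline float linearInterpolate(float a, float b, float t) {
    return a * (1.0f - t) + b * t;
}

// Trilinear interpolation inside one cell.
// c[y][z][x] holds the eight corner samples (index 0 = lower, 1 = upper).
// tx, ty, tz are the local offsets inside the cell, each in 0..1.
inline float trilinear(const float c[2][2][2], float tx, float ty, float tz) {
    // Bilinear result on the lower X/Z plane (y index 0).
    float lower = linearInterpolate(
        linearInterpolate(c[0][0][0], c[0][0][1], tx),
        linearInterpolate(c[0][1][0], c[0][1][1], tx), tz);
    // Same again on the plane above (y index 1).
    float upper = linearInterpolate(
        linearInterpolate(c[1][0][0], c[1][0][1], tx),
        linearInterpolate(c[1][1][0], c[1][1][1], tx), tz);
    // Blend the two planes by the distance from the lower Y value.
    return linearInterpolate(lower, upper, ty);
}
```

A bicubic version would follow the same pattern with four planes of sixteen samples each instead of two planes of four, which is where the sample count starts to hurt.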

Also, if anyone's interested (while not totally related to these boards), I ported my code as best I could to GDK. Without any compiler optimizations it's almost exactly 2x the speed on my machine. Code's below:

#include "DarkGDK.h"
#include "Windows.h"
#include "Winuser.h"
#include <string>
#include "Math.h"

int displayX = GetSystemMetrics(SM_CXSCREEN);
int displayY = GetSystemMetrics(SM_CYSCREEN);

#define startingRes  512
#define totalOctaves 8

float* noiseMap[totalOctaves+1];

char globalString[256];

//#define INTERPOLATION_LINEAR
//#define INTERPOLATION_COSINE
#define INTERPOLATION_CUBIC

// Prototypes
float Sample( int octaveID, int x, int y, int octaveRes );
float LinearInterpolate( float a, float b, float x );
float CubicInterpolate( float aMinOne, float a, float b, float bAddOne, float acrossAB );
float CosineInterpolate( float a, float b, float x );

void DarkGDK ( void )
{
	// Set display
	dbSetWindowPosition( 0, 0 );
	dbSetDisplayMode( displayX, displayY, 32 );
	dbSetWindowLayout( 0, 0, 0 );
	
	// Init
	dbSyncOn();
	
	dbRandomize(1337);
	
	int benchmark1Start = dbTimer();
	
	// Generate noise for all octaves
	for( int octaveID = 1; octaveID <= totalOctaves; octaveID ++ )
	{
		int octaveRes   = startingRes / pow( 2.0f, (float) (octaveID - 1) );
		noiseMap[octaveID] = new float[octaveRes * octaveRes];
		int resLoopMax = octaveRes - 1;
		for( int x = 0; x <= resLoopMax; x ++ )
		{
			for( int y = 0; y <= resLoopMax; y ++ )
			{
				noiseMap[octaveID][x+y*octaveRes] = ( ((float) dbRnd(2000)) - 1000.0f ) * 0.001;
			}
		}
	}
	
	int benchmark1Elapsed = dbTimer() - benchmark1Start;
	
	int benchmark2Start = dbTimer();
	
	int memblockID  = 1;
	
	dbMakeMemblock( memblockID, 12 + startingRes * startingRes * 4 );
	
	dbWriteMemblockDword( memblockID, 0, startingRes );
	dbWriteMemblockDword( memblockID, 4, startingRes );
	dbWriteMemblockDword( memblockID, 8, 32			 );
	
	float minHeight			= -1.9;
	float maxHeight			=  1.9;
	float minInv			= -minHeight;
	float heightRange		=  maxHeight - minHeight;
	float _255DivRange		=  255.0 / heightRange;
	float sResReciprocal	=    1.0 / (startingRes);
	
	int resLoopMax = startingRes - 1;
	
	for( int x = 0; x <= resLoopMax; x ++ )
	{
		for( int y = 0; y <= resLoopMax; y ++ )
		{
			float fX		= ((float) x) * sResReciprocal;
			float fY		= ((float) y) * sResReciprocal;
			float height	= 0.0;
			float strength	= 1.0;
			float mHeight;
			
			for ( int octaveID = totalOctaves; octaveID >= 1; octaveID -- )
			{
				int   octaveRes = startingRes / pow( 2.0f, (float) (octaveID - 1) );
				float octavefX = fX * (float) octaveRes;
				float octavefY = fY * (float) octaveRes;
				int	  octaveX  = (int) octavefX;
				int	  octaveY  = (int) octavefY;
				octavefX = fmod( octavefX, 1.0f);
				octavefY = fmod( octavefY, 1.0f);
			
				// BICUBIC ///////////
				#ifdef INTERPOLATION_CUBIC
					float sample1  = Sample( octaveID, octaveX - 1, octaveY - 1, octaveRes );
					float sample2  = Sample( octaveID, octaveX    , octaveY - 1, octaveRes );
					float sample3  = Sample( octaveID, octaveX + 1, octaveY - 1, octaveRes );
					float sample4  = Sample( octaveID, octaveX + 2, octaveY - 1, octaveRes );
					float mHeight1 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

						  sample1  = Sample( octaveID, octaveX - 1, octaveY    , octaveRes );
						  sample2  = Sample( octaveID, octaveX    , octaveY    , octaveRes );
						  sample3  = Sample( octaveID, octaveX + 1, octaveY    , octaveRes );
						  sample4  = Sample( octaveID, octaveX + 2, octaveY    , octaveRes );
					float mHeight2 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

						  sample1  = Sample( octaveID, octaveX - 1, octaveY + 1, octaveRes );
						  sample2  = Sample( octaveID, octaveX    , octaveY + 1, octaveRes );
						  sample3  = Sample( octaveID, octaveX + 1, octaveY + 1, octaveRes );
						  sample4  = Sample( octaveID, octaveX + 2, octaveY + 1, octaveRes );
					float mHeight3 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

						  sample1  = Sample( octaveID, octaveX - 1, octaveY + 2, octaveRes );
						  sample2  = Sample( octaveID, octaveX    , octaveY + 2, octaveRes );
						  sample3  = Sample( octaveID, octaveX + 1, octaveY + 2, octaveRes );
						  sample4  = Sample( octaveID, octaveX + 2, octaveY + 2, octaveRes );
					float mHeight4 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

					mHeight  = CubicInterpolate( mHeight1, mHeight2, mHeight3, mHeight4, octavefY );
				#endif
				//////////////////////
				// COSINE ////////////
				#ifdef INTERPOLATION_COSINE
					float sample1  = Sample( octaveID, octaveX    , octaveY, octaveRes );
					float sample2  = Sample( octaveID, octaveX + 1, octaveY, octaveRes );
					float mHeightX = CosineInterpolate( sample1, sample2, octavefX );
					
						  sample1  = Sample( octaveID, octaveX    , octaveY + 1, octaveRes);
						  sample2  = Sample( octaveID, octaveX + 1, octaveY + 1, octaveRes);
					float mHeightY = CosineInterpolate( sample1, sample2, octavefX );
					
					mHeight = CosineInterpolate( mHeightX, mHeightY, octavefY );
				#endif
				/////////////////////
				// BILINEAR /////////
				#ifdef INTERPOLATION_LINEAR
					float sample1  = Sample( octaveID, octaveX    , octaveY, octaveRes );
					float sample2  = Sample( octaveID, octaveX + 1, octaveY, octaveRes );
					float mHeightX = LinearInterpolate( sample1, sample2, octavefX );
					
						  sample1  = Sample( octaveID, octaveX    , octaveY + 1, octaveRes);
						  sample2  = Sample( octaveID, octaveX + 1, octaveY + 1, octaveRes);
					float mHeightY = LinearInterpolate( sample1, sample2, octavefX );
					
					mHeight = LinearInterpolate( mHeightX, mHeightY, octavefY );
				#endif
				/////////////////////
				
				height   = height + mHeight * strength;
				strength = strength * 0.5f;
			}
			height   = (height + minInv) * _255DivRange;
			if (height > 255)
				height = 255;
			if (height <   0)
				height = 0;
			int iHeight = (int) height;
			dbWriteMemblockByte( memblockID, 12 + ( x + y * startingRes ) * 4    , iHeight ); // B
			dbWriteMemblockByte( memblockID, 12 + ( x + y * startingRes ) * 4 + 1, iHeight ); // G
			dbWriteMemblockByte( memblockID, 12 + ( x + y * startingRes ) * 4 + 2, iHeight ); // R
			dbWriteMemblockByte( memblockID, 12 + ( x + y * startingRes ) * 4 + 3, 255    ); // A
		}
	}
	
	dbMakeImageFromMemblock( 1, memblockID );
	
	int benchmark2Elapsed = dbTimer() - benchmark2Start; // time the octave pass from its own start
	
	// Output image and info
	while ( LoopGDK() )
	{
		sprintf( globalString, "Time to generate noise: %ims", benchmark1Elapsed );
		dbText( 5, 5, globalString );
		sprintf( globalString, "Time to add octaves: %ims", benchmark2Elapsed );
		dbText( 5, 15, globalString );
		
		dbPasteImage( 1, 64, 64 );
		
		dbSync();
	}
	
	return;
}

float Sample( int octaveID, int x, int y, int octaveRes )
{
	// Wrap
	if (x < 0)
		x += octaveRes;
	if (y < 0)
		y += octaveRes;
	
	if (x > octaveRes-1)
		x -= octaveRes;
	if (y > octaveRes-1)
		y -= octaveRes;
	
	return noiseMap[octaveID][x+y*octaveRes];
}

float LinearInterpolate( float a, float b, float x )
{
	return a * (1.0 - x) + b * x;
}
float CosineInterpolate( float a, float b, float x )
{
    float f = (1.0 - cos(x * 3.141592654)) * 0.5;
    
	return a * (1.0 - f) + b * f;
}
float CubicInterpolate( float aMinOne, float a, float b, float bAddOne, float acrossAB )
{
	float P, Q, R, S;
	
	P = (bAddOne - b) - (aMinOne - a);
	Q = (aMinOne - a) - P;
	R = b - aMinOne;
	S = a;
	
	return P * pow( acrossAB, 3.0f ) + Q * pow( acrossAB, 2.0f ) + R * acrossAB + S;
}

You can pick between the three interpolation methods by uncommenting exactly one of the INTERPOLATION_* defines.
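
To compare the three interpolators away from DarkGDK, something like the following standalone sketch may help. The formulas are the ones from the snippet above; the cubic is written in Horner form instead of with pow(), which is an assumption of this sketch rather than what the first version does. Each method should return `a` at 0 and `b` at 1, which is what keeps neighbouring grid cells joined up.

```cpp
#include <cmath>

// Straight-line blend between a and b.
float LinearInterpolate(float a, float b, float x)
{
    return a * (1.0f - x) + b * x;
}

// Remaps x through half a cosine wave so the slope is zero at both ends.
float CosineInterpolate(float a, float b, float x)
{
    float f = (1.0f - std::cos(x * 3.141592654f)) * 0.5f;
    return a * (1.0f - f) + b * f;
}

// Cubic through a (x = 0) and b (x = 1), shaped by the two outer samples.
// Same P, Q, R, S coefficients as in the post, evaluated in Horner form.
float CubicInterpolate(float aMinOne, float a, float b, float bAddOne, float x)
{
    float P = (bAddOne - b) - (aMinOne - a);
    float Q = (aMinOne - a) - P;
    float R = b - aMinOne;
    float S = a;
    return ((P * x + Q) * x + R) * x + S;
}
```

Note that the cubic can overshoot the range of its four input samples, so the summed height may need clamping before it is written out as a byte.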

I also made a far more optimized version of this, just to see how fast I could get the routine. With compiler optimizations I got it down to 427ms to generate a 512x512 pixel image with 8 octaves. That is far faster than DBP, where the same job takes 6800ms. My DBP code isn't quite as optimized, but there's only so much you can do there.

#include "DarkGDK.h"
#include "Windows.h"
#include "Winuser.h"
#include <string>
#include "Math.h"

// Get desktop res
int displayX = GetSystemMetrics(SM_CXSCREEN);
int displayY = GetSystemMetrics(SM_CYSCREEN);

// defines
#define startingRes  512
#define totalOctaves 8

//#define INTERPOLATION_LINEAR
//#define INTERPOLATION_COSINE // 561
#define INTERPOLATION_CUBIC // 1459 - 1155 - 845 - 805 - 653 - 506 - 427

// Globals
float minHeight = 0.0f;
float maxHeight = 0.0f;

// Arrays
float* noiseMap[totalOctaves+1];
char   globalString[256];

// Prototypes
float MyRand( int x );
float Sample( int octaveID, int x, int y, int octaveRes );
float LinearInterpolate( float a, float b, float x );
float CubicInterpolate( float aMinOne, float a, float b, float bAddOne, float acrossAB );
float CosineInterpolate( float a, float b, float x );

void DarkGDK ( void )
{
	// Set display
	dbSetWindowPosition( 0, 0 );
	dbSetDisplayMode( displayX, displayY, 32 );
	dbSetWindowLayout( 0, 0, 0 );
	
	// Init
	dbSyncOn();
	
	// BENCHMARK ///////
	int benchmark1Start = dbTimer();
	////////////////////
	
	// Calculate weight/opacity/strength of all layers
	float strength[totalOctaves + 1]; // indexed 1..totalOctaves below, so one extra slot is needed
	float currentStrength	= 1.0;
	float maxPossibleRange	= 0.0;
	for ( int octaveID = totalOctaves; octaveID >= 1; octaveID -- )
	{
		strength[octaveID] = currentStrength;
		maxPossibleRange  += currentStrength;
		currentStrength   *= 0.75f;
	}
	
	// This method is faster than evaluating the actual min and max heights used
	minHeight = -maxPossibleRange;
	maxHeight =  maxPossibleRange;
	
	// Generate noise for all octaves
	for( int octaveID = 1; octaveID <= totalOctaves; octaveID ++ )
	{
		int octaveRes		= startingRes / pow( 2.0f, (float) (octaveID - 1) );
		noiseMap[octaveID] = new float[octaveRes * octaveRes];
		int resLoopMax = octaveRes - 1;
		//float strength = 1.0f / (float) (1 << (totalOctaves - octaveID));
		
		for( int x = 0; x <= resLoopMax; x ++ )
		{
			for( int y = 0; y <= resLoopMax; y ++ )
			{
				//noiseMap[octaveID][x+y*octaveRes] = ( ((float) dbRnd(2000)) - 1000.0f ) * 0.001;
				noiseMap[octaveID][x+y*octaveRes] = MyRand(x+y*octaveRes) * strength[octaveID];
			}
		}
	}
	
	// BENCHMARK ///////
	int benchmark1Elapsed = dbTimer() - benchmark1Start;
	int benchmark2Start = dbTimer();
	////////////////////
	
	int memblockID  = 1;
	
	dbMakeMemblock( memblockID, 12 + startingRes * startingRes * 4 );
	DWORD* memblockPtr = (DWORD*) dbGetMemblockPtr(memblockID);
	
	// Write memblock header
	memblockPtr[0] = (DWORD) startingRes;
	memblockPtr[1] = (DWORD) startingRes;
	memblockPtr[2] = (DWORD) 32;
	
	float minInv			= -minHeight;
	float heightRange		=  maxHeight - minHeight;
	float _255DivRange		=  255.0 / heightRange;
	float sResReciprocal	=    1.0 / (startingRes);
	DWORD BGRA;
	
	int resLoopMax = startingRes - 1;
	
	//sprintf( globalString, "This: %f and this %f", minHeight, maxHeight );
	//MessageBox( NULL, globalString, "Hallo", MB_OK );
	float mHeight;
	
	for( int x = 0; x <= resLoopMax; x ++ )
	{
		for( int y = 0; y <= resLoopMax; y ++ )
		{
			float fX		= ((float) x) * sResReciprocal;
			float fY		= ((float) y) * sResReciprocal;
			float height	= 0.0;
			
			for ( int octaveID = totalOctaves; octaveID >= 2; octaveID -- )
			{
				int   octaveRes	= startingRes / (1 << (octaveID - 1));
				float octavefX	= fX * (float) octaveRes;
				float octavefY	= fY * (float) octaveRes;
				int	  octaveX	= (int) octavefX;
				int	  octaveY	= (int) octavefY;
				//octavefX = fmod( octavefX, 1.0f);
				//octavefY = fmod( octavefY, 1.0f);
				octavefX = 0.0000152587890625f * (float)(((int)(octavefX * 65536.0f)) % 65536);
				octavefY = 0.0000152587890625f * (float)(((int)(octavefY * 65536.0f)) % 65536);
				
				// BICUBIC ///////////
				#ifdef INTERPOLATION_CUBIC
					float sample1  = Sample( octaveID, octaveX - 1, octaveY - 1, octaveRes );
					float sample2  = Sample( octaveID, octaveX    , octaveY - 1, octaveRes );
					float sample3  = Sample( octaveID, octaveX + 1, octaveY - 1, octaveRes );
					float sample4  = Sample( octaveID, octaveX + 2, octaveY - 1, octaveRes );
					float mHeight1 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

						  sample1  = Sample( octaveID, octaveX - 1, octaveY    , octaveRes );
						  sample2  = Sample( octaveID, octaveX    , octaveY    , octaveRes );
						  sample3  = Sample( octaveID, octaveX + 1, octaveY    , octaveRes );
						  sample4  = Sample( octaveID, octaveX + 2, octaveY    , octaveRes );
					float mHeight2 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

						  sample1  = Sample( octaveID, octaveX - 1, octaveY + 1, octaveRes );
						  sample2  = Sample( octaveID, octaveX    , octaveY + 1, octaveRes );
						  sample3  = Sample( octaveID, octaveX + 1, octaveY + 1, octaveRes );
						  sample4  = Sample( octaveID, octaveX + 2, octaveY + 1, octaveRes );
					float mHeight3 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

						  sample1  = Sample( octaveID, octaveX - 1, octaveY + 2, octaveRes );
						  sample2  = Sample( octaveID, octaveX    , octaveY + 2, octaveRes );
						  sample3  = Sample( octaveID, octaveX + 1, octaveY + 2, octaveRes );
						  sample4  = Sample( octaveID, octaveX + 2, octaveY + 2, octaveRes );
					float mHeight4 = CubicInterpolate( sample1, sample2, sample3, sample4, octavefX );

					mHeight  = CubicInterpolate( mHeight1, mHeight2, mHeight3, mHeight4, octavefY );
				#endif
				//////////////////////
				// COSINE ////////////
				#ifdef INTERPOLATION_COSINE
					float sample1  = Sample( octaveID, octaveX    , octaveY, octaveRes );
					float sample2  = Sample( octaveID, octaveX + 1, octaveY, octaveRes );
					float mHeightX = CosineInterpolate( sample1, sample2, octavefX );
					
						  sample1  = Sample( octaveID, octaveX    , octaveY + 1, octaveRes);
						  sample2  = Sample( octaveID, octaveX + 1, octaveY + 1, octaveRes);
					float mHeightY = CosineInterpolate( sample1, sample2, octavefX );
					
					mHeight = CosineInterpolate( mHeightX, mHeightY, octavefY );
				#endif
				/////////////////////
				// BILINEAR /////////
				#ifdef INTERPOLATION_LINEAR
					float sample1  = Sample( octaveID, octaveX    , octaveY, octaveRes );
					float sample2  = Sample( octaveID, octaveX + 1, octaveY, octaveRes );
					float mHeightX = LinearInterpolate( sample1, sample2, octavefX );
					
						  sample1  = Sample( octaveID, octaveX    , octaveY + 1, octaveRes);
						  sample2  = Sample( octaveID, octaveX + 1, octaveY + 1, octaveRes);
					float mHeightY = LinearInterpolate( sample1, sample2, octavefX );
					
					mHeight = LinearInterpolate( mHeightX, mHeightY, octavefY );
				#endif
				/////////////////////
				
				height   = height + mHeight;
			}
			// Highest frequency noise
			height   = height + Sample( 1, x, y, startingRes);
			
			height   = (height + minInv) * _255DivRange;
			int iHeight = (int) height;
			BGRA = iHeight | (iHeight<<8) | (iHeight<<16) | (255<<24);
			memblockPtr[3 + x + y * startingRes] = BGRA;
		}
	}
	
	// BENCHMARK ///////
	int benchmark2Elapsed = dbTimer() - benchmark2Start; // time the octave pass from its own start
	////////////////////
	
	// Make image from memblock
	dbMakeImageFromMemblock( 1, memblockID );
	
	// Output image and info
	while ( LoopGDK() )
	{
		sprintf( globalString, "Time to generate noise: %ims", benchmark1Elapsed );
		dbText( 5, 5, globalString );
		sprintf( globalString, "Time to add octaves: %ims", benchmark2Elapsed );
		dbText( 5, 15, globalString );
		
		dbPasteImage( 1, 64, 64 );
		
		dbSync();
	}
	
	return;
}

float MyRand( int x )
{	 
	x = (x<<13) ^ x;
	
	return ( 1.0 - ( (x * (x * x * 15731 + 789221) + 1376312589) & 0x7fffffff) / 1073741824.0f);    
}
float Sample( int octaveID, int x, int y, int octaveRes )
{
	// Wrap
	if (x < 0)
		x += octaveRes;
	if (y < 0)
		y += octaveRes;
	
	if (x > octaveRes-1)
		x -= octaveRes;
	if (y > octaveRes-1)
		y -= octaveRes;
	
	return noiseMap[octaveID][x+y*octaveRes];
}

float LinearInterpolate( float a, float b, float x )
{
	return a * (1.0 - x) + b * x;
}
float CosineInterpolate( float a, float b, float x )
{
    x = (1.0 - cos(x * 3.141592654)) * 0.5;
    
	return a * (1.0 - x) + b * x;
}
float CubicInterpolate( float aMinOne, float a, float b, float bAddOne, float acrossAB )
{
	float T;
	
	bAddOne = (bAddOne - b) - (aMinOne - a);
	T = acrossAB * acrossAB;
	
	return bAddOne * (T * acrossAB) + ((aMinOne - a) - bAddOne) * T + (b - aMinOne) * acrossAB + a;
}
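
One of the speed-ups in the optimized version replaces fmod() with a 16.16 fixed-point trick to get the fractional part of a coordinate. A minimal sketch of the two approaches side by side is given below; the function names `FracFmod` and `FracFixed` are made up for illustration and do not appear in the code above. The fixed-point form only holds for non-negative inputs below 32768, which the pixel coordinates here satisfy.

```cpp
#include <cmath>

// Fractional part via fmod, as in the first version.
float FracFmod(float v)
{
    return std::fmod(v, 1.0f);
}

// Fractional part via 16.16 fixed point, as in the optimized version.
// Assumes 0 <= v < 32768 so that v * 65536 fits in a 32-bit int.
float FracFixed(float v)
{
    const float inv65536 = 1.0f / 65536.0f; // 0.0000152587890625
    int fixedPoint = (int)(v * 65536.0f);   // whole part in the high bits, fraction in the low 16
    return inv65536 * (float)(fixedPoint % 65536);
}
```

The fixed-point result is truncated to a multiple of 1/65536, so it can sit just under the fmod() result by less than that amount. For the coordinates used here (pixel index times a power-of-two ratio) both forms should give the same value.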

Green Gandalf

Posted: 19th Feb 2008 23:02 Edited at: 19th Feb 2008 23:11

Quote: "There has to be a simple bug staring me in the face somewhere in my code ..."

There was - but only when I looked in the right place.

Here's the revised code:

` Green Gandalf's Perlin style noise function - Version 4
` Created 25 June 2007, modified 19 February 2008.

`   Uses suggestions from following website
`      http://www.mandelbrot-dazibao.com/Perlin/Perlin1.htm
`   and especially
`      http://mrl.nyu.edu/~perlin/ (and browse to his 2002 paper)

` This version restricts the gradients to +/- 1 as suggested
` by Ken Perlin (in fact there's no need for the rnd() function at all
` - could probably just use a simple method of haphazardly swapping the signs
` of the gradients).

` All four methods work well - but simple linear filtering shows slight seams.

sync on: sync rate 0: sync
set display mode 800, 600, 32

time = timer()
randomize 140549 ` arbitrary fixed number for reproducibility

autocam off
position camera 0, 50, -300
point camera 0, 0, 0

create bitmap 1, 512, 512

global nOct = 8
global nGrid
global twoPiInv as float
twoPiInv = 0.159154943

nGrid = 2^nOct

dim weight#(nOct)

dim rawNoise(511, 511) as float

p# = 2.0
for i=0 to nOct
  weight#(i) = 1.0/p#^i
next i

dim a(nGrid, nGrid, nOct) as float
dim b(nGrid, nGrid, nOct) as float

for oct = 0 to nOct
  tGrid = 2^oct
  for i = 0 to tGrid
    for j = 0 to tGrid
      a(i, j, oct) = (rnd(1) - 0.5)* 2.0  ` random value +/- 1
      b(i, j, oct) = (rnd(1) - 0.5)* 2.0  ` random value +/- 1
                                             ` not seamless yet
    next j
  next i
next oct

` calculate raw noise and find scale factors for
` rescaling noise values to byte range
minNoise# = 10000  ` arbitrary large number out of range
maxNoise# = -10000 ` arbitrary small number out of range
for x = 0 to 511
  for y = 0 to 511
    u# = x * 0.001953125 ` value in range 0 to 1
    v# = y * 0.001953125
    rawNoise(x, y) = noise(u#, v#)
    if minNoise# > rawNoise(x, y)
      minNoise# = rawNoise(x, y)
    else
      if maxNoise# < rawNoise(x, y) then maxNoise# = rawNoise(x, y)
    endif
  next y
next x

f# = 255.0/(maxNoise# - minNoise#)

lock pixels
  for x = 0 to 511
    for y = 0 to 511
      ` convert raw noise to byte range 0 - 255
      c = (rawNoise(x, y) - minNoise#) * f#
      ` just in case
      if c<0 then c=0
      if c>255 then c=255
      dot x, y, rgb(0, c, 0) ` a nice green colour :)
    next y
  next x
unlock pixels

copy bitmap 1, 0, 0, 511, 511,  0, 144, 44, 655, 555

set current bitmap 0

time = timer() - time
repeat
  text 20, 20, "All done in "+str$(time)+" millisecs"
  sync
until spacekey()

set current bitmap 1
get image 1, 0, 0, 512, 512
save image "test v3.png", 1
end

function noiseBase(oct, x as float, y as float)
  ` x and y both in range 0 to 1
  ` find correct "tile" to use
  tGrid = 2^oct
  x# = x * tGrid
  y# = y * tGrid
  i = floor(x#)
  if i >= tGrid then i = tGrid - 1 ` just in case
  j = floor(y#)
  if j >= tGrid then j = tGrid - 1 ` just in case
  x# = x# - i  ` number in range 0 to 1
  y# = y# - j
  ` calculate interpolation weights
  ` revised Perlin weights
  remstart
  ix1# = x# * x# * x# * (6.0 * x# * x# - 15.0 * x# + 10.0)
  ix0# = 1.0 - ix1#
  iy1# = y# * y# * y# * (6.0 * y# * y# - 15.0 * y# + 10.0)
  iy0# = 1.0 - iy1#
  remend
  ` old Perlin weights - image not right at all WHY?
  remstart
  ix1# = x# * x# * (3.0 - 2.0 * x#)
  ix0# = 1.0 - ix1#
  iy1# = y# * y# * (3.0 - 2.0 * y#)
  iy0# = 1.0 - iy1#
  remend
  ` simple bilinear weights - should show slight seams
  ` because derivatives not matched correctly
  remstart
  ix0# = 1.0 - x#
  ix1# = x#
  iy0# = 1.0 - y#
  iy1# = y#
  remend
  ` simple sine weights (a variation of the Perlin idea)
`  remstart
  ix1# = x# - sin(360.0 * x#) * twoPiInv
  ix0# = 1.0 - ix1#
  iy1# = y# - sin(360.0 * y#) * twoPiInv
  iy0# = 1.0 - iy1#
`  remend
  itp0# = ix0# * iy0#
  itp1# = ix0# * iy1#
  itp2# = ix1# * iy0#
  itp3# = ix1# * iy1#
  ` calculate tangents at four corners of tile
  t0# = a(i, j, oct) * x# + b(i, j, oct) * y#
  t1# = a(i, j + 1, oct) * x# + b(i, j + 1, oct) * (y# - 1.0)
  t2# = a(i + 1 , j, oct) * (x# - 1.0) + b(i + 1, j, oct) * y#
  t3# = a(i + 1, j + 1, oct) * (x# - 1.0) + b(i + 1, j + 1, oct) * (y# - 1.0)
  ` calculate interpolated noise value
  result# = itp0#*t0# + itp1#*t1# + itp2#*t2# + itp3#*t3#
endfunction result#

function noise(x as float, y as float)
  result# = 0.0
  for oct = 0 to 8
    result# = noiseBase(oct, x, y) * weight#(oct) + result#
  next oct
endfunction result#

and it works perfectly.

The code allows you to change the method of interpolation by commenting/uncommenting a few lines. The methods are:

1. Simple bilinear interpolation.
2. Ken Perlin's original interpolator.
3. Ken Perlin's revised interpolator (see his 2002 paper).
4. Green Gandalf's sine interpolator.

The last is the slowest (because of the sine function), but it is a simple formula and gives results similar to Perlin's revised interpolator.

The only one that shows visible seams is, as expected, simple bilinear interpolation.
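
A minimal sketch of the four weight functions listed above, written in Python rather than DBP so it can be run anywhere (the function names are made up for illustration, and Python's sin takes radians where DBP's takes degrees). It estimates the slope of each weight at the two ends of a tile. The linear weight has slope 1 at both ends, while the other three flatten out to roughly 0, which is one way to see why only bilinear interpolation leaves visible seams at tile boundaries.

```python
import math

def linear(t):
    # simple bilinear weight: ix1# = x#
    return t

def perlin_cubic(t):
    # Perlin's original interpolator: 3t^2 - 2t^3
    return t * t * (3.0 - 2.0 * t)

def perlin_quintic(t):
    # Perlin's revised (2002) interpolator: 6t^5 - 15t^4 + 10t^3
    return t * t * t * (t * (6.0 * t - 15.0) + 10.0)

def sine_weight(t):
    # sine interpolator: t - sin(2*pi*t) / (2*pi)
    # (the DBP code writes this as x# - sin(360.0 * x#) * twoPiInv)
    return t - math.sin(2.0 * math.pi * t) / (2.0 * math.pi)

def slope(f, t, h=1e-5):
    # central-difference estimate of the derivative of f at t
    return (f(t + h) - f(t - h)) / (2.0 * h)

if __name__ == "__main__":
    for f in (linear, perlin_cubic, perlin_quintic, sine_weight):
        print(f.__name__, abs(round(slope(f, 0.0), 6)), abs(round(slope(f, 1.0), 6)))
```

All four weights run from 0 at t = 0 to 1 at t = 1 and pass through 0.5 at the midpoint, so they differ only in how they behave near the tile edges.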

And I attach some nice green clouds.

Thanks everyone for your interest and help in this - especially to Dark Coder who has given us another useful tool to play with. I think my code is slower than Dark Coder's but I suspect the images are slightly better - will need to experiment a bit to decide on that one. [Edit: NO! Mine seems to be much faster - and DC's images have some diagonal "criss-crossing" on them. Looks like it pays to stick to K. Perlin's methods.]

My code needs optimising - and Dark Coder's idea of using memblocks might speed things up a bit. Not sure yet. Will post back if I successfully refine mine any further.

dark coder

Posted: 20th Feb 2008 06:46 Edited at: 20th Feb 2008 07:57

Quote: "Mine seems to be much faster"

Quite likely, as my code isn't very optimized: I call many functions in my loop and DBP has no concept of inline optimization. However, if you want a challenge you shall have one :p.

Quote: "DC's images have some diagonal "criss-crossing""

Maybe it was just that seed, when viewing a single octave I found the opposite actually, here's a sample from one of my ones with filtering:

Yours:

Also, I tried two seeds with yours and got the same results. I made both programs use my desktop's resolution, so window filtering can't be affecting anything (there is none in that case).

[edit] the octave resolutions look slightly different but I can't get one the same as yours, here's my octave 1(highest frequency) http://img171.imageshack.us/img171/4753/dcoctave1ru1.png

[edit 2]
Also, I noticed that your code uses cosine interpolation, whereas the version I posted didn't have this and used cubic interpolation by default. Here's a far more optimized DBP version of my code using bicosine interpolation:

Load DLL "user32.dll"   , 1
displayX = Call DLL( 1 , "GetSystemMetrics" , 0 )
displayY = Call DLL( 1 , "GetSystemMetrics" , 1 )
Delete DLL 1

Set Display Mode displayX, displayY, 32, 1
Sync On

// Seed
Randomize 1338

// You can manually specify the resolution for higher detail noise
startingRes     = 512
totalOctaves    = 8

`totalOctaves    = 8
`startingRes     = 2 ^ totalOctaves

// BiCosine --------- 1946ms, 1658ms
// BiCubic  --------- 6943ms, 4967ms, 4922ms, 4903ms, 4823ms, 4672ms, 4078ms

benchmark1Start = Timer()

// Generate noise for all octaves
for octaveID = 1 to totalOctaves
    memblockID  = octaveID
    octaveRes   = startingRes / 2 ^ (octaveID - 1)
    Make Memblock memblockID, octaveRes * octaveRes * 4 // A single float
    for x = 0 to octaveRes - 1
        for y = 0 to octaveRes - 1
            Write Memblock Float memblockID, ( x + y * octaveRes ) * 4, ( Rnd(2000) - 1000 ) * 0.001
        next
    next
next

benchmark1elapsed   = Timer() - benchmark1Start

benchmark2Start = Timer()

// add all layers together
memblockID  = totalOctaves + 1

Make Memblock memblockID, 12 + startingRes * startingRes * 4

Write Memblock DWord memblockID, 0, startingRes
Write Memblock DWord memblockID, 4, startingRes
Write Memblock DWord memblockID, 8, 32

Dim strength(totalOctaves) as float
Dim res(totalOctaves)

strength# = 1.0
for octaveID = totalOctaves to 1 step -1
    strength# = strength# * 0.75
    strength(octaveID) = strength#
    res(octaveID)      = startingRes / 2 ^ (octaveID - 1)
    maxStrength#       = maxStrength# + strength#
next

minHeight#      = -maxStrength#
maxHeight#      =  maxStrength#
minInv#         = -minHeight#
heightRange#    =  maxHeight# - minHeight#
_255DivRange#   =  255.0 / heightRange#
sResReciprocal# =    1.0 / (startingRes)

startingResSubOne = startingRes - 1
for x = 0 to startingResSubOne
    for y = 0 to startingResSubOne
        x#          = x * sResReciprocal#
        y#          = y * sResReciprocal#
        height#     = 0.0
        Strength#   = 1.0

        for OctaveID = 2 to totalOctaves
            octaveRes = res(OctaveID)
            // Store the float pixel we are over on the current octave
            memblockX# = x# * octaveRes
            memblockY# = y# * octaveRes
            // Store the int version for pixel sampling
            memblockX  = memblockX#
            memblockY  = memblockY#
            // Get the local offset
            memblockX# = memblockX# mod 1.0
            memblockY# = memblockY# mod 1.0
            
            mX1 = memblockX - 1
            mX2 = memblockX
            mX3 = memblockX + 1
            mX4 = memblockX + 2
            
            mY1 = memblockY - 1
            mY2 = memblockY
            mY3 = memblockY + 1
            mY4 = memblockY + 2
            // BICUBIC ///////////
            REMSTART
            sample1#  = Sample( OctaveID, mX1, mY1, octaveRes )
            sample2#  = Sample( OctaveID, mX2, mY1, octaveRes )
            sample3#  = Sample( OctaveID, mX3, mY1, octaveRes )
            sample4#  = Sample( OctaveID, mX4, mY1, octaveRes )
            mHeight1# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            sample1#  = Sample( OctaveID, mX1, mY2, octaveRes )
            sample2#  = Sample( OctaveID, mX2, mY2, octaveRes )
            sample3#  = Sample( OctaveID, mX3, mY2, octaveRes )
            sample4#  = Sample( OctaveID, mX4, mY2, octaveRes )
            mHeight2# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            sample1#  = Sample( OctaveID, mX1, mY3, octaveRes )
            sample2#  = Sample( OctaveID, mX2, mY3, octaveRes )
            sample3#  = Sample( OctaveID, mX3, mY3, octaveRes )
            sample4#  = Sample( OctaveID, mX4, mY3, octaveRes )
            mHeight3# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            sample1#  = Sample( OctaveID, mX1, mY4, octaveRes )
            sample2#  = Sample( OctaveID, mX2, mY4, octaveRes )
            sample3#  = Sample( OctaveID, mX3, mY4, octaveRes )
            sample4#  = Sample( OctaveID, mX4, mY4, octaveRes )
            mHeight4# = CubicInterpolate( sample1#, sample2#, sample3#, sample4#, memblockX# )

            mHeight#  = CubicInterpolate( mHeight1#, mHeight2#, mHeight3#, mHeight4#, memblockY# )
            REMEND
            //////////////////////
            
            // COSINE ////////////
            REMSTART
            sample1#  = Sample( OctaveID, mX2, mY2, octaveRes )
            sample2#  = Sample( OctaveID, mX3, mY2, octaveRes )
            mHeightX# = CosineInterpolate( sample1#, sample2#, memblockX# )

            sample1#  = Sample( OctaveID, mX2, mY3, octaveRes)
            sample2#  = Sample( OctaveID, mX3, mY3, octaveRes)
            mHeightY# = CosineInterpolate( sample1#, sample2#, memblockX# )

            mHeight#  = CosineInterpolate( mHeightX#, mHeightY#, memblockY# )
            REMEND
            `REMSTART
            // Inline version
            memblockX# = (1.0 - Cos(memblockX# * 180.0)) * 0.5
            memblockY# = (1.0 - Cos(memblockY# * 180.0)) * 0.5
            
            sample1#  = Sample( OctaveID, mX2, mY2, octaveRes )
            sample2#  = Sample( OctaveID, mX3, mY2, octaveRes )
            mHeightX# = sample1# * (1.0 - memblockX#) + sample2# * memblockX#

            sample1#  = Sample( OctaveID, mX2, mY3, octaveRes)
            sample2#  = Sample( OctaveID, mX3, mY3, octaveRes)
            mHeightY# = sample1# * (1.0 - memblockX#) + sample2# * memblockX#

            mHeight#  = mHeightX# * (1.0 - memblockY#) + mHeightY# * memblockY#
            `REMEND
            //////////////////////
            
            // BILINEAR //////////
            REMSTART
            sample1#  = Sample( OctaveID, memblockX    , memblockY, octaveRes )
            sample2#  = Sample( OctaveID, memblockX + 1, memblockY, octaveRes )
            mHeightX# = LinearInterpolate( sample1#, sample2#, memblockX# )

            sample1#  = Sample( OctaveID, memblockX    , memblockY + 1, octaveRes)
            sample2#  = Sample( OctaveID, memblockX + 1, memblockY + 1, octaveRes)
            mHeightY# = LinearInterpolate( sample1#, sample2#, memblockX# )

            mHeight#  = LinearInterpolate( mHeightX#, mHeightY#, memblockY# )
            REMEND
            //////////////////////

            height#   = height# + mHeight# * strength(octaveID)
        next
        // Add octave 1
        height# = height# + Sample( 1, X, Y, startingRes ) * strength(1)
        
        height  = (height# + minInv#) * _255DivRange#
        Write Memblock DWord memblockID, 12 + ( x + y * startingRes ) * 4, height || (height<<8) || (height<<16) || (255<<24)
    next
next

benchmark2elapsed   = Timer() - benchmark2Start

// Create image
Make Image From Memblock 1, memblockID

// Loop to output result
do
    CLS

    Paste Image 1, 64, 64

    Text 5,  5, "Time to generate noise: " + Str$(benchmark1elapsed) + "ms"
    Text 5, 15, "Time to add octaves: "    + Str$(benchmark2elapsed) + "ms"

    Sync
loop

END

Function Sample( memblockID, x, y, memblockRes )

    // Wrap
    if x < 0
        x = x + memblockRes
        if y < 0             then y = y + memblockRes
        if y > memblockRes-1 then y = y - memblockRes
    else
        if x > memblockRes-1 then x = x - memblockRes
        
        if y < 0             then y = y + memblockRes
        if y > memblockRes-1 then y = y - memblockRes
    endif

    returnValue# = Memblock Float( memblockID, ( x + y * memblockRes ) * 4 )

Endfunction returnValue#

Function LinearInterpolate( a as float, b as float, x as float )

    returnValue# =  a * (1.0 - x) + b * x

Endfunction returnValue#

function CosineInterpolate( a as float, b as float, x as float )
    
    x = (1.0 - Cos(x * 180.0)) * 0.5
    
    returnValue# = a * (1.0 - x) + b * x
    
Endfunction returnValue#

Function CubicInterpolate( aMinOne as float, a as float, b as float, bAddOne as float, acrossAB as float )

    T as float
    
    bAddOne = (bAddOne - b) - (aMinOne - a)
    T = acrossAB * acrossAB
    
    returnValue# =  bAddOne * T * acrossAB + ((aMinOne - a) - bAddOne) * T + (b - aMinOne) * acrossAB + a

EndFunction returnValue#

With this code it takes me 1658ms to generate a single 512x512 Perlin noise texture with 8 octaves, and 2516ms with yours.
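
For anyone following along without DBP, the inline bicosine step in the code above can be sketched in Python roughly like this. It is only an illustration: the function names are made up, the grid is a plain list of rows instead of a memblock, and wrapping uses the modulo operator instead of the if-tests in Sample(), which gives the same index as long as the coordinate is no more than one grid width out of range.

```python
import math

def cosine_weight(t):
    # DBP's Cos() takes degrees; math.cos takes radians, so 180 degrees becomes pi
    return (1.0 - math.cos(t * math.pi)) * 0.5

def sample(grid, x, y):
    # wrap-around lookup in a square grid stored as grid[y][x]
    n = len(grid)
    return grid[y % n][x % n]

def bicosine(grid, fx, fy):
    # integer cell and local offset inside the cell
    ix, iy = int(math.floor(fx)), int(math.floor(fy))
    tx = cosine_weight(fx - ix)
    ty = cosine_weight(fy - iy)
    # blend along x on the two rows, then blend the two results along y
    top = sample(grid, ix, iy) * (1.0 - tx) + sample(grid, ix + 1, iy) * tx
    bottom = sample(grid, ix, iy + 1) * (1.0 - tx) + sample(grid, ix + 1, iy + 1) * tx
    return top * (1.0 - ty) + bottom * ty
```

At whole-number coordinates this returns the stored grid value unchanged, which is why the highest-resolution octave needs no interpolation when it matches the canvas size.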

Green Gandalf

Posted: 20th Feb 2008 12:37

Quote: "however if you want a challenge you shall have one"

Your version performs somewhat faster on my machine too. Looks like I'd better optimize my code. Will have a go later after work.

Quote: "Also with your one I tried two seeds and got the same results, also I made them both use my desktop's resolution so there can be no issues with the window's filtering affecting anything(as there is none then)."

Yes I get those lines with the high resolution octave with my version too. The odd thing is that they don't seem to show in the final image.

Watch this space.

Green Gandalf

Posted: 20th Feb 2008 23:49 Edited at: 21st Feb 2008 00:16

Dark Coder

Before I bust a gut trying to optimize my code, could you clarify a point about your code? How many octaves are you using? I'm using 9, i.e. my octave 0 uses a 2x2 grid of points (just the four corners of the image), octave 1 uses a 4x4 grid and so on up to octave 8, which uses a 512x512 grid (which is rather superfluous).

Your high res octave seems to be only 256x256. Could you confirm that?

When I reduce mine to a max resolution of 256x256 (i.e. octaves 0 to 7 in my notation) my image looks the same (the extra high res octave adds nothing) and my code takes about 6089 ms to produce the image, whereas yours takes about 6400 (225+6175) ms. I haven't tried optimising mine yet (I know at least one place where I can).

See "health" warning edit at end.

I still have those noticeable lines in the 256x256 octave image - but, again, they don't show in the final cloud image. I vaguely recall Perlin mentioning something about that in his paper - I'll have to look at it again.

Only good can come of this - between us we should end up with fast code for producing this sort of image. But I guess we are some way off producing them in real time either way.

Edit: I've pruned a bit of time off my code - but I've just realised that I hadn't checked that I was comparing like with like in my timings above, i.e. which interpolators were being used in the two sets of code. I'll check this again tomorrow, must sign off now.

dark coder

Posted: 21st Feb 2008 06:50

Quote: "Your high res octave seems to be only 256x256. Could you confirm that?"

No. It might be slightly hard to read, but after the loop that adds the octaves I add my highest-resolution octave (512^2). Because it's the same resolution as the canvas, I don't need to do any interpolation and can just do a straight lookup. Below is some even more optimized code. I'm also recoding my C++ version, as I've thought up several more ways to speed it up.

However, you're correct about my 8 octaves being 1-8. The way mine works is that if I specify a lower number of octaves, the end result becomes more detailed, as it merely trims off the lower-resolution octaves. So for a large canvas you wouldn't want to use octaves all the way down to 2x2, or you'd likely end up with one gigantic hill or mountain, whereas using a lower number results in many mountains.
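
To make the octave trimming concrete, here is a small Python sketch (not DBP, and the function name is made up) that reproduces the res() and strength() tables built by the setup loop in the posted code, assuming startingRes is a power of two and the 0.75 factor used there.

```python
def octave_table(starting_res=512, total_octaves=8, persistence=0.75):
    # Mirrors the DBP loop "for octaveID = totalOctaves to 1 step -1":
    # the lowest-resolution octave is visited first and gets the largest strength.
    strength = 1.0
    rows = {}
    for octave_id in range(total_octaves, 0, -1):
        strength *= persistence
        res = starting_res // 2 ** (octave_id - 1)
        rows[octave_id] = (res, strength)
    # return (octave id, resolution, strength) ordered from octave 1 upward
    return [(octave_id,) + rows[octave_id] for octave_id in range(1, total_octaves + 1)]

if __name__ == "__main__":
    for octave_id, res, strength in octave_table(512, 8):
        print(octave_id, res, strength)
```

With 8 octaves at 512 the coarsest layer is 4x4, and asking for 9 adds a 2x2 layer underneath it, so fewer octaves means the coarse, mountain-sized layers are the ones left out.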

Either way, I changed my code to 9 octaves (1-9), so it should be the same as yours, and I still get faster speed. I also increased the weights/strength/opacity between the layers slightly so you can see the result better.

Load DLL "user32.dll"   , 1
displayX = Call DLL( 1 , "GetSystemMetrics" , 0 )
displayY = Call DLL( 1 , "GetSystemMetrics" , 1 )
Delete DLL 1

Set Display Mode displayX, displayY, 32, 1
Sync On

// Seed
Randomize 1338

// You can manually specify the resolution for higher detail noise
startingRes     = 512
totalOctaves    = 9

`totalOctaves    = 8
`startingRes     = 2 ^ totalOctaves

// BiCosine --------- 1946ms, 1658ms, 1601ms - 9x 1783ms
// BiCubic  --------- 6943ms, 4967ms, 4922ms, 4903ms, 4823ms, 4672ms, 4078ms

// BENCHMARK //////////////
benchmark1Start = Timer()
///////////////////////////

Dim strength(totalOctaves)  as float
Dim res(totalOctaves)       as integer
Dim spacing(totalOctaves)   as integer

strength# = 1.0
for octaveID = totalOctaves to 1 step -1
    strength#          = strength# * 0.75
    strength(octaveID) = strength#
    res(octaveID)      = startingRes / 2 ^ (octaveID - 1)
    spacing(octaveID)  = startingRes / res(octaveID)
    maxStrength#       = maxStrength# + strength#
next

// Generate noise for all octaves
for octaveID = 1 to totalOctaves
    memblockID  = octaveID
    octaveRes   = startingRes / 2 ^ (octaveID - 1)
    pos         = 0
    Make Memblock memblockID, octaveRes * octaveRes * 4 // A single float
    for y = 0 to octaveRes - 1
        for x = 0 to octaveRes - 1
            Write Memblock Float memblockID, pos, ( ( Rnd(2000) - 1000 ) * 0.001 ) * strength(octaveID)
            Inc pos, 4
        next
    next
next

// BENCHMARK //////////////
benchmark1elapsed   = Timer() - benchmark1Start
benchmark2Start     = Timer()
///////////////////////////

// add all layers together
memblockID  = totalOctaves + 1

// Make memblock and write header
Make Memblock memblockID, 12 + startingRes * startingRes * 4

Write Memblock DWord memblockID, 0, startingRes
Write Memblock DWord memblockID, 4, startingRes
Write Memblock DWord memblockID, 8, 32

// Compute values
minHeight#      = -maxStrength#
maxHeight#      =  maxStrength#
minInv#         = -minHeight#
heightRange#    =  maxHeight# - minHeight#
_255DivRange#   =  255.0 / heightRange#
sResReciprocal# =    1.0 / (startingRes)

startingResSubOne = startingRes - 1
for y = 0 to startingResSubOne
    for x = 0 to startingResSubOne
        x#          = x * sResReciprocal#
        y#          = y * sResReciprocal#
        height#     = 0.0
        Strength#   = 1.0

        for OctaveID = 2 to totalOctaves
            `if (x mod spacing(octaveID))+(y mod spacing(octaveID)) = 0
            `    height# = height# + Memblock Float( OctaveID, ( x# * res(OctaveID) + y# * res(OctaveID) * res(OctaveID) ) * 4 )
            `else
            
            octaveRes = res(OctaveID)
            // Store the float pixel we are over on the current octave
            memblockX# = x# * octaveRes
            memblockY# = y# * octaveRes
            // Store the int version for pixel sampling
            memblockX  = memblockX#
            memblockY  = memblockY#
            // Get the local offset
            memblockX# = memblockX# mod 1.0
            memblockY# = memblockY# mod 1.0
            
            `if (memblockX# = 0.0 or memblockX# = 1.0) and (memblockY# = 0.0 or memblockY# = 1.0)
            `    height# = height# + Memblock Float( OctaveID, ( memblockX + memblockY * octaveRes ) * 4 )
            `else
                // BICOSINE //////////
                mX2 = memblockX
                mX3 = memblockX + 1
                mY2 = memblockY
                mY3 = memblockY + 1
                
                memblockX# = (1.0 - Cos(memblockX# * 180.0)) * 0.5
                memblockY# = (1.0 - Cos(memblockY# * 180.0)) * 0.5
                
                sample1#  = Sample( OctaveID, mX2, mY2  , octaveRes )
                sample2#  = Sample( OctaveID, mX3, mY2  , octaveRes )
                mHeightX# = sample1# * (1.0 - memblockX#) + sample2# * memblockX#
    
                sample1#  = Sample( OctaveID, mX2, mY3, octaveRes)
                sample2#  = Sample( OctaveID, mX3, mY3, octaveRes)
                mHeightY# = sample1# * (1.0 - memblockX#) + sample2# * memblockX#
    
                height# = height# + mHeightX# * (1.0 - memblockY#) + mHeightY# * memblockY#
                //////////////////////
            `endif
        next
        // Add octave 1
        height# = height# + Memblock Float( 1, ( X + Y * startingRes ) * 4 )
        
        height  = (height# + minInv#) * _255DivRange#
        Write Memblock DWord memblockID, 12 + ( x + y * startingRes ) * 4, height || (height<<8) || (height<<16) || (255<<24)
    next
next

// BENCHMARK //////////////
benchmark2elapsed   = Timer() - benchmark2Start
///////////////////////////

// Create image
Make Image From Memblock 1, memblockID

// Loop to output result
do
    CLS

    Paste Image 1, 64, 64

    Text 5,  5, "Time to generate noise: " + Str$(benchmark1elapsed) + "ms"
    Text 5, 15, "Time to add octaves: "    + Str$(benchmark2elapsed) + "ms"

    Sync
loop

END

Function Sample( memblockID, x, y, memblockRes )

    // Wrap
    if x < 0
        x = x + memblockRes
        if y < 0             then y = y + memblockRes
        if y > memblockRes-1 then y = y - memblockRes
    else
        if x > memblockRes-1 then x = x - memblockRes
        
        if y < 0             then y = y + memblockRes
        if y > memblockRes-1 then y = y - memblockRes
    endif

    returnValue# = Memblock Float( memblockID, ( x + y * memblockRes ) * 4 )

Endfunction returnValue#

Function LinearInterpolate( a as float, b as float, x as float )

    returnValue# =  a * (1.0 - x) + b * x

Endfunction returnValue#

function CosineInterpolate( a as float, b as float, x as float )
    
    x = (1.0 - Cos(x * 180.0)) * 0.5
    
    returnValue# = a * (1.0 - x) + b * x
    
Endfunction returnValue#

Function CubicInterpolate( aMinOne as float, a as float, b as float, bAddOne as float, acrossAB as float )

    T as float
    
    bAddOne = (bAddOne - b) - (aMinOne - a)
    T = acrossAB * acrossAB
    
    returnValue# =  bAddOne * T * acrossAB + ((aMinOne - a) - bAddOne) * T + (b - aMinOne) * acrossAB + a

EndFunction returnValue#
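
For anyone following along in C++, the BiCosine step in the listing above could be sketched roughly as follows. This is only an illustration and not dark coder's actual C++ version (which isn't posted in the thread): `wrapIndex`, `cosineWeight` and `sampleBiCosine` are made-up names, the grid is assumed to be a square row-major array of floats, and `std::cos` takes radians whereas the DBP code passes degrees to `Cos`.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Wrap an index into [0, res) so that sampling tiles at the edges.
inline int wrapIndex(int i, int res) {
    assert(res > 0);
    const int m = i % res;
    return m < 0 ? m + res : m;
}

// Cosine easing: maps t in [0,1] onto [0,1] with zero slope at both ends.
inline float cosineWeight(float t) {
    const float kPi = 3.14159265358979f;
    return (1.0f - std::cos(t * kPi)) * 0.5f;
}

// Sample a res x res grid at the fractional position (fx, fy), in grid units.
float sampleBiCosine(const std::vector<float>& grid, int res, float fx, float fy) {
    assert(grid.size() == static_cast<std::size_t>(res) * static_cast<std::size_t>(res));
    const int x0 = static_cast<int>(std::floor(fx));
    const int y0 = static_cast<int>(std::floor(fy));
    const float wx = cosineWeight(fx - static_cast<float>(x0));
    const float wy = cosineWeight(fy - static_cast<float>(y0));
    auto at = [&](int x, int y) {
        return grid[wrapIndex(x, res) + wrapIndex(y, res) * res];
    };
    // Blend along x on the two neighbouring rows, then blend the rows along y.
    const float row0 = at(x0, y0)     * (1.0f - wx) + at(x0 + 1, y0)     * wx;
    const float row1 = at(x0, y0 + 1) * (1.0f - wx) + at(x0 + 1, y0 + 1) * wx;
    return row0 * (1.0f - wy) + row1 * wy;
}
```

Summing `sampleBiCosine` over the octave grids, each already scaled by its strength, would give the per-pixel height in the same way as the inner loop of the listing.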

Green Gandalf

VIP Member

19

Years of Service

User Offline

Joined: 3rd Jan 2005

Playing: Malevolence:Sword of Ahkranox, Skyrim, Civ6.

Posted: 21st Feb 2008 14:27

Link

Quote: "I'm using 9, i.e. my octave 0 uses a 2x2 grid of points (just the four corners of the image), octave 1 uses a 4x4 grid and so on up to octave 8 which uses a 512x512 (which is rather superfluous)."

Looks like I can't count - not enough fingers. Octave 0 should use a 2x2 grid, octave 1 a 3x3 grid, and so on up to octave 8 which uses a 257x257 grid of points (i.e. a 256x256 grid of squares or "tiles"). Will have to check my code to see if any similar errors have crept in.

With my code the next octave, i.e. a 513x513 grid, would (should?) literally add nothing as the octaves are defined as zero at the grid points.
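
To illustrate why such an octave would add nothing: in gradient noise each corner contributes (gradient . offset from that corner), so at a grid point the offset to that corner is zero and the interpolation weights select only that corner. Below is a rough C++ sketch of a single unit tile, modelled on the `noiseBase` function posted later in this thread. The `Gradient` struct and the names `fade` and `tileNoise` are illustrative assumptions, and the revised Perlin weight is used as the interpolant.

```cpp
#include <cassert>

// Gradient (a, b) stored at one corner of a unit tile.
struct Gradient { float a, b; };

// Revised Perlin weight: 6t^5 - 15t^4 + 10t^3.
inline float fade(float t) { return t * t * t * (t * (6.0f * t - 15.0f) + 10.0f); }

// Noise inside one unit tile. g00, g01, g10, g11 are the gradients at the
// corners (0,0), (0,1), (1,0) and (1,1); x and y lie in [0, 1].
float tileNoise(Gradient g00, Gradient g01, Gradient g10, Gradient g11, float x, float y) {
    assert(x >= 0.0f && x <= 1.0f && y >= 0.0f && y <= 1.0f);
    // Each corner contributes gradient . (offset from that corner).
    const float t0 = g00.a * x          + g00.b * y;
    const float t1 = g01.a * x          + g01.b * (y - 1.0f);
    const float t2 = g10.a * (x - 1.0f) + g10.b * y;
    const float t3 = g11.a * (x - 1.0f) + g11.b * (y - 1.0f);
    const float ix = fade(x);
    const float iy = fade(y);
    const float tA = t0 + (t2 - t0) * ix;  // blend along x at the y = 0 edge
    const float tB = t1 + (t3 - t1) * ix;  // blend along x at the y = 1 edge
    return tA + (tB - tA) * iy;            // blend along y
}
```

Whatever gradients are passed in, the result at each of the four corners is zero, so an octave whose grid points coincide with the pixels contributes nothing to the image.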

Will go over my code more carefully later - need to identify the step(s) which is(are) eating the time.

Quote: "Either, way I changed my code to 9 octaves(1-9) so it should be the same as yours and I still get faster speed"

That doesn't surprise me given the errors that have crept into my code.

dark coder

21

Years of Service

User Offline

Joined: 6th Oct 2002

Location: Japan

Posted: 21st Feb 2008 16:09

Link

Also, if you need something else to compete against, I rewrote my C++ version using all the optimizations I could think of and got a low of 75ms for a 512^2px texture with 8 octaves (with BiCosine interpolation). The only way you could probably get faster than this would be to use straight asm with heavily optimized SSE instructions and/or multiple threads; however, both are beyond my current C++ knowledge.

Anyway, this code will go very nicely in my game, plus all these optimizations have taught me some things I didn't know before (i.e. how slow DBP is).

jason p sage

16

Years of Service

User Offline

Joined: 10th Jun 2007

Location: Ellington, CT USA

Posted: 21st Feb 2008 16:33

Link

Quote: "plus all these optimizations have taught me some things I didn't know before(i.e. how slow DBP is) "

@DarkCoder- with the stuff I've seen you do - I'm surprised at this one!

http://www.jasonpetersage.com/?PAGE=ironinfantry&SECTION=ironinfantry

Green Gandalf

VIP Member

19

Years of Service

User Offline

Joined: 3rd Jan 2005

Playing: Malevolence:Sword of Ahkranox, Skyrim, Civ6.

Posted: 21st Feb 2008 18:18

Link

Quote: "Also if you need something else to compete against, I rewrote my C++ version using all the optimizations I could think of and got a low of 75ms"

Another reason for me to learn C++.

Quote: "plus all these optimizations have taught me some things I didn't know before(i.e. how slow DBP is)"

Yes, I wondered at that statement too - but it is useful to have some benchmark comparisons which DC's posts have provided.

Why is DBP so slow, anyway? The executables must be doing a lot of unnecessary stuff - or is it a consequence of the executables being so large? Anyone know?

Benjamin

21

Years of Service

User Offline

Joined: 24th Nov 2002

Location: France

Posted: 21st Feb 2008 18:32 Edited at: 21st Feb 2008 18:37

Link

Quote: "Why is DBP so slow, anyway?"

After every line of code (I think) there are four instructions. The first sets the global line number variable, and the next three check the global error variable and jump to the end of the program if it is set. This check also occurs after function calls (which is the only place it should ever happen really). This is quite a bit of overhead if you're doing intensive calculations.

And then there's instructions that are longer than they need to be. Instructions that specify a local variable use a 32-bit offset when in most cases it only needs to be 8-bit.

Then there's also redundant instructions - stores and reads that contradict each other. Some of these probably cause a pipeline stall, losing more processing time. An example of what I mean, in assembly:

+ Code Snippet

mov myVar, eax
mov eax, myVar

Not enough use of registers - results are put straight back into memory rather than storing them in registers until needed.

No constant folding, which means code like this will perform calculations at run-time:

+ Code Snippet

myVar = 1 * 2 * 3 * 4 * 5 * 6

A lot of these things could probably be fixed with a post-optimizer. We'll have to see about that.

Multisync - TCP Server/Client Multiplayer Plugin (DBP/DBCe)

Green Gandalf

VIP Member

19

Years of Service

User Offline

Joined: 3rd Jan 2005

Playing: Malevolence:Sword of Ahkranox, Skyrim, Civ6.

Posted: 21st Feb 2008 19:18

Link

Unnecessary "mov" instructions occur at a low level and the programmer can't do much about those - but dbp code like your example

+ Code Snippet

myVar = 1 * 2 * 3 * 4 * 5 * 6

is the sort of thing the programmer can eliminate and is one of the first things I would look for when optimising my code. Another thing to do is to avoid calculating something twice - my code "snippets" on this thread recalculate the same interpolation weights many times over, i.e. that's an obvious candidate for optimisation.

Another thing we can try is to use integer arithmetic rather than float arithmetic where possible.

But obviously, after a certain point, we are stuck with the extra DBP stuff you mention.

Benjamin

21

Years of Service

User Offline

Joined: 24th Nov 2002

Location: France

Posted: 21st Feb 2008 19:27 Edited at: 21st Feb 2008 19:28

Link

Quote: "is the sort of thing the programmer can eliminate and is one of the first things I would look for when optimising my code"

You're right, it's not such a valid point. It's merely useful sometimes to write out an equation rather than a constant.

Quote: "Another thing we can try is to use integer arithmetic rather than float arithmetic where possible."

Agreed, and if using floats, replace divisions by multiplications by the reciprocal where possible.

Another thing I'm not too keen on is how DBP handles float casting, it seems slower than it should be. It takes only a couple of instructions to convert between integer/float, but DBP actually calls a function for this.

Quote: "But obviously, after a certain point, we are stuck with the extra DBP stuff you mention."

Well, if someone were to write an optimiser...

Multisync - TCP Server/Client Multiplayer Plugin (DBP/DBCe)

jason p sage

16

Years of Service

User Offline

Joined: 10th Jun 2007

Location: Ellington, CT USA

Posted: 21st Feb 2008 19:41

Link

...So Ben... You Busy?

http://www.jasonpetersage.com/?PAGE=ironinfantry&SECTION=ironinfantry

TinTin

17

Years of Service

User Offline

Joined: 16th May 2006

Location: BORG Drone Ship - Being Assimilated near Roda Beta (28)

Posted: 22nd Feb 2008 00:09 Edited at: 25th Feb 2008 16:06

Link

Haven't read this whole post yet so apologies in advance if someone has already mentioned this...

Perlin Noise (I was using this in another project, GG you know the one) is better known as 'Gradient Noise'.

Anyway, here is one of Ken's own implementations in C++ from Texturing & Modeling. This method uses a precalculated table of random values.

+ Code Snippet

#include <math.h>
float gradientTab[TABSIZE*3];
void gradientTabInit(int seed){
 float *table = gradientTab;
 float z, r, theta;
 int i;
 srandom(seed);
 for (i=0; i < TABSIZE; i++){
  z = 1. - 2. * RANDNBR;
  /* r is radius of x,y circle */
  r = sqrtf(1 - z*z);
  /* theta is angle in (x,y) */
  theta = 2 * M_PI * RANDNBR;
  *table++ = r * cosf(theta);
  *table++ = r * sinf(theta);
  *table++ = z;
 }
}
float glattice(int ix,int iy, int iz, float fx, float fy, float fz){
 float *g = &gradientTab[INDEX(ix,iy,iz)*3];
 return g[0]*fx + g[1]*fy + g[2]*fz;
}

His own website has some interesting articles that may be of use, but basically Perlin noise is generated from a calculation using the standard noise function in either 1, 2, 3 or 4 dimensions...
Yeah! 4 dimensions, the fourth being time, which can be used to animate the noise.

From the top of my head, here are the first three on generating noise; I'll look out the code for the others...
I remember something about artifacts occurring at integer intersections, and Perlin's method attempted to interpolate between these to remove or smooth them out.

+ Code Snippet

double Noise1D(int x){
 int n = (x<<13)^x; /* ^ is bitwise XOR in C/C++ */
 return (1. -((n * (n * n * 15731 + 789221)+ 1376312589) & 0x7fffffff) / 1073741824.);
}
double Noise2D(int x, int y){
 int n = x + y * 57;
 n = (n<<13)^n;
 return (1. -((n * (n * n * 15731 + 789221)+ 1376312589) & 0x7fffffff) / 1073741824.);
}
double Noise3D(int x, int y, int z){
 int n = x + y * 57 + z * 131;
 n = (n<<13)^n;
 return (1. -((n * (n * n * 15731 + 789221)+ 1376312589) & 0x7fffffff) / 1073741824.);
}

Notice that the big value calculations are the same and only the first line really changes; this trend stays the same for the other dimensions.

Here is the DBP equivalent code. Unfortunately there seems to be an issue with the wrong result from the bit shift ^ part. The bit shift works as expected but squaring it always gives the same answer if the input value is over 2. (odd!)

+ Code Snippet

`*** Noise functions in DBP
Function Noise1D(X as integer)
   n = (X << 13) ^X `<- I think this part isnt working correctly possibly the ^ command.
   Result# = (1.0 -((n*(n*n*15731 + 789221) + 1376312589) && 0x7fffffff) / 1073741824.0 )
EndFunction Result#
Function Noise2D(X as integer ,Y as integer)
   n = X + Y * 57
   n = (n << 13) ^n
   Result# = (1.0 -((n*(n*n*15731 + 789221) + 1376312589) && 0x7fffffff) / 1073741824.0 )
EndFunction Result#
Function Noise3D(X as integer ,Y as integer,Z as Integer)
   n = X + Y * 57 + Z * 131
   n = (n << 13) ^n
   Result# = (1.0 -((n*(n*n*15731 + 789221) + 1376312589) && 0x7fffffff) / 1073741824.0 )
EndFunction Result#
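
A possible explanation for the odd results, offered as a guess rather than a confirmed diagnosis: other DBP listings in this thread use `^` as the power operator (for example `2 ^ totalOctaves`), whereas in C and C++ `^` is bitwise XOR, so `(X << 13) ^ X` would mean two different things in the two languages. DBP's bitwise XOR operator is, as far as I recall, `~~`, but that should be checked against the DBP help files. For reference, here is a small C++ sketch of the same hash using explicit 32-bit unsigned arithmetic so that overflow wraps predictably; `hashNoise` and `noise2D` are illustrative names, not from any library.

```cpp
#include <cassert>
#include <cstdint>

// Integer-hash noise in the style quoted above.
// '^' here is bitwise XOR; unsigned arithmetic wraps modulo 2^32.
double hashNoise(std::uint32_t n) {
    assert(sizeof(n) == 4);  // the constants below assume 32-bit wrap-around
    n = (n << 13) ^ n;
    const std::uint32_t h =
        (n * (n * n * 15731u + 789221u) + 1376312589u) & 0x7fffffffu;
    // h lies in [0, 2^31 - 1], so the result lies in (-1, 1].
    return 1.0 - static_cast<double>(h) / 1073741824.0;
}

// 2D variant: fold (x, y) into one integer first, as in Noise2D above.
double noise2D(int x, int y) {
    return hashNoise(static_cast<std::uint32_t>(x) +
                     static_cast<std::uint32_t>(y) * 57u);
}
```

The function is deterministic: the same (x, y) always yields the same value, which is what allows it to stand in for a precalculated table.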

Cyberspace was becoming overcrowded and slummy so I decided to move. These nice chaps gave me a lift.

Green Gandalf

VIP Member

19

Years of Service

User Offline

Joined: 3rd Jan 2005

Playing: Malevolence:Sword of Ahkranox, Skyrim, Civ6.

Posted: 22nd Feb 2008 01:34 Edited at: 22nd Feb 2008 01:35

Link

dark coder

This code seems to run somewhat faster than your code. It uses 9 octaves numbered 0 to 8 starting with a single quad, i.e. a 2x2 grid of four corner values working up to a grid of 256x256 quads using 257x257 grid points.

This code on my machine takes a total of about 4737 ms whereas your latest code takes about 7000 ms.

` Green Gandalf's Perlin style noise function - Version 6
` Created 25 June 2007, modified 21 February 2008.

`   Uses suggestions from following website
`      http://www.mandelbrot-dazibao.com/Perlin/Perlin1.htm
`   and especially
`      http://mrl.nyu.edu/~perlin/ (and browse to his 2002 paper)

` This version restricts the gradients to +/- 1 as suggested
` by Ken Perlin (in fact there's no need for the rnd() function at all
` - could probably just use a simple method of haphazardly swapping the signs
` of the gradients).

` All four methods work well - but simple linear filtering shows slight seams on final image.

` Attempt at another optimized version.

sync on: sync rate 0: sync
set display mode 800, 600, 32

time0 = timer()

randomize 140549 ` arbitrary fixed number for reproducibility

autocam off
position camera 0, 50, -300
point camera 0, 0, 0

create bitmap 1, 512, 512

global nOct = 8
global twoPiInv as float
twoPiInv = 0.159154943

dim weight#(nOct)

dim rawNoise(511, 511) as float

dim a(512, 512, nOct) as float ` somewhat wasteful in space
dim b(512, 512, nOct) as float
dim interp(511) as float
dim offset(nOct, 512)
dim invSize(nOct) as float
dim tile(nOct)
dim invTile(nOct)

increment = 512
p# = 2.0
for oct = 0 to nOct
  for i = 0 to 512 step increment
    for j = 0 to 512 step increment
      a(i, j, oct) = rnd(1) * 2 - 1  ` random value +/- 1
      b(i, j, oct) = rnd(1) * 2 - 1  ` random value +/- 1
                                             ` not seamless yet
    next j
  next i
  for i = 0 to 512
    offset(oct, i) = i MOD increment
  next i
  weight#(oct) = 1.0/p#^oct
  invSize(oct) = 1.0/increment
  tile(oct) = increment
  invTile(oct) = 512/increment
  increment = increment/2
next oct

for i = 0 to 511
  x# = i/512.0
`  interp(i) = x#                                 ` simple bilinear weight
`  interp(i) = x# * x# * (3.0 - 2.0 * x#)         ` original Perlin weight
`  interp(i) = x# * x# * x# * (6.0 * x# * x# - 15.0 * x# + 10.0) ` revised Perlin weight
  interp(i) = x# - sin(360.0 * x#) * twoPiInv    ` sine weight
next i

time1 = timer()

` calculate raw noise and find scale factors for
` rescaling noise values to byte range
minNoise# = 10000  ` arbitrary large number out of range
maxNoise# = -10000 ` arbitrary small number out of range
for x = 0 to 511
  for y = 0 to 511
    rawNoise(x, y) = noise(x, y)
    if minNoise# > rawNoise(x, y)
      minNoise# = rawNoise(x, y)
    else
      if maxNoise# < rawNoise(x, y) then maxNoise# = rawNoise(x, y)
    endif
  next y
next x

time2 = timer()

f# = 255.0/(maxNoise# - minNoise#)

lock pixels
  for x = 0 to 511
    for y = 0 to 511
      ` convert raw noise to byte range 0 - 255
      c = (rawNoise(x, y) - minNoise#) * f#
      ` just in case
      if c<0 then c=0
      if c>255 then c=255
      dot x, y, rgb(0, c, 0) ` a nice green colour :)
    next y
  next x
unlock pixels

time3 = timer()

copy bitmap 1, 0, 0, 511, 511,  0, 144, 44, 655, 555

set current bitmap 0

time = timer()

repeat
  text 20, 20, "All done in "+str$(time-time0)+" ms"
  text 20, 40, "Table set-up = "+str$(time1-time0)+" ms"
  text 20, 60, "Raw noise set-up = "+str$(time2-time1)+" ms"
  text 20, 80, "Noise rescale and image creation = "+str$(time3-time2)+" ms"
  sync
until spacekey()

set current bitmap 1
get image 1, 0, 0, 512, 512
save image "test v3.png", 1
end

function noiseBase(oct, x, y)
  ` x and y both in range 0 to 511
  ` find x and y offsets within tile
  xd = offset(oct, x)
  yd = offset(oct, y)
  w = tile(oct)
  x# = xd * invSize(oct)  ` number in range 0 to 1
  y# = yd * invSize(oct)
  ` look-up interpolation weights
  ix# = interp(xd*invTile(oct))
  iy# = interp(yd*invTile(oct))
  ` calculate tangents at four corners of tile
  i = x - xd
  j = y - yd
  t0# = a(i, j, oct) * x# + b(i, j, oct) * y#
  t1# = a(i, j + w, oct) * x# + b(i, j + w, oct) * (y# - 1.0)
  t2# = a(i + w , j, oct) * (x# - 1.0) + b(i + w, j, oct) * y#
  t3# = a(i + w, j + w, oct) * (x# - 1.0) + b(i + w, j + w, oct) * (y# - 1.0)
  ` calculate interpolated noise value in two stages
  tA# = t0# + (t2# - t0#) * ix#
  tB# = t1# + (t3# - t1#) * ix#
  result# = tA# + (tb# - tA#) * iy#
endfunction result#

function noise(x, y)
  result# = 0.0
`  result# = noiseBase(2, x, y) ` used for testing single octave instead of next 3 lines
  for oct = 0 to nOct
    result# = noiseBase(oct, x, y) * weight#(oct) + result#
  next oct
endfunction result#
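
The four interpolation weights listed in the `interp(i)` loop can be compared side by side. The C++ sketch below is only an illustration (the function names and the numerical `slope` helper are assumptions, and angles are in radians rather than DBP's degrees). All four map 0 to 0 and 1 to 1, but only the linear weight has a non-zero slope at the tile edges, which is a plausible reason why simple linear filtering is the one that shows seams.

```cpp
#include <cassert>
#include <cmath>

inline float linearWeight(float t)  { return t; }                        // simple bilinear weight
inline float cubicWeight(float t)   { return t * t * (3.0f - 2.0f * t); } // original Perlin weight
inline float quinticWeight(float t) {                                     // revised Perlin weight
    return t * t * t * (t * (6.0f * t - 15.0f) + 10.0f);
}
inline float sineWeight(float t) {                                        // sine weight
    const float kTwoPi = 6.28318530717959f;
    return t - std::sin(kTwoPi * t) / kTwoPi;
}

// Central-difference estimate of a weight function's slope at t.
inline float slope(float (*w)(float), float t, float h = 1e-3f) {
    assert(h > 0.0f);
    return (w(t + h) - w(t - h)) / (2.0f * h);
}
```

For instance, `slope(linearWeight, 0.0f)` is 1 while `slope(cubicWeight, 0.0f)` is approximately 0, so with the smooth weights neighbouring tiles meet without a crease.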

I haven't investigated the diagonal line issue yet - it could be the rnd(1) calls but I'm not sure.

Your comment about the low order octaves not always being desirable is a good one and hadn't occurred to me. You can obviously get somewhat different effects by modifying the relative weights of the different octaves - and omitting a few entirely is obviously a special case.

There is no way I can compete with your C++ achievements. (Yet?)

TinTin

Quote: "it's better known as 'Gradient Noise'"

I've never heard it called that before - but it's obviously appropriate.

Green Gandalf

VIP Member

19

Years of Service

User Offline

Joined: 3rd Jan 2005

Playing: Malevolence:Sword of Ahkranox, Skyrim, Civ6.

Posted: 22nd Feb 2008 12:21 Edited at: 22nd Apr 2012 13:53

Link

Quote: "I haven't investigated the diagonal line issue yet - it could be the rnd(1) calls but I'm not sure."

Yep. That was the problem.

DBP must do something silly when computing "rnd(1)".

Replacing the following code:

+ Code Snippet

      a(i, j, oct) = rnd(1) * 2 - 1  ` random value +/- 1
      b(i, j, oct) = rnd(1) * 2 - 1  ` random value +/- 1

with:

+ Code Snippet

      a(i, j, oct) = rnd(1000) - 500 ` random value in range -500 to 500
      b(i, j, oct) = rnd(1000) - 500 ` random value in range -500 to 500

fixes the problem with similar timings.

I still like the simplicity of working with a random binary sequence, so I'll see if I can find a fast pseudo-random binary sequence generator. It would need to be significantly faster than DBP's rnd() command to be worthwhile - and even then, looking at the timings using my previous code snippet, not much can be saved anyway.
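
One candidate for such a generator, shown here as a C++ sketch rather than anything tested in DBP, is a 32-bit xorshift generator (Marsaglia's 13/17/5 variant) with only the top bit of each state used as the sign. `SignGenerator` is a made-up name, and whether this would actually beat DBP's `rnd()` in practice is untested.

```cpp
#include <cassert>
#include <cstdint>

// Pseudo-random +1 / -1 sequence from a 32-bit xorshift generator.
struct SignGenerator {
    std::uint32_t state;

    // The seed must be non-zero: a zero state would stay zero forever.
    explicit SignGenerator(std::uint32_t seed) : state(seed) {
        assert(seed != 0u);
    }

    // Advance the generator and return +1 or -1 based on the top bit.
    int next() {
        state ^= state << 13;
        state ^= state >> 17;
        state ^= state << 5;
        return (state >> 31) ? 1 : -1;
    }
};
```

A call such as `SignGenerator gen(140549u);` followed by `gen.next()` for each of the `a` and `b` table entries would fill the gradients with +/- 1 using only shifts and XORs.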

Edit Fixed spelling typo.

Green Gandalf

VIP Member

19

Years of Service

User Offline

Joined: 3rd Jan 2005

Playing: Malevolence:Sword of Ahkranox, Skyrim, Civ6.

Posted: 22nd Feb 2008 13:19

Link

Of course, my old diamond-square algorithm for producing noisy cloud-like images is even faster - and you get three noise images for the price of one, i.e. one for each colour channel.

Here's a slightly updated version of the diamond-square code:

+ Code Snippet

` Green Gandalf's fractal noise texture demo v2
`
` Created 13 Sep 2006, modified 22 February 2008.

` Creates a noise image using the diamond-square algorithm.
` Each colour channel contains its own noise image.
` Produces THREE noise images far faster than Perlin noise
` produces ONE.

set display mode 800,600,32
set text opaque
sync on: sync rate 60: sync
autocam off

time = timer()

set cursor 20,20

n=9: dispersion#=512: k=0 ` n=8 gives a 256x256 bitmap

maxgrid=2^n: g=maxgrid
dim col(3,maxgrid,maxgrid) as float
base as dword
global cx#: global cy#: global cz#

im=1: gosub makeImage ` prepare initial image

end

make_bitmap:
  create bitmap im,maxgrid+1,maxgrid+1
  lock pixels
    for i=0 to maxgrid
      for j=0 to maxgrid
        gosub dotij
      next j
    next i
  unlock pixels
  ` save base image
  get image 1, 0, 0, maxgrid, maxgrid
  if file exist("baseImage.png") then delete file "baseImage.png"
  save image "baseImage.png",1
  set current bitmap 0
  cls
  copy bitmap im,0
  time = timer() - time
  center text screen width()/2,screen height()/2+20,"Fractal image created in "+str$(time)+" ms"
  center text screen width()/2,screen height()/2+40,"Press any key to exit"
  sync
  wait key
return

dotij:
  dot i,j,rgb(col(1,i,j),col(2,i,j),col(3,i,j))
return

diamond_step:
  i=mid: i1=i-mid: i2=i+mid
  while i<maxgrid
    j=mid: j1=j-mid: j2=j+mid
    while j<maxgrid
      av#=(col(r,i1,j1)+col(r,i1,j2)+col(r,i2,j1)+col(r,i2,j2))/4.0
      ` calculate random values between -1 and 1
      u#=rnd(16384)/8192.0-1.0
      col(r,i,j)=av#+u#*d#`+m#
      inc j,g: inc j1,g: inc j2,g
    endwhile
    inc i,g: inc i1,g: inc i2,g
  endwhile
return

square_step:
  i=0: i1=i-mid: i2=i+mid: js=0
  while i<maxgrid
    js=mid-js ` toggle start values of j loop
    j=js: j1=j-mid: j2=j+mid
    while j<maxgrid`+1
      av#=0
      if i1<0  ` check for need to wrap around i value
        inc av#,col(r,i2,j)+col(r,i2,j)
      else
        if i2>maxgrid
          inc av#,col(r,i1,j)+col(r,i1,j)
        else
          inc av#,col(r,i1,j)+col(r,i2,j)
        endif
      endif
      if j1<0  ` check for need to wrap around j value
        inc av#,col(r,i,j2)+col(r,i,j2)
      else
        if j2>maxgrid
          inc av#,col(r,i,j1)+col(r,i,j1)
        else
          inc av#,col(r,i,j1)+col(r,i,j2)
        endif
      endif
      av#=av#/4
      ` calculate random value between -1 and 1
      u#=rnd(16384)/8192.0-1.0
      col(r,i,j)=av#+u#*d#`+m#
      col(r,maxgrid,j)=col(r,0,j) ` copy opposite edge
      inc j,g: inc j1,g: inc j2,g
    endwhile
    if j=maxgrid then col(r,i,j)=col(r,i,0) ` copy opposite edge
    inc i,mid: inc i1,mid: inc i2,mid
  endwhile
return

makeImage:
  ` Produces a fractal image based on the "diamond-square algorithm"
  ` described in following website
  `          http://www.gameprogrammer.com/fractal.html
  cls
  randomize timer()
  for r=1 to 3  ` process each colour component separately
    g=maxgrid
    d#=dispersion#
    for i=0 to maxgrid
      for j=0 to maxgrid
        col(r,i,j)=k
      next j
    next i

    ` main loop
    while g>1
      mid=g/2

      ` diamond step - calculates new diamond corners from squares
      gosub diamond_step

      ` square step - calculates new square corners from diamonds
      gosub square_step

      d#=d#/2.0: m#=m#/2.0: g=g/2
    endwhile

    ` now scale heightmap values to byte range
    min#=col(r,0,0): max#=min#
    for i=0 to maxgrid
      for j=0 to maxgrid
        u#=col(r,i,j)
        if u#<min#
          min#=u#
        else
          if u#>max# then max#=u#
        endif
      next j
    next i
    max#=256.0/(max#-min#)
    for i=0 to maxgrid
      for j=0 to maxgrid
        temp=int((col(r,i,j)-min#)*max#)
        ` range check for byte just in case
        if temp<0
          col(r,i,j)=0
        else
          if temp>255
            col(r,i,j)=255
          else
            col(r,i,j)=temp
          endif
        endif
      next j
    next i
  next r

  gosub make_bitmap

return

Of course, I could try to optimise that as well - a casual glance suggests several candidates for optimisation. And I'd like to re-write it using functions rather than gosubs. Enough for now though.
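
As a first step towards functions, the random displacement that both the diamond and square steps calculate could be pulled out on its own. A minimal sketch; randomOffset is a hypothetical name:

```darkbasic
` Sketch only: the random displacement used in both steps, as a function.
` randomOffset is a hypothetical name; d# is the current dispersion.
function randomOffset(d#)
  ` rnd(16384) returns an integer from 0 to 16384,
  ` so u# lies between -1 and 1
  u# = rnd(16384) / 8192.0 - 1.0
  result# = u# * d#
endfunction result#
```

Both steps would then set col(r,i,j)=av#+randomOffset(d#). As with any DBP function, the declaration has to sit below the end command so that the main program never runs into it.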

And here's an image created by the diamond-square code above.

jason p sage

16

Years of Service

User Offline

Joined: 10th Jun 2007

Location: Ellington, CT USA

Posted: 22nd Feb 2008 14:50

NICE! That's Perfect!

http://www.jasonpetersage.com/?PAGE=ironinfantry&SECTION=ironinfantry
