ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/OpenCL/OpenCL.pm
Revision: 1.69
Committed: Thu May 3 23:30:08 2012 UTC (12 years ago) by root
Branch: MAIN
Changes since 1.68: +42 -8 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.5 OpenCL - Open Computing Language Bindings
4 root 1.1
5     =head1 SYNOPSIS
6    
7     use OpenCL;
8    
9     =head1 DESCRIPTION
10    
11 root 1.7 This is an early release which might be useful, but hasn't seen much testing.
12 root 1.1
13 root 1.9 =head2 OpenCL FROM 10000 FEET HEIGHT
14    
15     Here is a high level overview of OpenCL:
16    
17     First you need to find one or more OpenCL::Platforms (kind of like
18     vendors) - usually there is only one.
19    
20     Each platform gives you access to a number of OpenCL::Device objects, e.g.
21     your graphics card.
22    
23 root 1.11 From a platform and some device(s), you create an OpenCL::Context, which is
24 root 1.9 a very central object in OpenCL: Once you have a context you can create
25     most other objects:
26    
27 root 1.11 OpenCL::Program objects, which store source code and, after building for a
28     specific device ("compiling and linking"), also binary programs. For each
29     kernel function in a program you can then create an OpenCL::Kernel object
30     which represents basically a function call with argument values.
31 root 1.9
32 root 1.20 OpenCL::Memory objects of various flavours: OpenCL::Buffer objects (flat
33 root 1.16 memory areas, think arrays or structs) and OpenCL::Image objects (think 2d
34     or 3d array) for bulk data and input and output for kernels.
35 root 1.9
36     OpenCL::Sampler objects, which are kind of like texture filter modes in
37     OpenGL.
38    
39     OpenCL::Queue objects - command queues, which allow you to submit memory
40     reads, writes and copies, as well as kernel calls to your devices. They
41     also offer a variety of methods to synchronise request execution, for
42     example with barriers or OpenCL::Event objects.
43    
44     OpenCL::Event objects are used to signal when something is complete.
45    
46     =head2 HELPFUL RESOURCES
47 root 1.3
48 root 1.57 The OpenCL specs used to develop this module:
49 root 1.3
50     http://www.khronos.org/registry/cl/specs/opencl-1.1.pdf
51 root 1.57 http://www.khronos.org/registry/cl/specs/opencl-1.2.pdf
52     http://www.khronos.org/registry/cl/specs/opencl-1.2-extensions.pdf
53 root 1.3
54     OpenCL manpages:
55    
56     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/
57 root 1.57 http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/
58 root 1.3
59 root 1.18 If you are into UML class diagrams, the following diagram might help - if
60 root 1.57 not, it will be mildly confusing (also, the class hierarchy of this module
61     is much more fine-grained):
62 root 1.18
63 root 1.57 http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/classDiagram.html
64 root 1.18
65 root 1.16 Here's a tutorial from AMD (very AMD-centric, too), not sure how useful it
66     is, but at least it's free of charge:
67    
68     http://developer.amd.com/zones/OpenCLZone/courses/Documents/Introduction_to_OpenCL_Programming%20Training_Guide%20%28201005%29.pdf
69    
70 root 1.18 And here's NVIDIA's OpenCL Best Practises Guide:
71 root 1.16
72 root 1.18 http://developer.download.nvidia.com/compute/cuda/3_2/toolkit/docs/OpenCL_Best_Practices_Guide.pdf
73 root 1.16
74 root 1.9 =head1 BASIC WORKFLOW
75    
76 root 1.11 To get something done, you basically have to do this once (refer to the
77     examples below for actual code, this is just a high-level description):
78 root 1.9
79 root 1.11 Find some platform (e.g. the first one) and some device(s) (e.g. the first
80     device of the platform), and create a context from those.
81 root 1.9
82 root 1.11 Create program objects from your OpenCL source code, then build (compile)
83     the programs for each device you want to run them on.
84 root 1.9
85 root 1.11 Create kernel objects for all kernels you want to use (surprisingly, these
86     are not device-specific).
87 root 1.9
88 root 1.11 Then, to execute stuff, you repeat these steps, possibly resuing or
89     sharing some buffers:
90 root 1.9
91 root 1.11 Create some input and output buffers from your context. Set these as
92     arguments to your kernel.
93    
94     Enqueue buffer writes to initialise your input buffers (when not
95     initialised at creation time).
96 root 1.9
97     Enqueue the kernel execution.
98    
99     Enqueue buffer reads for your output buffer to read results.
100    
101 root 1.3 =head1 EXAMPLES
102    
103 root 1.5 =head2 Enumerate all devices and get contexts for them.
104 root 1.1
105 root 1.11 Best run this once to get a feel for the platforms and devices in your
106     system.
107    
108 root 1.1 for my $platform (OpenCL::platforms) {
109 root 1.24 printf "platform: %s\n", $platform->name;
110     printf "extensions: %s\n", $platform->extensions;
111 root 1.1 for my $device ($platform->devices) {
112 root 1.24 printf "+ device: %s\n", $device->name;
113 root 1.29 my $ctx = $platform->context (undef, [$device]);
114 root 1.1 # do stuff
115     }
116     }
117    
118 root 1.5 =head2 Get a useful context and a command queue.
119 root 1.1
120 root 1.11 This is a useful boilerplate for any OpenCL program that only wants to use
121     one device,
122    
123     my ($platform) = OpenCL::platforms; # find first platform
124     my ($dev) = $platform->devices; # find first device of platform
125     my $ctx = $platform->context (undef, [$dev]); # create context out of those
126     my $queue = $ctx->queue ($dev); # create a command queue for the device
127 root 1.1
128 root 1.5 =head2 Print all supported image formats of a context.
129    
130 root 1.11 Best run this once for your context, to see whats available and how to
131     gather information.
132    
133 root 1.5 for my $type (OpenCL::MEM_OBJECT_IMAGE2D, OpenCL::MEM_OBJECT_IMAGE3D) {
134 root 1.10 print "supported image formats for ", OpenCL::enum2str $type, "\n";
135 root 1.5
136     for my $f ($ctx->supported_image_formats (0, $type)) {
137     printf " %-10s %-20s\n", OpenCL::enum2str $f->[0], OpenCL::enum2str $f->[1];
138     }
139     }
140    
141     =head2 Create a buffer with some predefined data, read it back synchronously,
142     then asynchronously.
143 root 1.3
144     my $buf = $ctx->buffer_sv (OpenCL::MEM_COPY_HOST_PTR, "helmut");
145    
146 root 1.59 $queue->read_buffer ($buf, 1, 1, 3, my $data);
147 root 1.10 print "$data\n";
148 root 1.3
149 root 1.59 my $ev = $queue->read_buffer ($buf, 0, 1, 3, my $data);
150 root 1.3 $ev->wait;
151 root 1.10 print "$data\n"; # prints "elm"
152 root 1.3
153 root 1.5 =head2 Create and build a program, then create a kernel out of one of its
154     functions.
155 root 1.3
156     my $src = '
157 root 1.31 kernel void
158     squareit (global float *input, global float *output)
159 root 1.3 {
160 root 1.15 $id = get_global_id (0);
161 root 1.3 output [id] = input [id] * input [id];
162     }
163     ';
164    
165 root 1.51 my $prog = $ctx->build_program ($src);
166 root 1.3 my $kernel = $prog->kernel ("squareit");
167    
168 root 1.11 =head2 Create some input and output float buffers, then call the
169     'squareit' kernel on them.
170 root 1.4
171     my $input = $ctx->buffer_sv (OpenCL::MEM_COPY_HOST_PTR, pack "f*", 1, 2, 3, 4.5);
172     my $output = $ctx->buffer (0, OpenCL::SIZEOF_FLOAT * 5);
173    
174     # set buffer
175     $kernel->set_buffer (0, $input);
176     $kernel->set_buffer (1, $output);
177    
178     # execute it for all 4 numbers
179 root 1.59 $queue->nd_range_kernel ($kernel, undef, [4], undef);
180 root 1.4
181 root 1.5 # enqueue a synchronous read
182 root 1.59 $queue->read_buffer ($output, 1, 0, OpenCL::SIZEOF_FLOAT * 4, my $data);
183 root 1.5
184     # print the results:
185 root 1.10 printf "%s\n", join ", ", unpack "f*", $data;
186 root 1.5
187     =head2 The same enqueue operations as before, but assuming an out-of-order queue,
188     showing off barriers.
189    
190     # execute it for all 4 numbers
191 root 1.59 $queue->nd_range_kernel ($kernel, undef, [4], undef);
192 root 1.5
193     # enqueue a barrier to ensure in-order execution
194 root 1.59 $queue->barrier;
195 root 1.4
196 root 1.5 # enqueue an async read
197 root 1.59 $queue->read_buffer ($output, 0, 0, OpenCL::SIZEOF_FLOAT * 4, my $data);
198 root 1.5
199     # wait for all requests to finish
200     $queue->finish;
201    
202     =head2 The same enqueue operations as before, but assuming an out-of-order queue,
203     showing off event objects and wait lists.
204    
205     # execute it for all 4 numbers
206 root 1.59 my $ev = $queue->nd_range_kernel ($kernel, undef, [4], undef);
207 root 1.5
208     # enqueue an async read
209 root 1.59 $ev = $queue->read_buffer ($output, 0, 0, OpenCL::SIZEOF_FLOAT * 4, my $data, $ev);
210 root 1.5
211     # wait for the last event to complete
212 root 1.4 $ev->wait;
213    
214 root 1.38 =head2 Use the OpenGL module to share a texture between OpenCL and OpenGL and draw some julia
215     set tunnel effect.
216    
217 root 1.64 This is quite a long example to get you going - you can download it from
218     L<http://cvs.schmorp.de/OpenCL/examples/juliaflight>.
219 root 1.38
220     use OpenGL ":all";
221     use OpenCL;
222    
223 root 1.64 my $S = $ARGV[0] || 256; # window/texture size, smaller is faster
224    
225 root 1.38 # open a window and create a gl texture
226 root 1.64 OpenGL::glpOpenWindow width => $S, height => $S;
227 root 1.38 my $texid = glGenTextures_p 1;
228     glBindTexture GL_TEXTURE_2D, $texid;
229 root 1.64 glTexImage2D_c GL_TEXTURE_2D, 0, GL_RGBA8, $S, $S, 0, GL_RGBA, GL_UNSIGNED_BYTE, 0;
230 root 1.38
231     # find and use the first opencl device that let's us get a shared opengl context
232     my $platform;
233     my $dev;
234     my $ctx;
235    
236     for (OpenCL::platforms) {
237     $platform = $_;
238     for ($platform->devices) {
239     $dev = $_;
240     $ctx = $platform->context ([OpenCL::GLX_DISPLAY_KHR, undef, OpenCL::GL_CONTEXT_KHR, undef], [$dev])
241     and last;
242     }
243     }
244    
245     $ctx
246     or die "cannot find suitable OpenCL device\n";
247    
248     my $queue = $ctx->queue ($dev);
249    
250     # now attach an opencl image2d object to the opengl texture
251     my $tex = $ctx->gl_texture2d (OpenCL::MEM_WRITE_ONLY, GL_TEXTURE_2D, 0, $texid);
252    
253     # now the boring opencl code
254     my $src = <<EOF;
255     kernel void
256     juliatunnel (write_only image2d_t img, float time)
257     {
258 root 1.64 int2 xy = (int2)(get_global_id (0), get_global_id (1));
259     float2 p = convert_float2 (xy) / $S.f * 2.f - 1.f;
260 root 1.38
261 root 1.64 float2 m = (float2)(1.f, p.y) / fabs (p.x); // tunnel
262     m.x = fabs (fmod (m.x + time * 0.05f, 4.f) - 2.f);
263 root 1.38
264     float2 z = m;
265 root 1.64 float2 c = (float2)(sin (time * 0.01133f), cos (time * 0.02521f));
266 root 1.38
267 root 1.64 for (int i = 0; i < 25 && dot (z, z) < 4.f; ++i) // standard julia
268 root 1.38 z = (float2)(z.x * z.x - z.y * z.y, 2.f * z.x * z.y) + c;
269    
270 root 1.64 float3 colour = (float3)(z.x, z.y, atan2 (z.y, z.x));
271     write_imagef (img, xy, (float4)(colour * p.x * p.x, 1.));
272 root 1.38 }
273     EOF
274    
275 root 1.51 my $prog = $ctx->build_program ($src);
276 root 1.38 my $kernel = $prog->kernel ("juliatunnel");
277    
278     # program compiled, kernel ready, now draw and loop
279    
280     for (my $time; ; ++$time) {
281     # acquire objects from opengl
282 root 1.59 $queue->acquire_gl_objects ([$tex]);
283 root 1.38
284     # configure and run our kernel
285 root 1.64 $kernel->setf ("mf", $tex, $time*2); # mf = memory object, float
286     $queue->nd_range_kernel ($kernel, undef, [$S, $S], undef);
287 root 1.38
288     # release objects to opengl again
289 root 1.59 $queue->release_gl_objects ([$tex]);
290 root 1.38
291     # wait
292 root 1.40 $queue->finish;
293 root 1.38
294     # now draw the texture, the defaults should be all right
295     glTexParameterf GL_TEXTURE_2D, GL_TEXTURE_MIN_FILTER, GL_NEAREST;
296    
297     glEnable GL_TEXTURE_2D;
298     glBegin GL_QUADS;
299     glTexCoord2f 0, 1; glVertex3i -1, -1, -1;
300     glTexCoord2f 0, 0; glVertex3i 1, -1, -1;
301     glTexCoord2f 1, 0; glVertex3i 1, 1, -1;
302     glTexCoord2f 1, 1; glVertex3i -1, 1, -1;
303     glEnd;
304    
305     glXSwapBuffers;
306    
307     select undef, undef, undef, 1/60;
308     }
309    
310 root 1.66 =head2 How to modify the previous example to not rely on GL sharing.
311 root 1.65
312     For those poor souls with only a sucky CPU OpenCL implementation, you
313     currently have to read the image into some perl scalar, and then modify a
314     texture or use glDrawPixels or so).
315    
316     First, when you don't need gl sharing, you can create the context much simpler:
317    
318     $ctx = $platform->context (undef, [$dev])
319    
320     To use a texture, you would modify the above example by creating an
321     OpenCL::Image manually instead of deriving it from a texture:
322    
323     my $tex = $ctx->image2d (OpenCL::MEM_WRITE_ONLY, OpenCL::RGBA, OpenCL::UNORM_INT8, $S, $S);
324    
325     And in the darw loop, intead of acquire_gl_objects/release_gl_objects, you
326     would read the image2d after the kernel has written it:
327    
328     $queue->read_image ($tex, 0, 0, 0, 0, $S, $S, 1, 0, 0, my $data);
329    
330     And then you would upload the pixel data to the texture (or use glDrawPixels):
331    
332     glTexSubImage2D_s GL_TEXTURE_2D, 0, 0, 0, $S, $S, GL_RGBA, GL_UNSIGNED_BYTE, $data;
333    
334     The fully modified example can be found at
335     L<http://cvs.schmorp.de/OpenCL/examples/juliaflight-nosharing>.
336    
337 root 1.5 =head1 DOCUMENTATION
338    
339     =head2 BASIC CONVENTIONS
340    
341 root 1.14 This is not a one-to-one C-style translation of OpenCL to Perl - instead
342     I attempted to make the interface as type-safe as possible by introducing
343 root 1.5 object syntax where it makes sense. There are a number of important
344     differences between the OpenCL C API and this module:
345    
346     =over 4
347    
348     =item * Object lifetime managament is automatic - there is no need
349     to free objects explicitly (C<clReleaseXXX>), the release function
350     is called automatically once all Perl references to it go away.
351    
352 root 1.20 =item * OpenCL uses CamelCase for function names
353     (e.g. C<clGetPlatformIDs>, C<clGetPlatformInfo>), while this module
354     uses underscores as word separator and often leaves out prefixes
355     (C<OpenCL::platforms>, C<< $platform->info >>).
356 root 1.5
357     =item * OpenCL often specifies fixed vector function arguments as short
358 root 1.19 arrays (C<size_t origin[3]>), while this module explicitly expects the
359     components as separate arguments (C<$orig_x, $orig_y, $orig_z>) in
360     function calls.
361 root 1.5
362 root 1.19 =item * Structures are often specified by flattening out their components
363     as with short vectors, and returned as arrayrefs.
364 root 1.5
365     =item * When enqueuing commands, the wait list is specified by adding
366 root 1.9 extra arguments to the function - anywhere a C<$wait_events...> argument
367 root 1.44 is documented this can be any number of event objects. As an extsnion
368     implemented by this module, C<undef> values will be ignored in the event
369     list.
370 root 1.5
371     =item * When enqueuing commands, if the enqueue method is called in void
372     context, no event is created. In all other contexts an event is returned
373     by the method.
374    
375     =item * This module expects all functions to return C<CL_SUCCESS>. If any
376     other status is returned the function will throw an exception, so you
377     don't normally have to to any error checking.
378    
379     =back
380    
381 root 1.7 =head2 PERL AND OPENCL TYPES
382    
383 root 1.8 This handy(?) table lists OpenCL types and their perl, PDL and pack/unpack
384 root 1.7 format equivalents:
385    
386 root 1.8 OpenCL perl PDL pack/unpack
387     char IV - c
388     uchar IV byte C
389     short IV short s
390     ushort IV ushort S
391     int IV long? l
392     uint IV - L
393     long IV longlong q
394     ulong IV - Q
395     float NV float f
396     half IV ushort S
397     double NV double d
398 root 1.7
399 root 1.36 =head2 GLX SUPPORT
400    
401     Due to the sad state that OpenGL support is in in Perl (mostly the OpenGL
402     module, which has little to no documentation and has little to no support
403 root 1.38 for glX), this module, as a special extension, treats context creation
404 root 1.36 properties C<OpenCL::GLX_DISPLAY_KHR> and C<OpenCL::GL_CONTEXT_KHR>
405     specially: If either or both of these are C<undef>, then the OpenCL
406 root 1.38 module tries to dynamically resolve C<glXGetCurrentDisplay> and
407     C<glXGetCurrentContext>, call these functions and use their return values
408 root 1.36 instead.
409    
410     For this to work, the OpenGL library must be loaded, a GLX context must
411     have been created and be made current, and C<dlsym> must be available and
412     capable of finding the function via C<RTLD_DEFAULT>.
413    
414 root 1.55 =head2 EVENT SYSTEM
415    
416     OpenCL can generate a number of (potentially) asynchronous events, for
417     example, after compiling a program, to signal a context-related error or,
418     perhaps most important, to signal completion of queued jobs (by setting
419     callbacks on OpenCL::Event objects).
420    
421     To facilitate this, this module maintains an event queue - each
422     time an asynchronous event happens, it is queued, and perl will be
423     interrupted. This is implemented via the L<Async::Interrupt> module. In
424     addition, this module has L<AnyEvent> support, so it can seamlessly
425     integrate itself into many event loops.
426    
427     Since this module is a bit hard to understand, here are some case examples:
428    
429     =head3 Don't use callbacks.
430    
431     When your program never uses any callbacks, then there will never be any
432     notifications you need to take care of, and therefore no need to worry
433     about all this.
434    
435     You can achieve a great deal by explicitly waiting for events, or using
436     barriers and flush calls. In many programs, there is no need at all to
437     tinker with asynchronous events.
438    
439     =head3 Use AnyEvent
440    
441     This module automatically registers a watcher that invokes all outstanding
442     event callbacks when AnyEvent is initialised (and block asynchronous
443     interruptions). Using this mode of operations is the safest and most
444     recommended one.
445    
446     To use this, simply use AnyEvent and this module normally, make sure you
447     have an event loop running:
448    
449     use Gtk2 -init;
450     use AnyEvent;
451    
452     # initialise AnyEvent, by creating a watcher, or:
453     AnyEvent::detect;
454    
455 root 1.59 my $e = $queue->marker;
456 root 1.55 $e->cb (sub {
457     warn "opencl is finished\n";
458     })
459    
460     main Gtk2;
461    
462     Note that this module will not initialise AnyEvent for you. Before
463     AnyEvent is initialised, the module will asynchronously interrupt perl
464     instead. To avoid any surprises, it's best to explicitly initialise
465     AnyEvent.
466    
467     You can temporarily enable asynchronous interruptions (see next paragraph)
468     by calling C<$OpenCL::INTERRUPT->unblock> and disable them again by
469     calling C<$OpenCL::INTERRUPT->block>.
470    
471     =head3 Let yourself be interrupted at any time
472    
473     This mode is the default unless AnyEvent is loaded and initialised. In
474     this mode, OpenCL asynchronously interrupts a running perl program. The
475     emphasis is on both I<asynchronously> and I<running> here.
476    
477     Asynchronously means that perl might execute your callbacks at any
478     time. For example, in the following code (I<THAT YOU SHOULD NOT COPY>),
479     the C<until> loop following the marker call will be interrupted by the
480     callback:
481    
482 root 1.59 my $e = $queue->marker;
483 root 1.55 my $flag;
484     $e->cb (sub { $flag = 1 });
485     1 until $flag;
486     # $flag is now 1
487    
488     The reason why you shouldn't blindly copy the above code is that
489     busy waiting is a really really bad thing, and really really bad for
490     performance.
491    
492     While at first this asynchronous business might look exciting, it can be
493     really hard, because you need to be prepared for the callback code to be
494     executed at any time, which limits the amount of things the callback code
495     can do safely.
496    
497     This can be mitigated somewhat by using C<<
498     $OpenCL::INTERRUPT->scope_block >> (see the L<Async::Interrupt>
499     documentation for details).
500    
501     The other problem is that your program must be actively I<running> to be
502     interrupted. When you calculate stuff, your program is running. When you
503     hang in some C functions or other block execution (by calling C<sleep>,
504     C<select>, running an event loop and so on), your program is waiting, not
505     running.
506    
507     One way around that would be to attach a read watcher to your event loop,
508     listening for events on C<< $OpenCL::INTERRUPT->pipe_fileno >>, using a
509     dummy callback (C<sub { }>) to temporarily execute some perl code.
510    
511     That is then awfully close to using the built-in AnyEvent support above,
512     though, so consider that one instead.
513    
514     =head3 Be creative
515    
516     OpenCL exports the L<Async::Interrupt> object it uses in the global
517     variable C<$OpenCL::INTERRUPT>. You can configure it in any way you like.
518    
519     So if you want to feel like a real pro, err, wait, if you feel no risk
520     menas no fun, you can experiment by implementing your own mode of
521     operations.
522    
523 root 1.52 =cut
524    
525     package OpenCL;
526    
527     use common::sense;
528 root 1.62 use Carp ();
529 root 1.55 use Async::Interrupt ();
530    
531     our $POLL_FUNC; # set by XS
532 root 1.52
533     BEGIN {
534 root 1.62 our $VERSION = '0.98';
535 root 1.52
536     require XSLoader;
537     XSLoader::load (__PACKAGE__, $VERSION);
538    
539     @OpenCL::Platform::ISA =
540     @OpenCL::Device::ISA =
541     @OpenCL::Context::ISA =
542     @OpenCL::Queue::ISA =
543     @OpenCL::Memory::ISA =
544     @OpenCL::Sampler::ISA =
545     @OpenCL::Program::ISA =
546     @OpenCL::Kernel::ISA =
547     @OpenCL::Event::ISA = OpenCL::Object::;
548    
549     @OpenCL::Buffer::ISA =
550     @OpenCL::Image::ISA = OpenCL::Memory::;
551    
552     @OpenCL::BufferObj::ISA = OpenCL::Buffer::;
553    
554     @OpenCL::Image2D::ISA =
555     @OpenCL::Image3D::ISA =
556     @OpenCL::Image2DArray::ISA =
557     @OpenCL::Image1D::ISA =
558     @OpenCL::Image1DArray::ISA =
559     @OpenCL::Image1DBuffer::ISA = OpenCL::Image::;
560    
561     @OpenCL::UserEvent::ISA = OpenCL::Event::;
562 root 1.66
563 root 1.67 @OpenCL::MappedBuffer::ISA =
564     @OpenCL::MappedImage::ISA = OpenCL::Mapped::;
565 root 1.52 }
566    
567 root 1.5 =head2 THE OpenCL PACKAGE
568    
569     =over 4
570    
571     =item $int = OpenCL::errno
572    
573 root 1.11 The last error returned by a function - it's only valid after an error occured
574     and before calling another OpenCL function.
575 root 1.5
576 root 1.61 =item $str = OpenCL::err2str [$errval]
577 root 1.5
578 root 1.61 Converts an error value into a human readable string. IF no error value is
579     given, then the last error will be used (as returned by OpenCL::errno).
580 root 1.5
581 root 1.9 =item $str = OpenCL::enum2str $enum
582 root 1.5
583 root 1.30 Converts most enum values (of parameter names, image format constants,
584 root 1.5 object types, addressing and filter modes, command types etc.) into a
585 root 1.30 human readable string. When confronted with some random integer it can be
586 root 1.5 very helpful to pass it through this function to maybe get some readable
587     string out of it.
588    
589     =item @platforms = OpenCL::platforms
590    
591     Returns all available OpenCL::Platform objects.
592    
593     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformIDs.html>
594    
595 root 1.56 =item $ctx = OpenCL::context_from_type $properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $callback->($err, $pvt) = $print_stderr
596 root 1.5
597 root 1.56 Tries to create a context from a default device and platform type - never worked for me.
598 root 1.5
599     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html>
600    
601 root 1.56 =item $ctx = OpenCL::context $properties, \@devices, $callback->($err, $pvt) = $print_stderr)
602    
603     Create a new OpenCL::Context object using the given device object(s). This
604     function isn't implemented yet, use C<< $platform->context >> instead.
605    
606     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContext.html>
607    
608 root 1.5 =item OpenCL::wait_for_events $wait_events...
609    
610     Waits for all events to complete.
611    
612     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html>
613    
614 root 1.55 =item OpenCL::poll
615    
616     Checks if there are any outstanding events (see L<EVENT SYSTEM>) and
617     invokes their callbacks.
618    
619     =item $OpenCL::INTERRUPT
620    
621     The L<Async::Interrupt> object used to signal asynchronous events (see
622     L<EVENT SYSTEM>).
623    
624     =cut
625    
626     our $INTERRUPT = new Async::Interrupt c_cb => [$POLL_FUNC, 0];
627    
628     &_eq_initialise ($INTERRUPT->signal_func);
629    
630     =item $OpenCL::WATCHER
631    
632     The L<AnyEvent> watcher object used to watch for asynchronous events (see
633     L<EVENT SYSTEM>). This variable is C<undef> until L<AnyEvent> has been
634     loaded I<and> initialised (e.g. by calling C<AnyEvent::detect>).
635    
636     =cut
637    
638     our $WATCHER;
639    
640     sub _init_anyevent {
641     $INTERRUPT->block;
642     $WATCHER = AE::io ($INTERRUPT->pipe_fileno, 0, sub { $INTERRUPT->handle });
643     }
644    
645     if (defined $AnyEvent::MODEL) {
646     _init_anyevent;
647     } else {
648     push @AnyEvent::post_detect, \&_init_anyevent;
649     }
650    
651 root 1.5 =back
652    
653 root 1.52 =head2 THE OpenCL::Object CLASS
654    
655     This is the base class for all objects in the OpenCL module. The only
656     method it implements is the C<id> method, which is only useful if you want
657     to interface to OpenCL on the C level.
658    
659     =over 4
660    
661     =item $iv = $obj->id
662    
663     OpenCL objects are represented by pointers or integers on the C level. If
664     you want to interface to an OpenCL object directly on the C level, then
665     you need this value, which is returned by this method. You should use an
666     C<IV> type in your code and cast that to the correct type.
667    
668     =cut
669    
670     sub OpenCL::Object::id {
671 root 1.55 ref $_[0] eq "SCALAR"
672     ? ${ $_[0] }
673     : $_[0][0]
674 root 1.52 }
675    
676     =back
677    
678 root 1.5 =head2 THE OpenCL::Platform CLASS
679    
680     =over 4
681    
682     =item @devices = $platform->devices ($type = OpenCL::DEVICE_TYPE_ALL)
683    
684     Returns a list of matching OpenCL::Device objects.
685    
686 root 1.56 =item $ctx = $platform->context_from_type ($properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $callback->($err, $pvt) = $print_stderr)
687 root 1.5
688 root 1.22 Tries to create a context. Never worked for me, and you need devices explicitly anyway.
689 root 1.5
690     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html>
691    
692 root 1.56 =item $ctx = $platform->context ($properties, \@devices, $callback->($err, $pvt) = $print_stderr)
693 root 1.11
694     Create a new OpenCL::Context object using the given device object(s)- a
695     CL_CONTEXT_PLATFORM property is supplied automatically.
696    
697     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContext.html>
698    
699 root 1.20 =item $packed_value = $platform->info ($name)
700    
701     Calls C<clGetPlatformInfo> and returns the packed, raw value - for
702 root 1.22 strings, this will be the string (possibly including terminating \0), for
703     other values you probably need to use the correct C<unpack>.
704 root 1.20
705 root 1.22 It's best to avoid this method and use one of the following convenience
706     wrappers.
707 root 1.20
708     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformInfo.html>
709    
710 root 1.50 =item $platform->unload_compiler
711    
712     Attempts to unload the compiler for this platform, for endless
713     profit. Does nothing on OpenCL 1.1.
714    
715     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clUnloadPlatformCompiler.html>
716    
717 root 1.20 =for gengetinfo begin platform
718    
719     =item $string = $platform->profile
720    
721 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_PROFILE> and returns the result.
722 root 1.20
723     =item $string = $platform->version
724    
725 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_VERSION> and returns the result.
726 root 1.20
727     =item $string = $platform->name
728    
729 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_NAME> and returns the result.
730 root 1.20
731     =item $string = $platform->vendor
732    
733 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_VENDOR> and returns the result.
734 root 1.20
735     =item $string = $platform->extensions
736    
737 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_EXTENSIONS> and returns the result.
738 root 1.21
739 root 1.20 =for gengetinfo end platform
740    
741 root 1.5 =back
742    
743     =head2 THE OpenCL::Device CLASS
744    
745     =over 4
746    
747     =item $packed_value = $device->info ($name)
748    
749     See C<< $platform->info >> for details.
750    
751     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetDeviceInfo.html>
752    
753 root 1.21 =for gengetinfo begin device
754    
755     =item $device_type = $device->type
756    
757 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_TYPE> and returns the result.
758 root 1.21
759     =item $uint = $device->vendor_id
760    
761 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_VENDOR_ID> and returns the result.
762 root 1.21
763     =item $uint = $device->max_compute_units
764    
765 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_COMPUTE_UNITS> and returns the result.
766 root 1.21
767     =item $uint = $device->max_work_item_dimensions
768    
769 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS> and returns the result.
770 root 1.21
771     =item $int = $device->max_work_group_size
772    
773 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WORK_GROUP_SIZE> and returns the result.
774 root 1.21
775     =item @ints = $device->max_work_item_sizes
776    
777 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WORK_ITEM_SIZES> and returns the result.
778 root 1.21
779     =item $uint = $device->preferred_vector_width_char
780    
781 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR> and returns the result.
782 root 1.21
783     =item $uint = $device->preferred_vector_width_short
784    
785 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT> and returns the result.
786 root 1.21
787     =item $uint = $device->preferred_vector_width_int
788    
789 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT> and returns the result.
790 root 1.21
791     =item $uint = $device->preferred_vector_width_long
792    
793 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG> and returns the result.
794 root 1.21
795     =item $uint = $device->preferred_vector_width_float
796    
797 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT> and returns the result.
798 root 1.21
799     =item $uint = $device->preferred_vector_width_double
800    
801 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE> and returns the result.
802 root 1.21
803     =item $uint = $device->max_clock_frequency
804    
805 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_CLOCK_FREQUENCY> and returns the result.
806 root 1.21
807     =item $bitfield = $device->address_bits
808    
809 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_ADDRESS_BITS> and returns the result.
810 root 1.21
811     =item $uint = $device->max_read_image_args
812    
813 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_READ_IMAGE_ARGS> and returns the result.
814 root 1.21
815     =item $uint = $device->max_write_image_args
816    
817 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WRITE_IMAGE_ARGS> and returns the result.
818 root 1.21
819     =item $ulong = $device->max_mem_alloc_size
820    
821 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_MEM_ALLOC_SIZE> and returns the result.
822 root 1.21
823     =item $int = $device->image2d_max_width
824    
825 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE2D_MAX_WIDTH> and returns the result.
826 root 1.21
827     =item $int = $device->image2d_max_height
828    
829 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE2D_MAX_HEIGHT> and returns the result.
830 root 1.21
831     =item $int = $device->image3d_max_width
832    
833 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE3D_MAX_WIDTH> and returns the result.
834 root 1.21
835     =item $int = $device->image3d_max_height
836    
837 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE3D_MAX_HEIGHT> and returns the result.
838 root 1.21
839     =item $int = $device->image3d_max_depth
840    
841 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE3D_MAX_DEPTH> and returns the result.
842 root 1.21
843     =item $uint = $device->image_support
844    
845 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE_SUPPORT> and returns the result.
846 root 1.21
847     =item $int = $device->max_parameter_size
848    
849 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_PARAMETER_SIZE> and returns the result.
850 root 1.21
851     =item $uint = $device->max_samplers
852    
853 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_SAMPLERS> and returns the result.
854 root 1.21
855     =item $uint = $device->mem_base_addr_align
856    
857 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MEM_BASE_ADDR_ALIGN> and returns the result.
858 root 1.21
859     =item $uint = $device->min_data_type_align_size
860    
861 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MIN_DATA_TYPE_ALIGN_SIZE> and returns the result.
862 root 1.21
863     =item $device_fp_config = $device->single_fp_config
864    
865 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_SINGLE_FP_CONFIG> and returns the result.
866 root 1.21
867     =item $device_mem_cache_type = $device->global_mem_cache_type
868    
869 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_CACHE_TYPE> and returns the result.
870 root 1.21
871     =item $uint = $device->global_mem_cacheline_size
872    
873 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE> and returns the result.
874 root 1.21
875     =item $ulong = $device->global_mem_cache_size
876    
877 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_CACHE_SIZE> and returns the result.
878 root 1.21
879     =item $ulong = $device->global_mem_size
880    
881 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_SIZE> and returns the result.
882 root 1.21
883     =item $ulong = $device->max_constant_buffer_size
884    
885 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE> and returns the result.
886 root 1.21
887     =item $uint = $device->max_constant_args
888    
889 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_CONSTANT_ARGS> and returns the result.
890 root 1.21
891     =item $device_local_mem_type = $device->local_mem_type
892    
893 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_LOCAL_MEM_TYPE> and returns the result.
894 root 1.21
895     =item $ulong = $device->local_mem_size
896    
897 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_LOCAL_MEM_SIZE> and returns the result.
898 root 1.21
899     =item $boolean = $device->error_correction_support
900    
901 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_ERROR_CORRECTION_SUPPORT> and returns the result.
902 root 1.21
903     =item $int = $device->profiling_timer_resolution
904    
905 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PROFILING_TIMER_RESOLUTION> and returns the result.
906 root 1.21
907     =item $boolean = $device->endian_little
908    
909 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_ENDIAN_LITTLE> and returns the result.
910 root 1.21
911     =item $boolean = $device->available
912    
913 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_AVAILABLE> and returns the result.
914 root 1.21
915     =item $boolean = $device->compiler_available
916    
917 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_COMPILER_AVAILABLE> and returns the result.
918 root 1.21
919     =item $device_exec_capabilities = $device->execution_capabilities
920    
921 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_EXECUTION_CAPABILITIES> and returns the result.
922 root 1.21
923     =item $command_queue_properties = $device->properties
924    
925 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_QUEUE_PROPERTIES> and returns the result.
926 root 1.21
927     =item $ = $device->platform
928    
929 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PLATFORM> and returns the result.
930 root 1.21
931     =item $string = $device->name
932    
933 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NAME> and returns the result.
934 root 1.21
935     =item $string = $device->vendor
936    
937 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_VENDOR> and returns the result.
938 root 1.21
939     =item $string = $device->driver_version
940    
941 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DRIVER_VERSION> and returns the result.
942 root 1.21
943     =item $string = $device->profile
944    
945 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PROFILE> and returns the result.
946 root 1.21
947     =item $string = $device->version
948    
949 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_VERSION> and returns the result.
950 root 1.21
951     =item $string = $device->extensions
952    
953 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_EXTENSIONS> and returns the result.
954 root 1.21
955     =item $uint = $device->preferred_vector_width_half
956    
957 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_HALF> and returns the result.
958 root 1.21
959     =item $uint = $device->native_vector_width_char
960    
961 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_CHAR> and returns the result.
962 root 1.21
963     =item $uint = $device->native_vector_width_short
964    
965 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_SHORT> and returns the result.
966 root 1.21
967     =item $uint = $device->native_vector_width_int
968    
969 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_INT> and returns the result.
970 root 1.21
971     =item $uint = $device->native_vector_width_long
972    
973 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_LONG> and returns the result.
974 root 1.21
975     =item $uint = $device->native_vector_width_float
976    
977 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_FLOAT> and returns the result.
978 root 1.21
979     =item $uint = $device->native_vector_width_double
980    
981 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_DOUBLE> and returns the result.
982 root 1.21
983     =item $uint = $device->native_vector_width_half
984    
985 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_HALF> and returns the result.
986 root 1.21
987     =item $device_fp_config = $device->double_fp_config
988    
989 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_DOUBLE_FP_CONFIG> and returns the result.
990 root 1.21
991     =item $device_fp_config = $device->half_fp_config
992    
993 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_HALF_FP_CONFIG> and returns the result.
994 root 1.21
995     =item $boolean = $device->host_unified_memory
996    
997 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_HOST_UNIFIED_MEMORY> and returns the result.
998 root 1.21
999     =item $device = $device->parent_device_ext
1000    
1001 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARENT_DEVICE_EXT> and returns the result.
1002 root 1.21
1003     =item @device_partition_property_exts = $device->partition_types_ext
1004    
1005 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARTITION_TYPES_EXT> and returns the result.
1006 root 1.21
1007     =item @device_partition_property_exts = $device->affinity_domains_ext
1008    
1009 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_AFFINITY_DOMAINS_EXT> and returns the result.
1010 root 1.21
1011 root 1.45 =item $uint = $device->reference_count_ext
1012 root 1.21
1013 root 1.48 Calls C<clGetDeviceInfo> with C<CL_DEVICE_REFERENCE_COUNT_EXT> and returns the result.
1014 root 1.21
1015     =item @device_partition_property_exts = $device->partition_style_ext
1016    
1017 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARTITION_STYLE_EXT> and returns the result.
1018 root 1.21
1019     =for gengetinfo end device
1020    
1021 root 1.5 =back
1022    
1023     =head2 THE OpenCL::Context CLASS
1024    
1025     =over 4
1026    
1027 root 1.51 =item $prog = $ctx->build_program ($program, $options = "")
1028    
1029     This convenience function tries to build the program on all devices in
1030     the context. If the build fails, then the function will C<croak> with the
1031     build log. Otherwise ti returns the program object.
1032    
1033     The C<$program> can either be a C<OpenCL::Program> object or a string
1034     containing the program. In the latter case, a program objetc will be
1035     created automatically.
1036    
1037     =cut
1038    
1039     sub OpenCL::Context::build_program {
1040     my ($self, $prog, $options) = @_;
1041    
1042     $prog = $self->program_with_source ($prog)
1043     unless ref $prog;
1044    
1045 root 1.61 eval { $prog->build (undef, $options); 1 }
1046     or errno == BUILD_PROGRAM_FAILURE
1047 root 1.63 or errno == INVALID_BINARY # workaround nvidia bug
1048 root 1.61 or Carp::croak "OpenCL::Context->build_program: " . err2str;
1049    
1050     # we check status for all devices
1051 root 1.51 for my $dev ($self->devices) {
1052 root 1.61 $prog->build_status ($dev) == BUILD_SUCCESS
1053 root 1.62 or Carp::croak "Building OpenCL program for device '" . $dev->name . "' failed:\n"
1054     . $prog->build_log ($dev);
1055 root 1.51 }
1056    
1057     $prog
1058     }
1059    
1060 root 1.9 =item $queue = $ctx->queue ($device, $properties)
1061 root 1.5
1062 root 1.9 Create a new OpenCL::Queue object from the context and the given device.
1063 root 1.5
1064     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateCommandQueue.html>
1065    
1066 root 1.45 Example: create an out-of-order queue.
1067    
1068     $queue = $ctx->queue ($device, OpenCL::QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE);
1069    
1070 root 1.5 =item $ev = $ctx->user_event
1071    
1072     Creates a new OpenCL::UserEvent object.
1073    
1074     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateUserEvent.html>
1075    
1076     =item $buf = $ctx->buffer ($flags, $len)
1077    
1078 root 1.27 Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object with the
1079     given flags and octet-size.
1080 root 1.5
1081     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateBuffer.html>
1082    
1083     =item $buf = $ctx->buffer_sv ($flags, $data)
1084    
1085 root 1.27 Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object and
1086     initialise it with the given data values.
1087 root 1.5
1088 root 1.59 =item $img = $ctx->image ($self, $flags, $channel_order, $channel_type, $type, $width, $height, $depth = 0, $array_size = 0, $row_pitch = 0, $slice_pitch = 0, $num_mip_level = 0, $num_samples = 0, $*data = &PL_sv_undef)
1089 root 1.49
1090     Creates a new OpenCL::Image object and optionally initialises it with
1091     the given data values.
1092    
1093     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clCreateImage.html>
1094    
1095 root 1.18 =item $img = $ctx->image2d ($flags, $channel_order, $channel_type, $width, $height, $row_pitch = 0, $data = undef)
1096 root 1.5
1097 root 1.27 Creates a new OpenCL::Image2D object and optionally initialises it with
1098     the given data values.
1099 root 1.5
1100     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateImage2D.html>
1101    
1102 root 1.18 =item $img = $ctx->image3d ($flags, $channel_order, $channel_type, $width, $height, $depth, $row_pitch = 0, $slice_pitch = 0, $data = undef)
1103 root 1.5
1104 root 1.27 Creates a new OpenCL::Image3D object and optionally initialises it with
1105     the given data values.
1106 root 1.5
1107     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateImage3D.html>
1108    
1109 root 1.33 =item $buffer = $ctx->gl_buffer ($flags, $bufobj)
1110    
1111     Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object that refers to the given
1112     OpenGL buffer object.
1113    
1114     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLBuffer.html
1115    
1116 root 1.47 =item $img = $ctx->gl_texture ($flags, $target, $miplevel, $texture)
1117    
1118     Creates a new OpenCL::Image object that refers to the given OpenGL
1119     texture object or buffer.
1120    
1121     http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clCreateFromGLTexture.html
1122    
1123     =item $img = $ctx->gl_texture2d ($flags, $target, $miplevel, $texture)
1124 root 1.33
1125     Creates a new OpenCL::Image2D object that refers to the given OpenGL
1126     2D texture object.
1127    
1128     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture2D.html
1129    
1130 root 1.47 =item $img = $ctx->gl_texture3d ($flags, $target, $miplevel, $texture)
1131 root 1.33
1132     Creates a new OpenCL::Image3D object that refers to the given OpenGL
1133     3D texture object.
1134    
1135     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture3D.html
1136    
1137     =item $ctx->gl_renderbuffer ($flags, $renderbuffer)
1138    
1139     Creates a new OpenCL::Image2D object that refers to the given OpenGL
1140     render buffer.
1141    
1142     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLRenderbuffer.html
1143    
1144 root 1.5 =item @formats = $ctx->supported_image_formats ($flags, $image_type)
1145    
1146     Returns a list of matching image formats - each format is an arrayref with
1147     two values, $channel_order and $channel_type, in it.
1148    
1149     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetSupportedImageFormats.html>
1150    
1151     =item $sampler = $ctx->sampler ($normalized_coords, $addressing_mode, $filter_mode)
1152    
1153     Creates a new OpenCL::Sampler object.
1154    
1155     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateSampler.html>
1156    
1157     =item $program = $ctx->program_with_source ($string)
1158    
1159     Creates a new OpenCL::Program object from the given source code.
1160    
1161     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateProgramWithSource.html>
1162    
1163 root 1.69 =item ($program, \@status) = $ctx->program_with_binary (\@devices, \@binaries)
1164    
1165     Creates a new OpenCL::Program object from the given binaries.
1166    
1167     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateProgramWithBinary.html>
1168    
1169     Example: clone an existing program object that contains a successfully
1170     compiled program, no matter how useless this is.
1171    
1172     my $clone = $ctx->program_with_binary ([$prog->devices], [$prog->binaries]);
1173    
1174 root 1.20 =item $packed_value = $ctx->info ($name)
1175    
1176     See C<< $platform->info >> for details.
1177    
1178     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetContextInfo.html>
1179    
1180     =for gengetinfo begin context
1181    
1182 root 1.21 =item $uint = $context->reference_count
1183    
1184 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_REFERENCE_COUNT> and returns the result.
1185 root 1.21
1186     =item @devices = $context->devices
1187    
1188 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_DEVICES> and returns the result.
1189 root 1.21
1190     =item @property_ints = $context->properties
1191    
1192 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_PROPERTIES> and returns the result.
1193 root 1.21
1194     =item $uint = $context->num_devices
1195    
1196 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_NUM_DEVICES> and returns the result.
1197 root 1.21
1198 root 1.20 =for gengetinfo end context
1199    
1200 root 1.5 =back
1201    
1202     =head2 THE OpenCL::Queue CLASS
1203    
1204     An OpenCL::Queue represents an execution queue for OpenCL. You execute
1205 root 1.59 requests by calling their respective method and waiting for it to complete
1206     in some way.
1207 root 1.5
1208 root 1.59 Most methods that enqueue some request return an event object that can
1209     be used to wait for completion (optionally using a callback), unless
1210     the method is called in void context, in which case no event object is
1211     created.
1212 root 1.5
1213     They also allow you to specify any number of other event objects that this
1214     request has to wait for before it starts executing, by simply passing the
1215 root 1.45 event objects as extra parameters to the enqueue methods. To simplify
1216     program design, this module ignores any C<undef> values in the list of
1217     events. This makes it possible to code operations such as this, without
1218     having to put a valid event object into C<$event> first:
1219    
1220 root 1.59 $event = $queue->xxx (..., $event);
1221 root 1.5
1222     Queues execute in-order by default, without any parallelism, so in most
1223 root 1.6 cases (i.e. you use only one queue) it's not necessary to wait for or
1224 root 1.45 create event objects, althoguh an our of order queue is often a bit
1225     faster.
1226 root 1.5
1227     =over 4
1228    
1229 root 1.59 =item $ev = $queue->read_buffer ($buffer, $blocking, $offset, $len, $data, $wait_events...)
1230 root 1.5
1231     Reads data from buffer into the given string.
1232    
1233     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReadBuffer.html>
1234    
1235 root 1.59 =item $ev = $queue->write_buffer ($buffer, $blocking, $offset, $data, $wait_events...)
1236 root 1.5
1237     Writes data to buffer from the given string.
1238    
1239     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWriteBuffer.html>
1240    
1241 root 1.59 =item $ev = $queue->copy_buffer ($src, $dst, $src_offset, $dst_offset, $len, $wait_events...)
1242 root 1.5
1243     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBuffer.html>
1244    
1245 root 1.59 =item $ev = $queue->read_buffer_rect (OpenCL::Memory buf, cl_bool blocking, $buf_x, $buf_y, $buf_z, $host_x, $host_y, $host_z, $width, $height, $depth, $buf_row_pitch, $buf_slice_pitch, $host_row_pitch, $host_slice_pitch, $data, $wait_events...)
1246 root 1.25
1247     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReadBufferRect.html
1248    
1249 root 1.59 =item $ev = $queue->write_buffer_rect (OpenCL::Memory buf, cl_bool blocking, $buf_x, $buf_y, $buf_z, $host_x, $host_y, $host_z, $width, $height, $depth, $buf_row_pitch, $buf_slice_pitch, $host_row_pitch, $host_slice_pitch, $data, $wait_events...)
1250 root 1.25
1251     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWriteBufferRect.html
1252    
1253 root 1.65 =item $ev = $queue->copy_buffer_to_image ($src_buffer, $dst_image, $src_offset, $dst_x, $dst_y, $dst_z, $width, $height, $depth, $wait_events...)
1254    
1255     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBufferToImage.html>
1256    
1257 root 1.59 =item $ev = $queue->read_image ($src, $blocking, $x, $y, $z, $width, $height, $depth, $row_pitch, $slice_pitch, $data, $wait_events...)
1258 root 1.5
1259 root 1.65 C<$row_pitch> (and C<$slice_pitch>) can be C<0>, in which case the OpenCL
1260     module uses the image width (and height) to supply default values.
1261 root 1.27
1262 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReadImage.html>
1263    
1264 root 1.59 =item $ev = $queue->write_image ($src, $blocking, $x, $y, $z, $width, $height, $depth, $row_pitch, $slice_pitch, $data, $wait_events...)
1265 root 1.5
1266 root 1.65 C<$row_pitch> (and C<$slice_pitch>) can be C<0>, in which case the OpenCL
1267     module uses the image width (and height) to supply default values.
1268 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWriteImage.html>
1269    
1270 root 1.59 =item $ev = $queue->copy_image ($src_image, $dst_image, $src_x, $src_y, $src_z, $dst_x, $dst_y, $dst_z, $width, $height, $depth, $wait_events...)
1271 root 1.5
1272     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyImage.html>
1273    
1274 root 1.59 =item $ev = $queue->copy_image_to_buffer ($src_image, $dst_image, $src_x, $src_y, $src_z, $width, $height, $depth, $dst_offset, $wait_events...)
1275 root 1.5
1276     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyImageToBuffer.html>
1277    
1278 root 1.59 =item $ev = $queue->copy_buffer_rect ($src, $dst, $src_x, $src_y, $src_z, $dst_x, $dst_y, $dst_z, $width, $height, $depth, $src_row_pitch, $src_slice_pitch, $dst_row_pitch, $dst_slice_pitch, $wait_event...)
1279 root 1.27
1280     Yeah.
1281    
1282     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBufferToImage.html>.
1283    
1284 root 1.59 =item $ev = $queue->fill_buffer ($mem, $pattern, $offset, $size, ...)
1285 root 1.52
1286     Fills the given buffer object with repeated applications of C<$pattern>,
1287     starting at C<$offset> for C<$size> octets.
1288    
1289     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueFillBuffer.html>
1290    
1291 root 1.59 =item $ev = $queue->fill_image ($img, $r, $g, $b, $a, $x, $y, $z, $width, $height, $depth, ...)
1292 root 1.52
1293     Fills the given image area with the given rgba colour components. The
1294     components are normally floating point values between C<0> and C<1>,
1295     except when the image channel data type is a signe dor unsigned
1296     unnormalised format, in which case the range is determined by the format.
1297    
1298     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueFillImage.html>
1299    
1300 root 1.59 =item $ev = $queue->task ($kernel, $wait_events...)
1301 root 1.5
1302     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueTask.html>
1303    
1304 root 1.59 =item $ev = $queue->nd_range_kernel ($kernel, \@global_work_offset, \@global_work_size, \@local_work_size, $wait_events...)
1305 root 1.5
1306     Enqueues a kernel execution.
1307    
1308 root 1.57 \@global_work_size must be specified as a reference to an array of
1309 root 1.5 integers specifying the work sizes (element counts).
1310    
1311 root 1.57 \@global_work_offset must be either C<undef> (in which case all offsets
1312 root 1.5 are C<0>), or a reference to an array of work offsets, with the same number
1313 root 1.57 of elements as \@global_work_size.
1314 root 1.5
1315 root 1.57 \@local_work_size must be either C<undef> (in which case the
1316 root 1.5 implementation is supposed to choose good local work sizes), or a
1317     reference to an array of local work sizes, with the same number of
1318 root 1.57 elements as \@global_work_size.
1319 root 1.5
1320     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueNDRangeKernel.html>
1321    
1322 root 1.59 =item $ev = $queue->acquire_gl_objects ([object, ...], $wait_events...)
1323 root 1.35
1324     Enqueues a list (an array-ref of OpenCL::Memory objects) to be acquired
1325     for subsequent OpenCL usage.
1326    
1327     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueAcquireGLObjects.html>
1328    
1329 root 1.59 =item $ev = $queue->release_gl_objects ([object, ...], $wait_events...)
1330 root 1.35
1331     Enqueues a list (an array-ref of OpenCL::Memory objects) to be released
1332     for subsequent OpenGL usage.
1333    
1334     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReleaseGLObjects.html>
1335    
1336 root 1.59 =item $ev = $queue->wait_for_events ($wait_events...)
1337 root 1.5
1338     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWaitForEvents.html>
1339    
1340 root 1.59 =item $ev = $queue->marker ($wait_events...)
1341 root 1.46
1342     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueMarkerWithWaitList.html>
1343    
1344 root 1.59 =item $ev = $queue->barrier ($wait_events...)
1345 root 1.5
1346 root 1.46 L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueBarrierWithWaitList.html>
1347 root 1.5
1348     =item $queue->flush
1349    
1350     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clFlush.html>
1351    
1352     =item $queue->finish
1353    
1354     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clFinish.html>
1355    
1356 root 1.21 =item $packed_value = $queue->info ($name)
1357    
1358     See C<< $platform->info >> for details.
1359    
1360     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetCommandQueueInfo.html>
1361    
1362     =for gengetinfo begin command_queue
1363    
1364     =item $ctx = $command_queue->context
1365    
1366 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_CONTEXT> and returns the result.
1367 root 1.21
1368     =item $device = $command_queue->device
1369    
1370 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_DEVICE> and returns the result.
1371 root 1.21
1372     =item $uint = $command_queue->reference_count
1373    
1374 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_REFERENCE_COUNT> and returns the result.
1375 root 1.21
1376     =item $command_queue_properties = $command_queue->properties
1377    
1378 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_PROPERTIES> and returns the result.
1379 root 1.21
1380     =for gengetinfo end command_queue
1381    
1382 root 1.5 =back
1383    
1384 root 1.66 =head3 MEMORY MAPPED BUFFERS
1385    
1386     OpenCL allows you to map buffers and images to host memory (read: perl
1387     scalars). This is done much like reading or copying a buffer, by enqueuing
1388     a map or unmap operation on the command queue.
1389    
1390 root 1.69 The map operations return an C<OpenCL::Mapped> object - see L<THE
1391 root 1.66 OpenCL::Mapped CLASS> section for details on what to do with these
1392     objects.
1393    
1394     The object will be unmapped automatically when the mapped object is
1395     destroyed (you can use a barrier to make sure the unmap has finished,
1396     before using the buffer in a kernel), but you can also enqueue an unmap
1397     operation manually.
1398    
1399     =over 4
1400    
1401 root 1.69 =item $mapped_buffer = $queue->map_buffer ($buf, $blocking=1, $map_flags=OpenCL::MAP_READ|OpenCL::MAP_WRITE, $offset=0, $size=undef, $wait_events...)
1402 root 1.66
1403 root 1.69 Maps the given buffer into host memory and returns an
1404     C<OpenCL::MappedBuffer> object. If C<$size> is specified as undef, then
1405     the map will extend to the end of the buffer.
1406 root 1.66
1407     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueMapBuffer.html>
1408    
1409 root 1.69 Example: map the buffer $buf fully and replace the first 4 bytes by "abcd", then unmap.
1410    
1411     {
1412     my $mapped = $queue->map_buffer ($buf, 1, OpenCL::MAP_WRITE);
1413     substr $$mapped, 0, 4, "abcd";
1414     } # asynchronously unmap because $mapped is destroyed
1415 root 1.66
1416 root 1.69 =item $mapped_image = $queue->map_image ($img, $blocking=1, $map_flags=OpenCL::MAP_READ|OpenCL::MAP_WRITE, $x=0, $y=0, $z=0, $width=undef, $height=undef, $depth=undef, $wait_events...)
1417    
1418     Maps the given image area into host memory and return an
1419     C<OpenCL::MappedImage> object.
1420    
1421     If any of C<$width>, C<$height> and/or C<$depth> are C<undef> then they
1422     will be replaced by the maximum possible value.
1423 root 1.66
1424     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueMapImage.html>
1425    
1426 root 1.69 Example: map an image (with OpenCL::UNSIGNED_INT8 channel type) and set
1427     the first channel of the leftmost column to 5, then explicitly unmap
1428     it. You are not necessarily meant to do it this way, this example just
1429     shows you the accessors to use :)
1430    
1431     my $mapped = $queue->map_image ($image, 1, OpenCL::MAP_WRITE);
1432    
1433     $mapped->set ($_ * $mapped->row_pitch, pack "C", 5)
1434     for 0..$image->height;
1435    
1436     $mapped->unmap;.
1437     $mapped->wait; # only needed for out of order queues normally
1438    
1439 root 1.66 =item $ev = $queue->unmap ($mapped, $wait_events...)
1440    
1441     Unmaps the data from host memory. You must not call any methods that
1442     modify the data, or modify the data scalar directly, after calling this
1443     method.
1444    
1445     The mapped event object will always be passed as part of the
1446     $wait_events. The mapped event object will be replaced by the new event
1447     object that this request creates.
1448    
1449     =back
1450    
1451 root 1.5 =head2 THE OpenCL::Memory CLASS
1452    
1453     This the superclass of all memory objects - OpenCL::Buffer, OpenCL::Image,
1454 root 1.21 OpenCL::Image2D and OpenCL::Image3D.
1455 root 1.5
1456     =over 4
1457    
1458     =item $packed_value = $memory->info ($name)
1459    
1460     See C<< $platform->info >> for details.
1461    
1462     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetMemObjectInfo.html>
1463    
1464 root 1.21 =for gengetinfo begin mem
1465    
1466     =item $mem_object_type = $mem->type
1467    
1468 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_TYPE> and returns the result.
1469 root 1.21
1470     =item $mem_flags = $mem->flags
1471    
1472 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_FLAGS> and returns the result.
1473 root 1.21
1474     =item $int = $mem->size
1475    
1476 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_SIZE> and returns the result.
1477 root 1.21
1478     =item $ptr_value = $mem->host_ptr
1479    
1480 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_HOST_PTR> and returns the result.
1481 root 1.21
1482     =item $uint = $mem->map_count
1483    
1484 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_MAP_COUNT> and returns the result.
1485 root 1.21
1486     =item $uint = $mem->reference_count
1487    
1488 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_REFERENCE_COUNT> and returns the result.
1489 root 1.21
1490     =item $ctx = $mem->context
1491    
1492 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_CONTEXT> and returns the result.
1493 root 1.21
1494     =item $mem = $mem->associated_memobject
1495    
1496 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_ASSOCIATED_MEMOBJECT> and returns the result.
1497 root 1.21
1498     =item $int = $mem->offset
1499    
1500 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_OFFSET> and returns the result.
1501 root 1.21
1502     =for gengetinfo end mem
1503    
1504 root 1.34 =item ($type, $name) = $mem->gl_object_info
1505    
1506     Returns the OpenGL object type (e.g. OpenCL::GL_OBJECT_TEXTURE2D) and the
1507     object "name" (e.g. the texture name) used to create this memory object.
1508    
1509     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetGLObjectInfo.html>
1510    
1511 root 1.5 =back
1512    
1513 root 1.27 =head2 THE OpenCL::Buffer CLASS
1514    
1515     This is a subclass of OpenCL::Memory, and the superclass of
1516     OpenCL::BufferObj. Its purpose is simply to distinguish between buffers
1517     and sub-buffers.
1518    
1519     =head2 THE OpenCL::BufferObj CLASS
1520    
1521     This is a subclass of OpenCL::Buffer and thus OpenCL::Memory. It exists
1522     because one cna create sub buffers of OpenLC::BufferObj objects, but not
1523     sub buffers from these sub buffers.
1524    
1525     =over 4
1526    
1527     =item $subbuf = $buf_obj->sub_buffer_region ($flags, $origin, $size)
1528    
1529     Creates an OpenCL::Buffer objects from this buffer and returns it. The
1530     C<buffer_create_type> is assumed to be C<CL_BUFFER_CREATE_TYPE_REGION>.
1531    
1532     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateSubBuffer.html>
1533    
1534     =back
1535    
1536 root 1.20 =head2 THE OpenCL::Image CLASS
1537    
1538 root 1.47 This is the superclass of all image objects - OpenCL::Image1D,
1539     OpenCL::Image1DArray, OpenCL::Image1DBuffer, OpenCL::Image2D,
1540     OpenCL::Image2DArray and OpenCL::Image3D.
1541 root 1.20
1542     =over 4
1543    
1544 root 1.53 =item $packed_value = $image->image_info ($name)
1545 root 1.20
1546     See C<< $platform->info >> for details.
1547    
1548     The reason this method is not called C<info> is that there already is an
1549     C<< ->info >> method inherited from C<OpenCL::Memory>.
1550    
1551     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetImageInfo.html>
1552    
1553 root 1.53 =item ($channel_order, $channel_data_type) = $image->format
1554    
1555     Returns the channel order and type used to create the image by calling
1556     C<clGetImageInfo> with C<CL_IMAGE_FORMAT>.
1557    
1558 root 1.21 =for gengetinfo begin image
1559    
1560     =item $int = $image->element_size
1561    
1562 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_ELEMENT_SIZE> and returns the result.
1563 root 1.21
1564     =item $int = $image->row_pitch
1565    
1566 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_ROW_PITCH> and returns the result.
1567 root 1.21
1568     =item $int = $image->slice_pitch
1569    
1570 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_SLICE_PITCH> and returns the result.
1571 root 1.21
1572     =item $int = $image->width
1573    
1574 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_WIDTH> and returns the result.
1575 root 1.21
1576     =item $int = $image->height
1577    
1578 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_HEIGHT> and returns the result.
1579 root 1.21
1580     =item $int = $image->depth
1581    
1582 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_DEPTH> and returns the result.
1583 root 1.21
1584     =for gengetinfo end image
1585    
1586 root 1.34 =for gengetinfo begin gl_texture
1587    
1588     =item $GLenum = $gl_texture->target
1589    
1590 root 1.37 Calls C<clGetGLTextureInfo> with C<CL_GL_TEXTURE_TARGET> and returns the result.
1591 root 1.34
1592     =item $GLint = $gl_texture->gl_mipmap_level
1593    
1594 root 1.37 Calls C<clGetGLTextureInfo> with C<CL_GL_MIPMAP_LEVEL> and returns the result.
1595 root 1.34
1596     =for gengetinfo end gl_texture
1597    
1598 root 1.20 =back
1599    
1600 root 1.5 =head2 THE OpenCL::Sampler CLASS
1601    
1602     =over 4
1603    
1604     =item $packed_value = $sampler->info ($name)
1605    
1606     See C<< $platform->info >> for details.
1607    
1608     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetSamplerInfo.html>
1609    
1610 root 1.21 =for gengetinfo begin sampler
1611    
1612     =item $uint = $sampler->reference_count
1613    
1614 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_REFERENCE_COUNT> and returns the result.
1615 root 1.21
1616     =item $ctx = $sampler->context
1617    
1618 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_CONTEXT> and returns the result.
1619 root 1.21
1620     =item $addressing_mode = $sampler->normalized_coords
1621    
1622 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_NORMALIZED_COORDS> and returns the result.
1623 root 1.21
1624     =item $filter_mode = $sampler->addressing_mode
1625    
1626 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_ADDRESSING_MODE> and returns the result.
1627 root 1.21
1628     =item $boolean = $sampler->filter_mode
1629    
1630 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_FILTER_MODE> and returns the result.
1631 root 1.21
1632     =for gengetinfo end sampler
1633    
1634 root 1.5 =back
1635    
1636     =head2 THE OpenCL::Program CLASS
1637    
1638     =over 4
1639    
1640 root 1.55 =item $program->build (\@devices = undef, $options = "", $cb->($program) = undef)
1641 root 1.5
1642 root 1.51 Tries to build the program with the given options. See also the
1643     C<$ctx->build> convenience function.
1644 root 1.5
1645 root 1.55 If a callback is specified, then it will be called when compilation is
1646     finished. Note that many OpenCL implementations block your program while
1647     compiling whether you use a callback or not. See C<build_async> if you
1648     want to make sure the build is done in the background.
1649    
1650 root 1.63 Note that some OpenCL implementations act up badly, and don't call the
1651 root 1.55 callback in some error cases (but call it in others). This implementation
1652     assumes the callback will always be called, and leaks memory if this is
1653     not so. So best make sure you don't pass in invalid values.
1654    
1655 root 1.63 Some implementations fail with C<OpenCL::INVALID_BINARY> when the
1656     compilation state is successful but some later stage fails.
1657    
1658 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clBuildProgram.html>
1659    
1660 root 1.55 =item $program->build_async (\@devices = undef, $options = "", $cb->($program) = undef)
1661    
1662     Similar to C<< ->build >>, except it starts a thread, and never fails (you
1663     need to check the compilation status form the callback, or by polling).
1664    
1665 root 1.5 =item $packed_value = $program->build_info ($device, $name)
1666    
1667     Similar to C<< $platform->info >>, but returns build info for a previous
1668     build attempt for the given device.
1669    
1670     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetBuildInfo.html>
1671    
1672     =item $kernel = $program->kernel ($function_name)
1673    
1674     Creates an OpenCL::Kernel object out of the named C<__kernel> function in
1675     the program.
1676    
1677     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateKernel.html>
1678    
1679 root 1.50 =item @kernels = $program->kernels_in_program
1680    
1681     Returns all kernels successfully compiled for all devices in program.
1682    
1683     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateKernelsInProgram.html
1684    
1685 root 1.21 =for gengetinfo begin program_build
1686    
1687     =item $build_status = $program->build_status ($device)
1688    
1689 root 1.24 Calls C<clGetProgramBuildInfo> with C<CL_PROGRAM_BUILD_STATUS> and returns the result.
1690 root 1.21
1691     =item $string = $program->build_options ($device)
1692    
1693 root 1.24 Calls C<clGetProgramBuildInfo> with C<CL_PROGRAM_BUILD_OPTIONS> and returns the result.
1694 root 1.21
1695     =item $string = $program->build_log ($device)
1696    
1697 root 1.24 Calls C<clGetProgramBuildInfo> with C<CL_PROGRAM_BUILD_LOG> and returns the result.
1698 root 1.21
1699     =for gengetinfo end program_build
1700    
1701     =item $packed_value = $program->info ($name)
1702    
1703     See C<< $platform->info >> for details.
1704    
1705     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetProgramInfo.html>
1706    
1707     =for gengetinfo begin program
1708    
1709     =item $uint = $program->reference_count
1710    
1711 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_REFERENCE_COUNT> and returns the result.
1712 root 1.21
1713     =item $ctx = $program->context
1714    
1715 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_CONTEXT> and returns the result.
1716 root 1.21
1717     =item $uint = $program->num_devices
1718    
1719 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_NUM_DEVICES> and returns the result.
1720 root 1.21
1721     =item @devices = $program->devices
1722    
1723 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_DEVICES> and returns the result.
1724 root 1.21
1725     =item $string = $program->source
1726    
1727 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_SOURCE> and returns the result.
1728 root 1.21
1729     =item @ints = $program->binary_sizes
1730    
1731 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_BINARY_SIZES> and returns the result.
1732 root 1.21
1733     =for gengetinfo end program
1734    
1735 root 1.23 =item @blobs = $program->binaries
1736    
1737     Returns a string for the compiled binary for every device associated with
1738     the program, empty strings indicate missing programs, and an empty result
1739     means no program binaries are available.
1740    
1741     These "binaries" are often, in fact, informative low-level assembly
1742     sources.
1743    
1744     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetProgramInfo.html>
1745    
1746 root 1.5 =back
1747    
1748     =head2 THE OpenCL::Kernel CLASS
1749    
1750     =over 4
1751    
1752     =item $packed_value = $kernel->info ($name)
1753    
1754     See C<< $platform->info >> for details.
1755    
1756     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetKernelInfo.html>
1757    
1758 root 1.21 =for gengetinfo begin kernel
1759    
1760     =item $string = $kernel->function_name
1761    
1762 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_FUNCTION_NAME> and returns the result.
1763 root 1.21
1764     =item $uint = $kernel->num_args
1765    
1766 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_NUM_ARGS> and returns the result.
1767 root 1.21
1768     =item $uint = $kernel->reference_count
1769    
1770 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_REFERENCE_COUNT> and returns the result.
1771 root 1.21
1772     =item $ctx = $kernel->context
1773    
1774 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_CONTEXT> and returns the result.
1775 root 1.21
1776     =item $program = $kernel->program
1777    
1778 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_PROGRAM> and returns the result.
1779 root 1.21
1780     =for gengetinfo end kernel
1781    
1782 root 1.20 =item $packed_value = $kernel->work_group_info ($device, $name)
1783    
1784     See C<< $platform->info >> for details.
1785    
1786     The reason this method is not called C<info> is that there already is an
1787     C<< ->info >> method.
1788    
1789     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetKernelWorkGroupInfo.html>
1790    
1791 root 1.21 =for gengetinfo begin kernel_work_group
1792    
1793     =item $int = $kernel->work_group_size ($device)
1794    
1795 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_WORK_GROUP_SIZE> and returns the result.
1796 root 1.21
1797     =item @ints = $kernel->compile_work_group_size ($device)
1798    
1799 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_COMPILE_WORK_GROUP_SIZE> and returns the result.
1800 root 1.21
1801     =item $ulong = $kernel->local_mem_size ($device)
1802    
1803 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_LOCAL_MEM_SIZE> and returns the result.
1804 root 1.21
1805     =item $int = $kernel->preferred_work_group_size_multiple ($device)
1806    
1807 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE> and returns the result.
1808 root 1.21
1809     =item $ulong = $kernel->private_mem_size ($device)
1810    
1811 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_PRIVATE_MEM_SIZE> and returns the result.
1812 root 1.21
1813     =for gengetinfo end kernel_work_group
1814    
1815 root 1.60 =item $kernel->setf ($format, ...)
1816    
1817     Sets the arguments of a kernel. Since OpenCL 1.1 doesn't have a generic
1818     way to set arguments (and with OpenCL 1.2 it might be rather slow), you
1819     need to specify a format argument, much as with C<printf>, to tell OpenCL
1820     what type of argument it is.
1821    
1822     The format arguments are single letters:
1823    
1824     c char
1825     C unsigned char
1826     s short
1827     S unsigned short
1828     i int
1829     I unsigned int
1830     l long
1831     L unsigned long
1832    
1833     h half float (0..65535)
1834     f float
1835     d double
1836    
1837     z local (octet size)
1838    
1839     m memory object (buffer or image)
1840     a sampler
1841     e event
1842    
1843     Space characters in the format string are ignored.
1844    
1845     Example: set the arguments for a kernel that expects an int, two floats, a buffer and an image.
1846    
1847     $kernel->setf ("i ff mm", 5, 0.5, 3, $buffer, $image);
1848    
1849 root 1.58 =item $kernel->set_TYPE ($index, $value)
1850 root 1.5
1851 root 1.58 =item $kernel->set_char ($index, $value)
1852 root 1.5
1853 root 1.58 =item $kernel->set_uchar ($index, $value)
1854    
1855     =item $kernel->set_short ($index, $value)
1856    
1857     =item $kernel->set_ushort ($index, $value)
1858    
1859     =item $kernel->set_int ($index, $value)
1860    
1861     =item $kernel->set_uint ($index, $value)
1862    
1863     =item $kernel->set_long ($index, $value)
1864    
1865     =item $kernel->set_ulong ($index, $value)
1866    
1867     =item $kernel->set_half ($index, $value)
1868    
1869     =item $kernel->set_float ($index, $value)
1870    
1871     =item $kernel->set_double ($index, $value)
1872    
1873     =item $kernel->set_memory ($index, $value)
1874    
1875     =item $kernel->set_buffer ($index, $value)
1876    
1877     =item $kernel->set_image ($index, $value)
1878    
1879     =item $kernel->set_sampler ($index, $value)
1880    
1881     =item $kernel->set_local ($index, $value)
1882    
1883     =item $kernel->set_event ($index, $value)
1884    
1885     This is a family of methods to set the kernel argument with the number
1886     C<$index> to the give C<$value>.
1887 root 1.5
1888     Chars and integers (including the half type) are specified as integers,
1889 root 1.58 float and double as floating point values, memory/buffer/image must be
1890     an object of that type or C<undef>, local-memory arguments are set by
1891     specifying the size, and sampler and event must be objects of that type.
1892    
1893     Note that C<set_memory> works for all memory objects (all types of buffers
1894     and images) - the main purpose of the more specific C<set_TYPE> functions
1895     is type checking.
1896 root 1.5
1897 root 1.50 Setting an argument for a kernel does NOT keep a reference to the object -
1898     for example, if you set an argument to some image object, free the image,
1899     and call the kernel, you will run into undefined behaviour.
1900    
1901 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetKernelArg.html>
1902    
1903     =back
1904    
1905     =head2 THE OpenCL::Event CLASS
1906    
1907     This is the superclass for all event objects (including OpenCL::UserEvent
1908     objects).
1909    
1910     =over 4
1911    
1912 root 1.21 =item $ev->wait
1913    
1914     Waits for the event to complete.
1915    
1916     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html>
1917    
1918 root 1.55 =item $ev->cb ($exec_callback_type, $callback->($event, $event_command_exec_status))
1919    
1920     Adds a callback to the callback stack for the given event type. There is
1921     no way to remove a callback again.
1922    
1923     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetEventCallback.html>
1924    
1925 root 1.5 =item $packed_value = $ev->info ($name)
1926    
1927     See C<< $platform->info >> for details.
1928    
1929     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetEventInfo.html>
1930    
1931 root 1.21 =for gengetinfo begin event
1932    
1933     =item $queue = $event->command_queue
1934    
1935 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_COMMAND_QUEUE> and returns the result.
1936 root 1.21
1937     =item $command_type = $event->command_type
1938    
1939 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_COMMAND_TYPE> and returns the result.
1940 root 1.21
1941     =item $uint = $event->reference_count
1942    
1943 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_REFERENCE_COUNT> and returns the result.
1944 root 1.21
1945     =item $uint = $event->command_execution_status
1946    
1947 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_COMMAND_EXECUTION_STATUS> and returns the result.
1948 root 1.21
1949     =item $ctx = $event->context
1950    
1951 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_CONTEXT> and returns the result.
1952 root 1.21
1953     =for gengetinfo end event
1954    
1955 root 1.20 =item $packed_value = $ev->profiling_info ($name)
1956    
1957     See C<< $platform->info >> for details.
1958    
1959     The reason this method is not called C<info> is that there already is an
1960     C<< ->info >> method.
1961    
1962     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetProfilingInfo.html>
1963    
1964 root 1.21 =for gengetinfo begin profiling
1965    
1966     =item $ulong = $event->profiling_command_queued
1967    
1968 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_QUEUED> and returns the result.
1969 root 1.21
1970     =item $ulong = $event->profiling_command_submit
1971    
1972 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_SUBMIT> and returns the result.
1973 root 1.21
1974     =item $ulong = $event->profiling_command_start
1975    
1976 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_START> and returns the result.
1977 root 1.21
1978     =item $ulong = $event->profiling_command_end
1979 root 1.5
1980 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_END> and returns the result.
1981 root 1.5
1982 root 1.21 =for gengetinfo end profiling
1983 root 1.5
1984     =back
1985    
1986     =head2 THE OpenCL::UserEvent CLASS
1987    
1988     This is a subclass of OpenCL::Event.
1989 root 1.4
1990 root 1.1 =over 4
1991    
1992 root 1.5 =item $ev->set_status ($execution_status)
1993    
1994 root 1.55 Sets the execution status of the user event. Can only be called once,
1995     either with OpenCL::COMPLETE or a negative number as status.
1996    
1997 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetUserEventStatus.html>
1998    
1999     =back
2000    
2001 root 1.66 =head2 THE OpenCL::Mapped CLASS
2002    
2003     This class represents objects mapped into host memory. They are
2004     represented by a blessed string scalar. The string data is the mapped
2005     memory area, that is, if you read or write it, then the mapped object is
2006     accessed directly.
2007    
2008     You must only ever use operations that modify the string in-place - for
2009     example, a C<substr> that doesn't change the length, or maybe a regex that
2010     doesn't change the length. Any other operation might cause the data to be
2011     copied.
2012    
2013     When the object is destroyed it will enqueue an implicit unmap operation
2014     on the queue that was used to create it.
2015    
2016 root 1.67 Keep in mind that you I<need> to unmap (or destroy) mapped objects before
2017     OpenCL sees the changes, even if some implementations don't need this
2018     sometimes.
2019    
2020 root 1.66 Example, replace the first two floats in the mapped buffer by 1 and 2.
2021    
2022     my $mapped = $queue->map_buffer ($buf, ...
2023     $mapped->event->wait; # make sure it's there
2024    
2025     # now replace first 8 bytes by new data, which is exactly 8 bytes long
2026     # we blindly assume device endianness to equal host endianness
2027     # (and of course, we assume iee 754 single precision floats :)
2028     substr $$mapped, 0, 8, pack "f*", 1, 2;
2029    
2030     =over 4
2031    
2032 root 1.67 =item $ev = $mapped->unmap ($wait_events...)
2033    
2034     Unmaps the mapped memory object, using the queue originally used to create
2035     it, quite similarly to C<< $queue->unmap ($mapped, ...) >>.
2036    
2037 root 1.66 =item $bool = $mapped->mapped
2038    
2039     Returns whether the object is still mapped - true before an C<unmap> is
2040     enqueued, false afterwards.
2041    
2042     =item $ev = $mapped->event
2043    
2044     Return the event object associated with the mapped object. Initially, this
2045     will be the event object created when mapping the object, and after an
2046     unmap, this will be the event object that the unmap operation created.
2047    
2048     =item $mapped->wait
2049    
2050     Same as C<< $mapped->event->wait >> - makes sure no operations on this
2051     mapped object are outstanding.
2052    
2053     =item $bytes = $mapped->size
2054    
2055     Returns the size of the mapped area, in bytes. Same as C<length $$mapped>.
2056    
2057     =item $ptr = $mapped->ptr
2058    
2059 root 1.68 Returns the raw memory address of the mapped area.
2060 root 1.66
2061 root 1.67 =item $mapped->set ($offset, $data)
2062    
2063     Replaces the data at the given C<$offset> in the memory area by the new
2064 root 1.68 C<$data>. This method is safer than direct manipulation of C<$mapped>
2065     because it does bounds-checking, but also slower.
2066 root 1.67
2067     =item $data = $mapped->get ($offset, $length)
2068    
2069     Returns (without copying) a scalar representing the data at the given
2070     C<$offset> and C<$length> in the mapped memory area. This is the same as
2071 root 1.68 the following substr, except much slower;
2072 root 1.67
2073     $data = substr $$mapped, $offset, $length
2074    
2075     =cut
2076    
2077 root 1.68 sub OpenCL::Mapped::get {
2078 root 1.67 substr ${$_[0]}, $_[1], $_[2]
2079     }
2080    
2081 root 1.66 =back
2082    
2083     =head2 THE OpenCL::MappedBuffer CLASS
2084    
2085     This is a subclass of OpenCL::Mapped, representing mapped buffers.
2086    
2087     =head2 THE OpenCL::MappedImage CLASS
2088    
2089     This is a subclass of OpenCL::Mapped, representing mapped images.
2090    
2091     =over 4
2092    
2093 root 1.67 =item $bytes = $mapped->row_pitch
2094    
2095     =item $bytes = $mapped->slice_pitch
2096    
2097     Return the row or slice pitch of the image that has been mapped.
2098    
2099 root 1.66 =back
2100    
2101    
2102 root 1.1 =cut
2103    
2104     1;
2105    
2106     =head1 AUTHOR
2107    
2108     Marc Lehmann <schmorp@schmorp.de>
2109     http://home.schmorp.de/
2110    
2111     =cut
2112