ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/OpenCL/OpenCL.pm
Revision: 1.64
Committed: Mon Apr 30 23:12:43 2012 UTC (12 years ago) by root
Branch: MAIN
Changes since 1.63: +16 -13 lines
Log Message:
*** empty log message ***

File Contents

# User Rev Content
1 root 1.1 =head1 NAME
2    
3 root 1.5 OpenCL - Open Computing Language Bindings
4 root 1.1
5     =head1 SYNOPSIS
6    
7     use OpenCL;
8    
9     =head1 DESCRIPTION
10    
11 root 1.7 This is an early release which might be useful, but hasn't seen much testing.
12 root 1.1
13 root 1.9 =head2 OpenCL FROM 10000 FEET HEIGHT
14    
15     Here is a high level overview of OpenCL:
16    
17     First you need to find one or more OpenCL::Platforms (kind of like
18     vendors) - usually there is only one.
19    
20     Each platform gives you access to a number of OpenCL::Device objects, e.g.
21     your graphics card.
22    
23 root 1.11 From a platform and some device(s), you create an OpenCL::Context, which is
24 root 1.9 a very central object in OpenCL: Once you have a context you can create
25     most other objects:
26    
27 root 1.11 OpenCL::Program objects, which store source code and, after building for a
28     specific device ("compiling and linking"), also binary programs. For each
29     kernel function in a program you can then create an OpenCL::Kernel object
30     which represents basically a function call with argument values.
31 root 1.9
32 root 1.20 OpenCL::Memory objects of various flavours: OpenCL::Buffer objects (flat
33 root 1.16 memory areas, think arrays or structs) and OpenCL::Image objects (think 2d
34     or 3d array) for bulk data and input and output for kernels.
35 root 1.9
36     OpenCL::Sampler objects, which are kind of like texture filter modes in
37     OpenGL.
38    
39     OpenCL::Queue objects - command queues, which allow you to submit memory
40     reads, writes and copies, as well as kernel calls to your devices. They
41     also offer a variety of methods to synchronise request execution, for
42     example with barriers or OpenCL::Event objects.
43    
44     OpenCL::Event objects are used to signal when something is complete.
45    
46     =head2 HELPFUL RESOURCES
47 root 1.3
48 root 1.57 The OpenCL specs used to develop this module:
49 root 1.3
50     http://www.khronos.org/registry/cl/specs/opencl-1.1.pdf
51 root 1.57 http://www.khronos.org/registry/cl/specs/opencl-1.2.pdf
52     http://www.khronos.org/registry/cl/specs/opencl-1.2-extensions.pdf
53 root 1.3
54     OpenCL manpages:
55    
56     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/
57 root 1.57 http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/
58 root 1.3
59 root 1.18 If you are into UML class diagrams, the following diagram might help - if
60 root 1.57 not, it will be mildly confusing (also, the class hierarchy of this module
61     is much more fine-grained):
62 root 1.18
63 root 1.57 http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/classDiagram.html
64 root 1.18
65 root 1.16 Here's a tutorial from AMD (very AMD-centric, too), not sure how useful it
66     is, but at least it's free of charge:
67    
68     http://developer.amd.com/zones/OpenCLZone/courses/Documents/Introduction_to_OpenCL_Programming%20Training_Guide%20%28201005%29.pdf
69    
70 root 1.18 And here's NVIDIA's OpenCL Best Practises Guide:
71 root 1.16
72 root 1.18 http://developer.download.nvidia.com/compute/cuda/3_2/toolkit/docs/OpenCL_Best_Practices_Guide.pdf
73 root 1.16
74 root 1.9 =head1 BASIC WORKFLOW
75    
76 root 1.11 To get something done, you basically have to do this once (refer to the
77     examples below for actual code, this is just a high-level description):
78 root 1.9
79 root 1.11 Find some platform (e.g. the first one) and some device(s) (e.g. the first
80     device of the platform), and create a context from those.
81 root 1.9
82 root 1.11 Create program objects from your OpenCL source code, then build (compile)
83     the programs for each device you want to run them on.
84 root 1.9
85 root 1.11 Create kernel objects for all kernels you want to use (surprisingly, these
86     are not device-specific).
87 root 1.9
88 root 1.11 Then, to execute stuff, you repeat these steps, possibly resuing or
89     sharing some buffers:
90 root 1.9
91 root 1.11 Create some input and output buffers from your context. Set these as
92     arguments to your kernel.
93    
94     Enqueue buffer writes to initialise your input buffers (when not
95     initialised at creation time).
96 root 1.9
97     Enqueue the kernel execution.
98    
99     Enqueue buffer reads for your output buffer to read results.
100    
101 root 1.3 =head1 EXAMPLES
102    
103 root 1.5 =head2 Enumerate all devices and get contexts for them.
104 root 1.1
105 root 1.11 Best run this once to get a feel for the platforms and devices in your
106     system.
107    
108 root 1.1 for my $platform (OpenCL::platforms) {
109 root 1.24 printf "platform: %s\n", $platform->name;
110     printf "extensions: %s\n", $platform->extensions;
111 root 1.1 for my $device ($platform->devices) {
112 root 1.24 printf "+ device: %s\n", $device->name;
113 root 1.29 my $ctx = $platform->context (undef, [$device]);
114 root 1.1 # do stuff
115     }
116     }
117    
118 root 1.5 =head2 Get a useful context and a command queue.
119 root 1.1
120 root 1.11 This is a useful boilerplate for any OpenCL program that only wants to use
121     one device,
122    
123     my ($platform) = OpenCL::platforms; # find first platform
124     my ($dev) = $platform->devices; # find first device of platform
125     my $ctx = $platform->context (undef, [$dev]); # create context out of those
126     my $queue = $ctx->queue ($dev); # create a command queue for the device
127 root 1.1
128 root 1.5 =head2 Print all supported image formats of a context.
129    
130 root 1.11 Best run this once for your context, to see whats available and how to
131     gather information.
132    
133 root 1.5 for my $type (OpenCL::MEM_OBJECT_IMAGE2D, OpenCL::MEM_OBJECT_IMAGE3D) {
134 root 1.10 print "supported image formats for ", OpenCL::enum2str $type, "\n";
135 root 1.5
136     for my $f ($ctx->supported_image_formats (0, $type)) {
137     printf " %-10s %-20s\n", OpenCL::enum2str $f->[0], OpenCL::enum2str $f->[1];
138     }
139     }
140    
141     =head2 Create a buffer with some predefined data, read it back synchronously,
142     then asynchronously.
143 root 1.3
144     my $buf = $ctx->buffer_sv (OpenCL::MEM_COPY_HOST_PTR, "helmut");
145    
146 root 1.59 $queue->read_buffer ($buf, 1, 1, 3, my $data);
147 root 1.10 print "$data\n";
148 root 1.3
149 root 1.59 my $ev = $queue->read_buffer ($buf, 0, 1, 3, my $data);
150 root 1.3 $ev->wait;
151 root 1.10 print "$data\n"; # prints "elm"
152 root 1.3
153 root 1.5 =head2 Create and build a program, then create a kernel out of one of its
154     functions.
155 root 1.3
156     my $src = '
157 root 1.31 kernel void
158     squareit (global float *input, global float *output)
159 root 1.3 {
160 root 1.15 $id = get_global_id (0);
161 root 1.3 output [id] = input [id] * input [id];
162     }
163     ';
164    
165 root 1.51 my $prog = $ctx->build_program ($src);
166 root 1.3 my $kernel = $prog->kernel ("squareit");
167    
168 root 1.11 =head2 Create some input and output float buffers, then call the
169     'squareit' kernel on them.
170 root 1.4
171     my $input = $ctx->buffer_sv (OpenCL::MEM_COPY_HOST_PTR, pack "f*", 1, 2, 3, 4.5);
172     my $output = $ctx->buffer (0, OpenCL::SIZEOF_FLOAT * 5);
173    
174     # set buffer
175     $kernel->set_buffer (0, $input);
176     $kernel->set_buffer (1, $output);
177    
178     # execute it for all 4 numbers
179 root 1.59 $queue->nd_range_kernel ($kernel, undef, [4], undef);
180 root 1.4
181 root 1.5 # enqueue a synchronous read
182 root 1.59 $queue->read_buffer ($output, 1, 0, OpenCL::SIZEOF_FLOAT * 4, my $data);
183 root 1.5
184     # print the results:
185 root 1.10 printf "%s\n", join ", ", unpack "f*", $data;
186 root 1.5
187     =head2 The same enqueue operations as before, but assuming an out-of-order queue,
188     showing off barriers.
189    
190     # execute it for all 4 numbers
191 root 1.59 $queue->nd_range_kernel ($kernel, undef, [4], undef);
192 root 1.5
193     # enqueue a barrier to ensure in-order execution
194 root 1.59 $queue->barrier;
195 root 1.4
196 root 1.5 # enqueue an async read
197 root 1.59 $queue->read_buffer ($output, 0, 0, OpenCL::SIZEOF_FLOAT * 4, my $data);
198 root 1.5
199     # wait for all requests to finish
200     $queue->finish;
201    
202     =head2 The same enqueue operations as before, but assuming an out-of-order queue,
203     showing off event objects and wait lists.
204    
205     # execute it for all 4 numbers
206 root 1.59 my $ev = $queue->nd_range_kernel ($kernel, undef, [4], undef);
207 root 1.5
208     # enqueue an async read
209 root 1.59 $ev = $queue->read_buffer ($output, 0, 0, OpenCL::SIZEOF_FLOAT * 4, my $data, $ev);
210 root 1.5
211     # wait for the last event to complete
212 root 1.4 $ev->wait;
213    
214 root 1.38 =head2 Use the OpenGL module to share a texture between OpenCL and OpenGL and draw some julia
215     set tunnel effect.
216    
217 root 1.64 This is quite a long example to get you going - you can download it from
218     L<http://cvs.schmorp.de/OpenCL/examples/juliaflight>.
219 root 1.38
220     use OpenGL ":all";
221     use OpenCL;
222    
223 root 1.64 my $S = $ARGV[0] || 256; # window/texture size, smaller is faster
224    
225 root 1.38 # open a window and create a gl texture
226 root 1.64 OpenGL::glpOpenWindow width => $S, height => $S;
227 root 1.38 my $texid = glGenTextures_p 1;
228     glBindTexture GL_TEXTURE_2D, $texid;
229 root 1.64 glTexImage2D_c GL_TEXTURE_2D, 0, GL_RGBA8, $S, $S, 0, GL_RGBA, GL_UNSIGNED_BYTE, 0;
230 root 1.38
231     # find and use the first opencl device that let's us get a shared opengl context
232     my $platform;
233     my $dev;
234     my $ctx;
235    
236     for (OpenCL::platforms) {
237     $platform = $_;
238     for ($platform->devices) {
239     $dev = $_;
240     $ctx = $platform->context ([OpenCL::GLX_DISPLAY_KHR, undef, OpenCL::GL_CONTEXT_KHR, undef], [$dev])
241     and last;
242     }
243     }
244    
245     $ctx
246     or die "cannot find suitable OpenCL device\n";
247    
248     my $queue = $ctx->queue ($dev);
249    
250     # now attach an opencl image2d object to the opengl texture
251     my $tex = $ctx->gl_texture2d (OpenCL::MEM_WRITE_ONLY, GL_TEXTURE_2D, 0, $texid);
252    
253     # now the boring opencl code
254     my $src = <<EOF;
255     kernel void
256     juliatunnel (write_only image2d_t img, float time)
257     {
258 root 1.64 int2 xy = (int2)(get_global_id (0), get_global_id (1));
259     float2 p = convert_float2 (xy) / $S.f * 2.f - 1.f;
260 root 1.38
261 root 1.64 float2 m = (float2)(1.f, p.y) / fabs (p.x); // tunnel
262     m.x = fabs (fmod (m.x + time * 0.05f, 4.f) - 2.f);
263 root 1.38
264     float2 z = m;
265 root 1.64 float2 c = (float2)(sin (time * 0.01133f), cos (time * 0.02521f));
266 root 1.38
267 root 1.64 for (int i = 0; i < 25 && dot (z, z) < 4.f; ++i) // standard julia
268 root 1.38 z = (float2)(z.x * z.x - z.y * z.y, 2.f * z.x * z.y) + c;
269    
270 root 1.64 float3 colour = (float3)(z.x, z.y, atan2 (z.y, z.x));
271     write_imagef (img, xy, (float4)(colour * p.x * p.x, 1.));
272 root 1.38 }
273     EOF
274    
275 root 1.51 my $prog = $ctx->build_program ($src);
276 root 1.38 my $kernel = $prog->kernel ("juliatunnel");
277    
278     # program compiled, kernel ready, now draw and loop
279    
280     for (my $time; ; ++$time) {
281     # acquire objects from opengl
282 root 1.59 $queue->acquire_gl_objects ([$tex]);
283 root 1.38
284     # configure and run our kernel
285 root 1.64 $kernel->setf ("mf", $tex, $time*2); # mf = memory object, float
286     $queue->nd_range_kernel ($kernel, undef, [$S, $S], undef);
287 root 1.38
288     # release objects to opengl again
289 root 1.59 $queue->release_gl_objects ([$tex]);
290 root 1.38
291     # wait
292 root 1.40 $queue->finish;
293 root 1.38
294     # now draw the texture, the defaults should be all right
295     glTexParameterf GL_TEXTURE_2D, GL_TEXTURE_MIN_FILTER, GL_NEAREST;
296    
297     glEnable GL_TEXTURE_2D;
298     glBegin GL_QUADS;
299     glTexCoord2f 0, 1; glVertex3i -1, -1, -1;
300     glTexCoord2f 0, 0; glVertex3i 1, -1, -1;
301     glTexCoord2f 1, 0; glVertex3i 1, 1, -1;
302     glTexCoord2f 1, 1; glVertex3i -1, 1, -1;
303     glEnd;
304    
305     glXSwapBuffers;
306    
307     select undef, undef, undef, 1/60;
308     }
309    
310 root 1.5 =head1 DOCUMENTATION
311    
312     =head2 BASIC CONVENTIONS
313    
314 root 1.14 This is not a one-to-one C-style translation of OpenCL to Perl - instead
315     I attempted to make the interface as type-safe as possible by introducing
316 root 1.5 object syntax where it makes sense. There are a number of important
317     differences between the OpenCL C API and this module:
318    
319     =over 4
320    
321     =item * Object lifetime managament is automatic - there is no need
322     to free objects explicitly (C<clReleaseXXX>), the release function
323     is called automatically once all Perl references to it go away.
324    
325 root 1.20 =item * OpenCL uses CamelCase for function names
326     (e.g. C<clGetPlatformIDs>, C<clGetPlatformInfo>), while this module
327     uses underscores as word separator and often leaves out prefixes
328     (C<OpenCL::platforms>, C<< $platform->info >>).
329 root 1.5
330     =item * OpenCL often specifies fixed vector function arguments as short
331 root 1.19 arrays (C<size_t origin[3]>), while this module explicitly expects the
332     components as separate arguments (C<$orig_x, $orig_y, $orig_z>) in
333     function calls.
334 root 1.5
335 root 1.19 =item * Structures are often specified by flattening out their components
336     as with short vectors, and returned as arrayrefs.
337 root 1.5
338     =item * When enqueuing commands, the wait list is specified by adding
339 root 1.9 extra arguments to the function - anywhere a C<$wait_events...> argument
340 root 1.44 is documented this can be any number of event objects. As an extsnion
341     implemented by this module, C<undef> values will be ignored in the event
342     list.
343 root 1.5
344     =item * When enqueuing commands, if the enqueue method is called in void
345     context, no event is created. In all other contexts an event is returned
346     by the method.
347    
348     =item * This module expects all functions to return C<CL_SUCCESS>. If any
349     other status is returned the function will throw an exception, so you
350     don't normally have to to any error checking.
351    
352     =back
353    
354 root 1.7 =head2 PERL AND OPENCL TYPES
355    
356 root 1.8 This handy(?) table lists OpenCL types and their perl, PDL and pack/unpack
357 root 1.7 format equivalents:
358    
359 root 1.8 OpenCL perl PDL pack/unpack
360     char IV - c
361     uchar IV byte C
362     short IV short s
363     ushort IV ushort S
364     int IV long? l
365     uint IV - L
366     long IV longlong q
367     ulong IV - Q
368     float NV float f
369     half IV ushort S
370     double NV double d
371 root 1.7
372 root 1.36 =head2 GLX SUPPORT
373    
374     Due to the sad state that OpenGL support is in in Perl (mostly the OpenGL
375     module, which has little to no documentation and has little to no support
376 root 1.38 for glX), this module, as a special extension, treats context creation
377 root 1.36 properties C<OpenCL::GLX_DISPLAY_KHR> and C<OpenCL::GL_CONTEXT_KHR>
378     specially: If either or both of these are C<undef>, then the OpenCL
379 root 1.38 module tries to dynamically resolve C<glXGetCurrentDisplay> and
380     C<glXGetCurrentContext>, call these functions and use their return values
381 root 1.36 instead.
382    
383     For this to work, the OpenGL library must be loaded, a GLX context must
384     have been created and be made current, and C<dlsym> must be available and
385     capable of finding the function via C<RTLD_DEFAULT>.
386    
387 root 1.55 =head2 EVENT SYSTEM
388    
389     OpenCL can generate a number of (potentially) asynchronous events, for
390     example, after compiling a program, to signal a context-related error or,
391     perhaps most important, to signal completion of queued jobs (by setting
392     callbacks on OpenCL::Event objects).
393    
394     To facilitate this, this module maintains an event queue - each
395     time an asynchronous event happens, it is queued, and perl will be
396     interrupted. This is implemented via the L<Async::Interrupt> module. In
397     addition, this module has L<AnyEvent> support, so it can seamlessly
398     integrate itself into many event loops.
399    
400     Since this module is a bit hard to understand, here are some case examples:
401    
402     =head3 Don't use callbacks.
403    
404     When your program never uses any callbacks, then there will never be any
405     notifications you need to take care of, and therefore no need to worry
406     about all this.
407    
408     You can achieve a great deal by explicitly waiting for events, or using
409     barriers and flush calls. In many programs, there is no need at all to
410     tinker with asynchronous events.
411    
412     =head3 Use AnyEvent
413    
414     This module automatically registers a watcher that invokes all outstanding
415     event callbacks when AnyEvent is initialised (and block asynchronous
416     interruptions). Using this mode of operations is the safest and most
417     recommended one.
418    
419     To use this, simply use AnyEvent and this module normally, make sure you
420     have an event loop running:
421    
422     use Gtk2 -init;
423     use AnyEvent;
424    
425     # initialise AnyEvent, by creating a watcher, or:
426     AnyEvent::detect;
427    
428 root 1.59 my $e = $queue->marker;
429 root 1.55 $e->cb (sub {
430     warn "opencl is finished\n";
431     })
432    
433     main Gtk2;
434    
435     Note that this module will not initialise AnyEvent for you. Before
436     AnyEvent is initialised, the module will asynchronously interrupt perl
437     instead. To avoid any surprises, it's best to explicitly initialise
438     AnyEvent.
439    
440     You can temporarily enable asynchronous interruptions (see next paragraph)
441     by calling C<$OpenCL::INTERRUPT->unblock> and disable them again by
442     calling C<$OpenCL::INTERRUPT->block>.
443    
444     =head3 Let yourself be interrupted at any time
445    
446     This mode is the default unless AnyEvent is loaded and initialised. In
447     this mode, OpenCL asynchronously interrupts a running perl program. The
448     emphasis is on both I<asynchronously> and I<running> here.
449    
450     Asynchronously means that perl might execute your callbacks at any
451     time. For example, in the following code (I<THAT YOU SHOULD NOT COPY>),
452     the C<until> loop following the marker call will be interrupted by the
453     callback:
454    
455 root 1.59 my $e = $queue->marker;
456 root 1.55 my $flag;
457     $e->cb (sub { $flag = 1 });
458     1 until $flag;
459     # $flag is now 1
460    
461     The reason why you shouldn't blindly copy the above code is that
462     busy waiting is a really really bad thing, and really really bad for
463     performance.
464    
465     While at first this asynchronous business might look exciting, it can be
466     really hard, because you need to be prepared for the callback code to be
467     executed at any time, which limits the amount of things the callback code
468     can do safely.
469    
470     This can be mitigated somewhat by using C<<
471     $OpenCL::INTERRUPT->scope_block >> (see the L<Async::Interrupt>
472     documentation for details).
473    
474     The other problem is that your program must be actively I<running> to be
475     interrupted. When you calculate stuff, your program is running. When you
476     hang in some C functions or other block execution (by calling C<sleep>,
477     C<select>, running an event loop and so on), your program is waiting, not
478     running.
479    
480     One way around that would be to attach a read watcher to your event loop,
481     listening for events on C<< $OpenCL::INTERRUPT->pipe_fileno >>, using a
482     dummy callback (C<sub { }>) to temporarily execute some perl code.
483    
484     That is then awfully close to using the built-in AnyEvent support above,
485     though, so consider that one instead.
486    
487     =head3 Be creative
488    
489     OpenCL exports the L<Async::Interrupt> object it uses in the global
490     variable C<$OpenCL::INTERRUPT>. You can configure it in any way you like.
491    
492     So if you want to feel like a real pro, err, wait, if you feel no risk
493     menas no fun, you can experiment by implementing your own mode of
494     operations.
495    
496 root 1.52 =cut
497    
498     package OpenCL;
499    
500     use common::sense;
501 root 1.62 use Carp ();
502 root 1.55 use Async::Interrupt ();
503    
504     our $POLL_FUNC; # set by XS
505 root 1.52
506     BEGIN {
507 root 1.62 our $VERSION = '0.98';
508 root 1.52
509     require XSLoader;
510     XSLoader::load (__PACKAGE__, $VERSION);
511    
512     @OpenCL::Platform::ISA =
513     @OpenCL::Device::ISA =
514     @OpenCL::Context::ISA =
515     @OpenCL::Queue::ISA =
516     @OpenCL::Memory::ISA =
517     @OpenCL::Sampler::ISA =
518     @OpenCL::Program::ISA =
519     @OpenCL::Kernel::ISA =
520     @OpenCL::Event::ISA = OpenCL::Object::;
521    
522     @OpenCL::Buffer::ISA =
523     @OpenCL::Image::ISA = OpenCL::Memory::;
524    
525     @OpenCL::BufferObj::ISA = OpenCL::Buffer::;
526    
527     @OpenCL::Image2D::ISA =
528     @OpenCL::Image3D::ISA =
529     @OpenCL::Image2DArray::ISA =
530     @OpenCL::Image1D::ISA =
531     @OpenCL::Image1DArray::ISA =
532     @OpenCL::Image1DBuffer::ISA = OpenCL::Image::;
533    
534     @OpenCL::UserEvent::ISA = OpenCL::Event::;
535     }
536    
537 root 1.5 =head2 THE OpenCL PACKAGE
538    
539     =over 4
540    
541     =item $int = OpenCL::errno
542    
543 root 1.11 The last error returned by a function - it's only valid after an error occured
544     and before calling another OpenCL function.
545 root 1.5
546 root 1.61 =item $str = OpenCL::err2str [$errval]
547 root 1.5
548 root 1.61 Converts an error value into a human readable string. IF no error value is
549     given, then the last error will be used (as returned by OpenCL::errno).
550 root 1.5
551 root 1.9 =item $str = OpenCL::enum2str $enum
552 root 1.5
553 root 1.30 Converts most enum values (of parameter names, image format constants,
554 root 1.5 object types, addressing and filter modes, command types etc.) into a
555 root 1.30 human readable string. When confronted with some random integer it can be
556 root 1.5 very helpful to pass it through this function to maybe get some readable
557     string out of it.
558    
559     =item @platforms = OpenCL::platforms
560    
561     Returns all available OpenCL::Platform objects.
562    
563     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformIDs.html>
564    
565 root 1.56 =item $ctx = OpenCL::context_from_type $properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $callback->($err, $pvt) = $print_stderr
566 root 1.5
567 root 1.56 Tries to create a context from a default device and platform type - never worked for me.
568 root 1.5
569     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html>
570    
571 root 1.56 =item $ctx = OpenCL::context $properties, \@devices, $callback->($err, $pvt) = $print_stderr)
572    
573     Create a new OpenCL::Context object using the given device object(s). This
574     function isn't implemented yet, use C<< $platform->context >> instead.
575    
576     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContext.html>
577    
578 root 1.5 =item OpenCL::wait_for_events $wait_events...
579    
580     Waits for all events to complete.
581    
582     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html>
583    
584 root 1.55 =item OpenCL::poll
585    
586     Checks if there are any outstanding events (see L<EVENT SYSTEM>) and
587     invokes their callbacks.
588    
589     =item $OpenCL::INTERRUPT
590    
591     The L<Async::Interrupt> object used to signal asynchronous events (see
592     L<EVENT SYSTEM>).
593    
594     =cut
595    
596     our $INTERRUPT = new Async::Interrupt c_cb => [$POLL_FUNC, 0];
597    
598     &_eq_initialise ($INTERRUPT->signal_func);
599    
600     =item $OpenCL::WATCHER
601    
602     The L<AnyEvent> watcher object used to watch for asynchronous events (see
603     L<EVENT SYSTEM>). This variable is C<undef> until L<AnyEvent> has been
604     loaded I<and> initialised (e.g. by calling C<AnyEvent::detect>).
605    
606     =cut
607    
608     our $WATCHER;
609    
610     sub _init_anyevent {
611     $INTERRUPT->block;
612     $WATCHER = AE::io ($INTERRUPT->pipe_fileno, 0, sub { $INTERRUPT->handle });
613     }
614    
615     if (defined $AnyEvent::MODEL) {
616     _init_anyevent;
617     } else {
618     push @AnyEvent::post_detect, \&_init_anyevent;
619     }
620    
621 root 1.5 =back
622    
623 root 1.52 =head2 THE OpenCL::Object CLASS
624    
625     This is the base class for all objects in the OpenCL module. The only
626     method it implements is the C<id> method, which is only useful if you want
627     to interface to OpenCL on the C level.
628    
629     =over 4
630    
631     =item $iv = $obj->id
632    
633     OpenCL objects are represented by pointers or integers on the C level. If
634     you want to interface to an OpenCL object directly on the C level, then
635     you need this value, which is returned by this method. You should use an
636     C<IV> type in your code and cast that to the correct type.
637    
638     =cut
639    
640     sub OpenCL::Object::id {
641 root 1.55 ref $_[0] eq "SCALAR"
642     ? ${ $_[0] }
643     : $_[0][0]
644 root 1.52 }
645    
646     =back
647    
648 root 1.5 =head2 THE OpenCL::Platform CLASS
649    
650     =over 4
651    
652     =item @devices = $platform->devices ($type = OpenCL::DEVICE_TYPE_ALL)
653    
654     Returns a list of matching OpenCL::Device objects.
655    
656 root 1.56 =item $ctx = $platform->context_from_type ($properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $callback->($err, $pvt) = $print_stderr)
657 root 1.5
658 root 1.22 Tries to create a context. Never worked for me, and you need devices explicitly anyway.
659 root 1.5
660     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html>
661    
662 root 1.56 =item $ctx = $platform->context ($properties, \@devices, $callback->($err, $pvt) = $print_stderr)
663 root 1.11
664     Create a new OpenCL::Context object using the given device object(s)- a
665     CL_CONTEXT_PLATFORM property is supplied automatically.
666    
667     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContext.html>
668    
669 root 1.20 =item $packed_value = $platform->info ($name)
670    
671     Calls C<clGetPlatformInfo> and returns the packed, raw value - for
672 root 1.22 strings, this will be the string (possibly including terminating \0), for
673     other values you probably need to use the correct C<unpack>.
674 root 1.20
675 root 1.22 It's best to avoid this method and use one of the following convenience
676     wrappers.
677 root 1.20
678     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformInfo.html>
679    
680 root 1.50 =item $platform->unload_compiler
681    
682     Attempts to unload the compiler for this platform, for endless
683     profit. Does nothing on OpenCL 1.1.
684    
685     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clUnloadPlatformCompiler.html>
686    
687 root 1.20 =for gengetinfo begin platform
688    
689     =item $string = $platform->profile
690    
691 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_PROFILE> and returns the result.
692 root 1.20
693     =item $string = $platform->version
694    
695 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_VERSION> and returns the result.
696 root 1.20
697     =item $string = $platform->name
698    
699 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_NAME> and returns the result.
700 root 1.20
701     =item $string = $platform->vendor
702    
703 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_VENDOR> and returns the result.
704 root 1.20
705     =item $string = $platform->extensions
706    
707 root 1.24 Calls C<clGetPlatformInfo> with C<CL_PLATFORM_EXTENSIONS> and returns the result.
708 root 1.21
709 root 1.20 =for gengetinfo end platform
710    
711 root 1.5 =back
712    
713     =head2 THE OpenCL::Device CLASS
714    
715     =over 4
716    
717     =item $packed_value = $device->info ($name)
718    
719     See C<< $platform->info >> for details.
720    
721     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetDeviceInfo.html>
722    
723 root 1.21 =for gengetinfo begin device
724    
725     =item $device_type = $device->type
726    
727 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_TYPE> and returns the result.
728 root 1.21
729     =item $uint = $device->vendor_id
730    
731 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_VENDOR_ID> and returns the result.
732 root 1.21
733     =item $uint = $device->max_compute_units
734    
735 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_COMPUTE_UNITS> and returns the result.
736 root 1.21
737     =item $uint = $device->max_work_item_dimensions
738    
739 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS> and returns the result.
740 root 1.21
741     =item $int = $device->max_work_group_size
742    
743 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WORK_GROUP_SIZE> and returns the result.
744 root 1.21
745     =item @ints = $device->max_work_item_sizes
746    
747 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WORK_ITEM_SIZES> and returns the result.
748 root 1.21
749     =item $uint = $device->preferred_vector_width_char
750    
751 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR> and returns the result.
752 root 1.21
753     =item $uint = $device->preferred_vector_width_short
754    
755 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT> and returns the result.
756 root 1.21
757     =item $uint = $device->preferred_vector_width_int
758    
759 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT> and returns the result.
760 root 1.21
761     =item $uint = $device->preferred_vector_width_long
762    
763 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG> and returns the result.
764 root 1.21
765     =item $uint = $device->preferred_vector_width_float
766    
767 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT> and returns the result.
768 root 1.21
769     =item $uint = $device->preferred_vector_width_double
770    
771 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE> and returns the result.
772 root 1.21
773     =item $uint = $device->max_clock_frequency
774    
775 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_CLOCK_FREQUENCY> and returns the result.
776 root 1.21
777     =item $bitfield = $device->address_bits
778    
779 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_ADDRESS_BITS> and returns the result.
780 root 1.21
781     =item $uint = $device->max_read_image_args
782    
783 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_READ_IMAGE_ARGS> and returns the result.
784 root 1.21
785     =item $uint = $device->max_write_image_args
786    
787 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_WRITE_IMAGE_ARGS> and returns the result.
788 root 1.21
789     =item $ulong = $device->max_mem_alloc_size
790    
791 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_MEM_ALLOC_SIZE> and returns the result.
792 root 1.21
793     =item $int = $device->image2d_max_width
794    
795 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE2D_MAX_WIDTH> and returns the result.
796 root 1.21
797     =item $int = $device->image2d_max_height
798    
799 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE2D_MAX_HEIGHT> and returns the result.
800 root 1.21
801     =item $int = $device->image3d_max_width
802    
803 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE3D_MAX_WIDTH> and returns the result.
804 root 1.21
805     =item $int = $device->image3d_max_height
806    
807 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE3D_MAX_HEIGHT> and returns the result.
808 root 1.21
809     =item $int = $device->image3d_max_depth
810    
811 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE3D_MAX_DEPTH> and returns the result.
812 root 1.21
813     =item $uint = $device->image_support
814    
815 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_IMAGE_SUPPORT> and returns the result.
816 root 1.21
817     =item $int = $device->max_parameter_size
818    
819 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_PARAMETER_SIZE> and returns the result.
820 root 1.21
821     =item $uint = $device->max_samplers
822    
823 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_SAMPLERS> and returns the result.
824 root 1.21
825     =item $uint = $device->mem_base_addr_align
826    
827 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MEM_BASE_ADDR_ALIGN> and returns the result.
828 root 1.21
829     =item $uint = $device->min_data_type_align_size
830    
831 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MIN_DATA_TYPE_ALIGN_SIZE> and returns the result.
832 root 1.21
833     =item $device_fp_config = $device->single_fp_config
834    
835 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_SINGLE_FP_CONFIG> and returns the result.
836 root 1.21
837     =item $device_mem_cache_type = $device->global_mem_cache_type
838    
839 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_CACHE_TYPE> and returns the result.
840 root 1.21
841     =item $uint = $device->global_mem_cacheline_size
842    
843 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE> and returns the result.
844 root 1.21
845     =item $ulong = $device->global_mem_cache_size
846    
847 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_CACHE_SIZE> and returns the result.
848 root 1.21
849     =item $ulong = $device->global_mem_size
850    
851 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_GLOBAL_MEM_SIZE> and returns the result.
852 root 1.21
853     =item $ulong = $device->max_constant_buffer_size
854    
855 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE> and returns the result.
856 root 1.21
857     =item $uint = $device->max_constant_args
858    
859 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_MAX_CONSTANT_ARGS> and returns the result.
860 root 1.21
861     =item $device_local_mem_type = $device->local_mem_type
862    
863 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_LOCAL_MEM_TYPE> and returns the result.
864 root 1.21
865     =item $ulong = $device->local_mem_size
866    
867 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_LOCAL_MEM_SIZE> and returns the result.
868 root 1.21
869     =item $boolean = $device->error_correction_support
870    
871 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_ERROR_CORRECTION_SUPPORT> and returns the result.
872 root 1.21
873     =item $int = $device->profiling_timer_resolution
874    
875 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PROFILING_TIMER_RESOLUTION> and returns the result.
876 root 1.21
877     =item $boolean = $device->endian_little
878    
879 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_ENDIAN_LITTLE> and returns the result.
880 root 1.21
881     =item $boolean = $device->available
882    
883 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_AVAILABLE> and returns the result.
884 root 1.21
885     =item $boolean = $device->compiler_available
886    
887 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_COMPILER_AVAILABLE> and returns the result.
888 root 1.21
889     =item $device_exec_capabilities = $device->execution_capabilities
890    
891 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_EXECUTION_CAPABILITIES> and returns the result.
892 root 1.21
893     =item $command_queue_properties = $device->properties
894    
895 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_QUEUE_PROPERTIES> and returns the result.
896 root 1.21
897     =item $ = $device->platform
898    
899 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PLATFORM> and returns the result.
900 root 1.21
901     =item $string = $device->name
902    
903 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NAME> and returns the result.
904 root 1.21
905     =item $string = $device->vendor
906    
907 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_VENDOR> and returns the result.
908 root 1.21
909     =item $string = $device->driver_version
910    
911 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DRIVER_VERSION> and returns the result.
912 root 1.21
913     =item $string = $device->profile
914    
915 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PROFILE> and returns the result.
916 root 1.21
917     =item $string = $device->version
918    
919 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_VERSION> and returns the result.
920 root 1.21
921     =item $string = $device->extensions
922    
923 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_EXTENSIONS> and returns the result.
924 root 1.21
925     =item $uint = $device->preferred_vector_width_half
926    
927 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PREFERRED_VECTOR_WIDTH_HALF> and returns the result.
928 root 1.21
929     =item $uint = $device->native_vector_width_char
930    
931 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_CHAR> and returns the result.
932 root 1.21
933     =item $uint = $device->native_vector_width_short
934    
935 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_SHORT> and returns the result.
936 root 1.21
937     =item $uint = $device->native_vector_width_int
938    
939 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_INT> and returns the result.
940 root 1.21
941     =item $uint = $device->native_vector_width_long
942    
943 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_LONG> and returns the result.
944 root 1.21
945     =item $uint = $device->native_vector_width_float
946    
947 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_FLOAT> and returns the result.
948 root 1.21
949     =item $uint = $device->native_vector_width_double
950    
951 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_DOUBLE> and returns the result.
952 root 1.21
953     =item $uint = $device->native_vector_width_half
954    
955 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_NATIVE_VECTOR_WIDTH_HALF> and returns the result.
956 root 1.21
957     =item $device_fp_config = $device->double_fp_config
958    
959 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_DOUBLE_FP_CONFIG> and returns the result.
960 root 1.21
961     =item $device_fp_config = $device->half_fp_config
962    
963 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_HALF_FP_CONFIG> and returns the result.
964 root 1.21
965     =item $boolean = $device->host_unified_memory
966    
967 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_HOST_UNIFIED_MEMORY> and returns the result.
968 root 1.21
969     =item $device = $device->parent_device_ext
970    
971 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARENT_DEVICE_EXT> and returns the result.
972 root 1.21
973     =item @device_partition_property_exts = $device->partition_types_ext
974    
975 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARTITION_TYPES_EXT> and returns the result.
976 root 1.21
977     =item @device_partition_property_exts = $device->affinity_domains_ext
978    
979 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_AFFINITY_DOMAINS_EXT> and returns the result.
980 root 1.21
981 root 1.45 =item $uint = $device->reference_count_ext
982 root 1.21
983 root 1.48 Calls C<clGetDeviceInfo> with C<CL_DEVICE_REFERENCE_COUNT_EXT> and returns the result.
984 root 1.21
985     =item @device_partition_property_exts = $device->partition_style_ext
986    
987 root 1.24 Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARTITION_STYLE_EXT> and returns the result.
988 root 1.21
989     =for gengetinfo end device
990    
991 root 1.5 =back
992    
993     =head2 THE OpenCL::Context CLASS
994    
995     =over 4
996    
997 root 1.51 =item $prog = $ctx->build_program ($program, $options = "")
998    
999     This convenience function tries to build the program on all devices in
1000     the context. If the build fails, then the function will C<croak> with the
1001     build log. Otherwise ti returns the program object.
1002    
1003     The C<$program> can either be a C<OpenCL::Program> object or a string
1004     containing the program. In the latter case, a program objetc will be
1005     created automatically.
1006    
1007     =cut
1008    
1009     sub OpenCL::Context::build_program {
1010     my ($self, $prog, $options) = @_;
1011    
1012     $prog = $self->program_with_source ($prog)
1013     unless ref $prog;
1014    
1015 root 1.61 eval { $prog->build (undef, $options); 1 }
1016     or errno == BUILD_PROGRAM_FAILURE
1017 root 1.63 or errno == INVALID_BINARY # workaround nvidia bug
1018 root 1.61 or Carp::croak "OpenCL::Context->build_program: " . err2str;
1019    
1020     # we check status for all devices
1021 root 1.51 for my $dev ($self->devices) {
1022 root 1.61 $prog->build_status ($dev) == BUILD_SUCCESS
1023 root 1.62 or Carp::croak "Building OpenCL program for device '" . $dev->name . "' failed:\n"
1024     . $prog->build_log ($dev);
1025 root 1.51 }
1026    
1027     $prog
1028     }
1029    
1030 root 1.9 =item $queue = $ctx->queue ($device, $properties)
1031 root 1.5
1032 root 1.9 Create a new OpenCL::Queue object from the context and the given device.
1033 root 1.5
1034     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateCommandQueue.html>
1035    
1036 root 1.45 Example: create an out-of-order queue.
1037    
1038     $queue = $ctx->queue ($device, OpenCL::QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE);
1039    
1040 root 1.5 =item $ev = $ctx->user_event
1041    
1042     Creates a new OpenCL::UserEvent object.
1043    
1044     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateUserEvent.html>
1045    
1046     =item $buf = $ctx->buffer ($flags, $len)
1047    
1048 root 1.27 Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object with the
1049     given flags and octet-size.
1050 root 1.5
1051     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateBuffer.html>
1052    
1053     =item $buf = $ctx->buffer_sv ($flags, $data)
1054    
1055 root 1.27 Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object and
1056     initialise it with the given data values.
1057 root 1.5
1058 root 1.59 =item $img = $ctx->image ($self, $flags, $channel_order, $channel_type, $type, $width, $height, $depth = 0, $array_size = 0, $row_pitch = 0, $slice_pitch = 0, $num_mip_level = 0, $num_samples = 0, $*data = &PL_sv_undef)
1059 root 1.49
1060     Creates a new OpenCL::Image object and optionally initialises it with
1061     the given data values.
1062    
1063     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clCreateImage.html>
1064    
1065 root 1.18 =item $img = $ctx->image2d ($flags, $channel_order, $channel_type, $width, $height, $row_pitch = 0, $data = undef)
1066 root 1.5
1067 root 1.27 Creates a new OpenCL::Image2D object and optionally initialises it with
1068     the given data values.
1069 root 1.5
1070     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateImage2D.html>
1071    
1072 root 1.18 =item $img = $ctx->image3d ($flags, $channel_order, $channel_type, $width, $height, $depth, $row_pitch = 0, $slice_pitch = 0, $data = undef)
1073 root 1.5
1074 root 1.27 Creates a new OpenCL::Image3D object and optionally initialises it with
1075     the given data values.
1076 root 1.5
1077     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateImage3D.html>
1078    
1079 root 1.33 =item $buffer = $ctx->gl_buffer ($flags, $bufobj)
1080    
1081     Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object that refers to the given
1082     OpenGL buffer object.
1083    
1084     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLBuffer.html
1085    
1086 root 1.47 =item $img = $ctx->gl_texture ($flags, $target, $miplevel, $texture)
1087    
1088     Creates a new OpenCL::Image object that refers to the given OpenGL
1089     texture object or buffer.
1090    
1091     http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clCreateFromGLTexture.html
1092    
1093     =item $img = $ctx->gl_texture2d ($flags, $target, $miplevel, $texture)
1094 root 1.33
1095     Creates a new OpenCL::Image2D object that refers to the given OpenGL
1096     2D texture object.
1097    
1098     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture2D.html
1099    
1100 root 1.47 =item $img = $ctx->gl_texture3d ($flags, $target, $miplevel, $texture)
1101 root 1.33
1102     Creates a new OpenCL::Image3D object that refers to the given OpenGL
1103     3D texture object.
1104    
1105     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture3D.html
1106    
1107     =item $ctx->gl_renderbuffer ($flags, $renderbuffer)
1108    
1109     Creates a new OpenCL::Image2D object that refers to the given OpenGL
1110     render buffer.
1111    
1112     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLRenderbuffer.html
1113    
1114 root 1.5 =item @formats = $ctx->supported_image_formats ($flags, $image_type)
1115    
1116     Returns a list of matching image formats - each format is an arrayref with
1117     two values, $channel_order and $channel_type, in it.
1118    
1119     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetSupportedImageFormats.html>
1120    
1121     =item $sampler = $ctx->sampler ($normalized_coords, $addressing_mode, $filter_mode)
1122    
1123     Creates a new OpenCL::Sampler object.
1124    
1125     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateSampler.html>
1126    
1127     =item $program = $ctx->program_with_source ($string)
1128    
1129     Creates a new OpenCL::Program object from the given source code.
1130    
1131     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateProgramWithSource.html>
1132    
1133 root 1.20 =item $packed_value = $ctx->info ($name)
1134    
1135     See C<< $platform->info >> for details.
1136    
1137     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetContextInfo.html>
1138    
1139     =for gengetinfo begin context
1140    
1141 root 1.21 =item $uint = $context->reference_count
1142    
1143 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_REFERENCE_COUNT> and returns the result.
1144 root 1.21
1145     =item @devices = $context->devices
1146    
1147 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_DEVICES> and returns the result.
1148 root 1.21
1149     =item @property_ints = $context->properties
1150    
1151 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_PROPERTIES> and returns the result.
1152 root 1.21
1153     =item $uint = $context->num_devices
1154    
1155 root 1.24 Calls C<clGetContextInfo> with C<CL_CONTEXT_NUM_DEVICES> and returns the result.
1156 root 1.21
1157 root 1.20 =for gengetinfo end context
1158    
1159 root 1.5 =back
1160    
1161     =head2 THE OpenCL::Queue CLASS
1162    
1163     An OpenCL::Queue represents an execution queue for OpenCL. You execute
1164 root 1.59 requests by calling their respective method and waiting for it to complete
1165     in some way.
1166 root 1.5
1167 root 1.59 Most methods that enqueue some request return an event object that can
1168     be used to wait for completion (optionally using a callback), unless
1169     the method is called in void context, in which case no event object is
1170     created.
1171 root 1.5
1172     They also allow you to specify any number of other event objects that this
1173     request has to wait for before it starts executing, by simply passing the
1174 root 1.45 event objects as extra parameters to the enqueue methods. To simplify
1175     program design, this module ignores any C<undef> values in the list of
1176     events. This makes it possible to code operations such as this, without
1177     having to put a valid event object into C<$event> first:
1178    
1179 root 1.59 $event = $queue->xxx (..., $event);
1180 root 1.5
1181     Queues execute in-order by default, without any parallelism, so in most
1182 root 1.6 cases (i.e. you use only one queue) it's not necessary to wait for or
1183 root 1.45 create event objects, althoguh an our of order queue is often a bit
1184     faster.
1185 root 1.5
1186     =over 4
1187    
1188 root 1.59 =item $ev = $queue->read_buffer ($buffer, $blocking, $offset, $len, $data, $wait_events...)
1189 root 1.5
1190     Reads data from buffer into the given string.
1191    
1192     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReadBuffer.html>
1193    
1194 root 1.59 =item $ev = $queue->write_buffer ($buffer, $blocking, $offset, $data, $wait_events...)
1195 root 1.5
1196     Writes data to buffer from the given string.
1197    
1198     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWriteBuffer.html>
1199    
1200 root 1.59 =item $ev = $queue->copy_buffer ($src, $dst, $src_offset, $dst_offset, $len, $wait_events...)
1201 root 1.5
1202     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBuffer.html>
1203    
1204 root 1.59 =item $ev = $queue->read_buffer_rect (OpenCL::Memory buf, cl_bool blocking, $buf_x, $buf_y, $buf_z, $host_x, $host_y, $host_z, $width, $height, $depth, $buf_row_pitch, $buf_slice_pitch, $host_row_pitch, $host_slice_pitch, $data, $wait_events...)
1205 root 1.25
1206     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReadBufferRect.html
1207    
1208 root 1.59 =item $ev = $queue->write_buffer_rect (OpenCL::Memory buf, cl_bool blocking, $buf_x, $buf_y, $buf_z, $host_x, $host_y, $host_z, $width, $height, $depth, $buf_row_pitch, $buf_slice_pitch, $host_row_pitch, $host_slice_pitch, $data, $wait_events...)
1209 root 1.25
1210     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWriteBufferRect.html
1211    
1212 root 1.59 =item $ev = $queue->read_image ($src, $blocking, $x, $y, $z, $width, $height, $depth, $row_pitch, $slice_pitch, $data, $wait_events...)
1213 root 1.5
1214 root 1.27 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBufferRect.html>
1215    
1216 root 1.59 =item $ev = $queue->copy_buffer_to_image ($src_buffer, $dst_image, $src_offset, $dst_x, $dst_y, $dst_z, $width, $height, $depth, $wait_events...)
1217 root 1.27
1218 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReadImage.html>
1219    
1220 root 1.59 =item $ev = $queue->write_image ($src, $blocking, $x, $y, $z, $width, $height, $depth, $row_pitch, $slice_pitch, $data, $wait_events...)
1221 root 1.5
1222     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWriteImage.html>
1223    
1224 root 1.59 =item $ev = $queue->copy_image ($src_image, $dst_image, $src_x, $src_y, $src_z, $dst_x, $dst_y, $dst_z, $width, $height, $depth, $wait_events...)
1225 root 1.5
1226     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyImage.html>
1227    
1228 root 1.59 =item $ev = $queue->copy_image_to_buffer ($src_image, $dst_image, $src_x, $src_y, $src_z, $width, $height, $depth, $dst_offset, $wait_events...)
1229 root 1.5
1230     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyImageToBuffer.html>
1231    
1232 root 1.59 =item $ev = $queue->copy_buffer_rect ($src, $dst, $src_x, $src_y, $src_z, $dst_x, $dst_y, $dst_z, $width, $height, $depth, $src_row_pitch, $src_slice_pitch, $dst_row_pitch, $dst_slice_pitch, $wait_event...)
1233 root 1.27
1234     Yeah.
1235    
1236     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBufferToImage.html>.
1237    
1238 root 1.59 =item $ev = $queue->fill_buffer ($mem, $pattern, $offset, $size, ...)
1239 root 1.52
1240     Fills the given buffer object with repeated applications of C<$pattern>,
1241     starting at C<$offset> for C<$size> octets.
1242    
1243     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueFillBuffer.html>
1244    
1245 root 1.59 =item $ev = $queue->fill_image ($img, $r, $g, $b, $a, $x, $y, $z, $width, $height, $depth, ...)
1246 root 1.52
1247     Fills the given image area with the given rgba colour components. The
1248     components are normally floating point values between C<0> and C<1>,
1249     except when the image channel data type is a signe dor unsigned
1250     unnormalised format, in which case the range is determined by the format.
1251    
1252     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueFillImage.html>
1253    
1254 root 1.59 =item $ev = $queue->task ($kernel, $wait_events...)
1255 root 1.5
1256     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueTask.html>
1257    
1258 root 1.59 =item $ev = $queue->nd_range_kernel ($kernel, \@global_work_offset, \@global_work_size, \@local_work_size, $wait_events...)
1259 root 1.5
1260     Enqueues a kernel execution.
1261    
1262 root 1.57 \@global_work_size must be specified as a reference to an array of
1263 root 1.5 integers specifying the work sizes (element counts).
1264    
1265 root 1.57 \@global_work_offset must be either C<undef> (in which case all offsets
1266 root 1.5 are C<0>), or a reference to an array of work offsets, with the same number
1267 root 1.57 of elements as \@global_work_size.
1268 root 1.5
1269 root 1.57 \@local_work_size must be either C<undef> (in which case the
1270 root 1.5 implementation is supposed to choose good local work sizes), or a
1271     reference to an array of local work sizes, with the same number of
1272 root 1.57 elements as \@global_work_size.
1273 root 1.5
1274     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueNDRangeKernel.html>
1275    
1276 root 1.59 =item $ev = $queue->acquire_gl_objects ([object, ...], $wait_events...)
1277 root 1.35
1278     Enqueues a list (an array-ref of OpenCL::Memory objects) to be acquired
1279     for subsequent OpenCL usage.
1280    
1281     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueAcquireGLObjects.html>
1282    
1283 root 1.59 =item $ev = $queue->release_gl_objects ([object, ...], $wait_events...)
1284 root 1.35
1285     Enqueues a list (an array-ref of OpenCL::Memory objects) to be released
1286     for subsequent OpenGL usage.
1287    
1288     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueReleaseGLObjects.html>
1289    
1290 root 1.59 =item $ev = $queue->wait_for_events ($wait_events...)
1291 root 1.5
1292     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWaitForEvents.html>
1293    
1294 root 1.59 =item $ev = $queue->marker ($wait_events...)
1295 root 1.46
1296     L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueMarkerWithWaitList.html>
1297    
1298 root 1.59 =item $ev = $queue->barrier ($wait_events...)
1299 root 1.5
1300 root 1.46 L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueBarrierWithWaitList.html>
1301 root 1.5
1302     =item $queue->flush
1303    
1304     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clFlush.html>
1305    
1306     =item $queue->finish
1307    
1308     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clFinish.html>
1309    
1310 root 1.21 =item $packed_value = $queue->info ($name)
1311    
1312     See C<< $platform->info >> for details.
1313    
1314     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetCommandQueueInfo.html>
1315    
1316     =for gengetinfo begin command_queue
1317    
1318     =item $ctx = $command_queue->context
1319    
1320 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_CONTEXT> and returns the result.
1321 root 1.21
1322     =item $device = $command_queue->device
1323    
1324 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_DEVICE> and returns the result.
1325 root 1.21
1326     =item $uint = $command_queue->reference_count
1327    
1328 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_REFERENCE_COUNT> and returns the result.
1329 root 1.21
1330     =item $command_queue_properties = $command_queue->properties
1331    
1332 root 1.24 Calls C<clGetCommandQueueInfo> with C<CL_QUEUE_PROPERTIES> and returns the result.
1333 root 1.21
1334     =for gengetinfo end command_queue
1335    
1336 root 1.5 =back
1337    
1338     =head2 THE OpenCL::Memory CLASS
1339    
1340     This the superclass of all memory objects - OpenCL::Buffer, OpenCL::Image,
1341 root 1.21 OpenCL::Image2D and OpenCL::Image3D.
1342 root 1.5
1343     =over 4
1344    
1345     =item $packed_value = $memory->info ($name)
1346    
1347     See C<< $platform->info >> for details.
1348    
1349     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetMemObjectInfo.html>
1350    
1351 root 1.21 =for gengetinfo begin mem
1352    
1353     =item $mem_object_type = $mem->type
1354    
1355 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_TYPE> and returns the result.
1356 root 1.21
1357     =item $mem_flags = $mem->flags
1358    
1359 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_FLAGS> and returns the result.
1360 root 1.21
1361     =item $int = $mem->size
1362    
1363 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_SIZE> and returns the result.
1364 root 1.21
1365     =item $ptr_value = $mem->host_ptr
1366    
1367 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_HOST_PTR> and returns the result.
1368 root 1.21
1369     =item $uint = $mem->map_count
1370    
1371 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_MAP_COUNT> and returns the result.
1372 root 1.21
1373     =item $uint = $mem->reference_count
1374    
1375 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_REFERENCE_COUNT> and returns the result.
1376 root 1.21
1377     =item $ctx = $mem->context
1378    
1379 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_CONTEXT> and returns the result.
1380 root 1.21
1381     =item $mem = $mem->associated_memobject
1382    
1383 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_ASSOCIATED_MEMOBJECT> and returns the result.
1384 root 1.21
1385     =item $int = $mem->offset
1386    
1387 root 1.24 Calls C<clGetMemObjectInfo> with C<CL_MEM_OFFSET> and returns the result.
1388 root 1.21
1389     =for gengetinfo end mem
1390    
1391 root 1.34 =item ($type, $name) = $mem->gl_object_info
1392    
1393     Returns the OpenGL object type (e.g. OpenCL::GL_OBJECT_TEXTURE2D) and the
1394     object "name" (e.g. the texture name) used to create this memory object.
1395    
1396     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetGLObjectInfo.html>
1397    
1398 root 1.5 =back
1399    
1400 root 1.27 =head2 THE OpenCL::Buffer CLASS
1401    
1402     This is a subclass of OpenCL::Memory, and the superclass of
1403     OpenCL::BufferObj. Its purpose is simply to distinguish between buffers
1404     and sub-buffers.
1405    
1406     =head2 THE OpenCL::BufferObj CLASS
1407    
1408     This is a subclass of OpenCL::Buffer and thus OpenCL::Memory. It exists
1409     because one cna create sub buffers of OpenLC::BufferObj objects, but not
1410     sub buffers from these sub buffers.
1411    
1412     =over 4
1413    
1414     =item $subbuf = $buf_obj->sub_buffer_region ($flags, $origin, $size)
1415    
1416     Creates an OpenCL::Buffer objects from this buffer and returns it. The
1417     C<buffer_create_type> is assumed to be C<CL_BUFFER_CREATE_TYPE_REGION>.
1418    
1419     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateSubBuffer.html>
1420    
1421     =back
1422    
1423 root 1.20 =head2 THE OpenCL::Image CLASS
1424    
1425 root 1.47 This is the superclass of all image objects - OpenCL::Image1D,
1426     OpenCL::Image1DArray, OpenCL::Image1DBuffer, OpenCL::Image2D,
1427     OpenCL::Image2DArray and OpenCL::Image3D.
1428 root 1.20
1429     =over 4
1430    
1431 root 1.53 =item $packed_value = $image->image_info ($name)
1432 root 1.20
1433     See C<< $platform->info >> for details.
1434    
1435     The reason this method is not called C<info> is that there already is an
1436     C<< ->info >> method inherited from C<OpenCL::Memory>.
1437    
1438     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetImageInfo.html>
1439    
1440 root 1.53 =item ($channel_order, $channel_data_type) = $image->format
1441    
1442     Returns the channel order and type used to create the image by calling
1443     C<clGetImageInfo> with C<CL_IMAGE_FORMAT>.
1444    
1445 root 1.21 =for gengetinfo begin image
1446    
1447     =item $int = $image->element_size
1448    
1449 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_ELEMENT_SIZE> and returns the result.
1450 root 1.21
1451     =item $int = $image->row_pitch
1452    
1453 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_ROW_PITCH> and returns the result.
1454 root 1.21
1455     =item $int = $image->slice_pitch
1456    
1457 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_SLICE_PITCH> and returns the result.
1458 root 1.21
1459     =item $int = $image->width
1460    
1461 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_WIDTH> and returns the result.
1462 root 1.21
1463     =item $int = $image->height
1464    
1465 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_HEIGHT> and returns the result.
1466 root 1.21
1467     =item $int = $image->depth
1468    
1469 root 1.24 Calls C<clGetImageInfo> with C<CL_IMAGE_DEPTH> and returns the result.
1470 root 1.21
1471     =for gengetinfo end image
1472    
1473 root 1.34 =for gengetinfo begin gl_texture
1474    
1475     =item $GLenum = $gl_texture->target
1476    
1477 root 1.37 Calls C<clGetGLTextureInfo> with C<CL_GL_TEXTURE_TARGET> and returns the result.
1478 root 1.34
1479     =item $GLint = $gl_texture->gl_mipmap_level
1480    
1481 root 1.37 Calls C<clGetGLTextureInfo> with C<CL_GL_MIPMAP_LEVEL> and returns the result.
1482 root 1.34
1483     =for gengetinfo end gl_texture
1484    
1485 root 1.20 =back
1486    
1487 root 1.5 =head2 THE OpenCL::Sampler CLASS
1488    
1489     =over 4
1490    
1491     =item $packed_value = $sampler->info ($name)
1492    
1493     See C<< $platform->info >> for details.
1494    
1495     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetSamplerInfo.html>
1496    
1497 root 1.21 =for gengetinfo begin sampler
1498    
1499     =item $uint = $sampler->reference_count
1500    
1501 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_REFERENCE_COUNT> and returns the result.
1502 root 1.21
1503     =item $ctx = $sampler->context
1504    
1505 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_CONTEXT> and returns the result.
1506 root 1.21
1507     =item $addressing_mode = $sampler->normalized_coords
1508    
1509 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_NORMALIZED_COORDS> and returns the result.
1510 root 1.21
1511     =item $filter_mode = $sampler->addressing_mode
1512    
1513 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_ADDRESSING_MODE> and returns the result.
1514 root 1.21
1515     =item $boolean = $sampler->filter_mode
1516    
1517 root 1.24 Calls C<clGetSamplerInfo> with C<CL_SAMPLER_FILTER_MODE> and returns the result.
1518 root 1.21
1519     =for gengetinfo end sampler
1520    
1521 root 1.5 =back
1522    
1523     =head2 THE OpenCL::Program CLASS
1524    
1525     =over 4
1526    
1527 root 1.55 =item $program->build (\@devices = undef, $options = "", $cb->($program) = undef)
1528 root 1.5
1529 root 1.51 Tries to build the program with the given options. See also the
1530     C<$ctx->build> convenience function.
1531 root 1.5
1532 root 1.55 If a callback is specified, then it will be called when compilation is
1533     finished. Note that many OpenCL implementations block your program while
1534     compiling whether you use a callback or not. See C<build_async> if you
1535     want to make sure the build is done in the background.
1536    
1537 root 1.63 Note that some OpenCL implementations act up badly, and don't call the
1538 root 1.55 callback in some error cases (but call it in others). This implementation
1539     assumes the callback will always be called, and leaks memory if this is
1540     not so. So best make sure you don't pass in invalid values.
1541    
1542 root 1.63 Some implementations fail with C<OpenCL::INVALID_BINARY> when the
1543     compilation state is successful but some later stage fails.
1544    
1545 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clBuildProgram.html>
1546    
1547 root 1.55 =item $program->build_async (\@devices = undef, $options = "", $cb->($program) = undef)
1548    
1549     Similar to C<< ->build >>, except it starts a thread, and never fails (you
1550     need to check the compilation status form the callback, or by polling).
1551    
1552 root 1.5 =item $packed_value = $program->build_info ($device, $name)
1553    
1554     Similar to C<< $platform->info >>, but returns build info for a previous
1555     build attempt for the given device.
1556    
1557     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetBuildInfo.html>
1558    
1559     =item $kernel = $program->kernel ($function_name)
1560    
1561     Creates an OpenCL::Kernel object out of the named C<__kernel> function in
1562     the program.
1563    
1564     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateKernel.html>
1565    
1566 root 1.50 =item @kernels = $program->kernels_in_program
1567    
1568     Returns all kernels successfully compiled for all devices in program.
1569    
1570     http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateKernelsInProgram.html
1571    
1572 root 1.21 =for gengetinfo begin program_build
1573    
1574     =item $build_status = $program->build_status ($device)
1575    
1576 root 1.24 Calls C<clGetProgramBuildInfo> with C<CL_PROGRAM_BUILD_STATUS> and returns the result.
1577 root 1.21
1578     =item $string = $program->build_options ($device)
1579    
1580 root 1.24 Calls C<clGetProgramBuildInfo> with C<CL_PROGRAM_BUILD_OPTIONS> and returns the result.
1581 root 1.21
1582     =item $string = $program->build_log ($device)
1583    
1584 root 1.24 Calls C<clGetProgramBuildInfo> with C<CL_PROGRAM_BUILD_LOG> and returns the result.
1585 root 1.21
1586     =for gengetinfo end program_build
1587    
1588     =item $packed_value = $program->info ($name)
1589    
1590     See C<< $platform->info >> for details.
1591    
1592     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetProgramInfo.html>
1593    
1594     =for gengetinfo begin program
1595    
1596     =item $uint = $program->reference_count
1597    
1598 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_REFERENCE_COUNT> and returns the result.
1599 root 1.21
1600     =item $ctx = $program->context
1601    
1602 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_CONTEXT> and returns the result.
1603 root 1.21
1604     =item $uint = $program->num_devices
1605    
1606 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_NUM_DEVICES> and returns the result.
1607 root 1.21
1608     =item @devices = $program->devices
1609    
1610 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_DEVICES> and returns the result.
1611 root 1.21
1612     =item $string = $program->source
1613    
1614 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_SOURCE> and returns the result.
1615 root 1.21
1616     =item @ints = $program->binary_sizes
1617    
1618 root 1.24 Calls C<clGetProgramInfo> with C<CL_PROGRAM_BINARY_SIZES> and returns the result.
1619 root 1.21
1620     =for gengetinfo end program
1621    
1622 root 1.23 =item @blobs = $program->binaries
1623    
1624     Returns a string for the compiled binary for every device associated with
1625     the program, empty strings indicate missing programs, and an empty result
1626     means no program binaries are available.
1627    
1628     These "binaries" are often, in fact, informative low-level assembly
1629     sources.
1630    
1631     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetProgramInfo.html>
1632    
1633 root 1.5 =back
1634    
1635     =head2 THE OpenCL::Kernel CLASS
1636    
1637     =over 4
1638    
1639     =item $packed_value = $kernel->info ($name)
1640    
1641     See C<< $platform->info >> for details.
1642    
1643     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetKernelInfo.html>
1644    
1645 root 1.21 =for gengetinfo begin kernel
1646    
1647     =item $string = $kernel->function_name
1648    
1649 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_FUNCTION_NAME> and returns the result.
1650 root 1.21
1651     =item $uint = $kernel->num_args
1652    
1653 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_NUM_ARGS> and returns the result.
1654 root 1.21
1655     =item $uint = $kernel->reference_count
1656    
1657 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_REFERENCE_COUNT> and returns the result.
1658 root 1.21
1659     =item $ctx = $kernel->context
1660    
1661 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_CONTEXT> and returns the result.
1662 root 1.21
1663     =item $program = $kernel->program
1664    
1665 root 1.24 Calls C<clGetKernelInfo> with C<CL_KERNEL_PROGRAM> and returns the result.
1666 root 1.21
1667     =for gengetinfo end kernel
1668    
1669 root 1.20 =item $packed_value = $kernel->work_group_info ($device, $name)
1670    
1671     See C<< $platform->info >> for details.
1672    
1673     The reason this method is not called C<info> is that there already is an
1674     C<< ->info >> method.
1675    
1676     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetKernelWorkGroupInfo.html>
1677    
1678 root 1.21 =for gengetinfo begin kernel_work_group
1679    
1680     =item $int = $kernel->work_group_size ($device)
1681    
1682 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_WORK_GROUP_SIZE> and returns the result.
1683 root 1.21
1684     =item @ints = $kernel->compile_work_group_size ($device)
1685    
1686 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_COMPILE_WORK_GROUP_SIZE> and returns the result.
1687 root 1.21
1688     =item $ulong = $kernel->local_mem_size ($device)
1689    
1690 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_LOCAL_MEM_SIZE> and returns the result.
1691 root 1.21
1692     =item $int = $kernel->preferred_work_group_size_multiple ($device)
1693    
1694 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE> and returns the result.
1695 root 1.21
1696     =item $ulong = $kernel->private_mem_size ($device)
1697    
1698 root 1.24 Calls C<clGetKernelWorkGroupInfo> with C<CL_KERNEL_PRIVATE_MEM_SIZE> and returns the result.
1699 root 1.21
1700     =for gengetinfo end kernel_work_group
1701    
1702 root 1.60 =item $kernel->setf ($format, ...)
1703    
1704     Sets the arguments of a kernel. Since OpenCL 1.1 doesn't have a generic
1705     way to set arguments (and with OpenCL 1.2 it might be rather slow), you
1706     need to specify a format argument, much as with C<printf>, to tell OpenCL
1707     what type of argument it is.
1708    
1709     The format arguments are single letters:
1710    
1711     c char
1712     C unsigned char
1713     s short
1714     S unsigned short
1715     i int
1716     I unsigned int
1717     l long
1718     L unsigned long
1719    
1720     h half float (0..65535)
1721     f float
1722     d double
1723    
1724     z local (octet size)
1725    
1726     m memory object (buffer or image)
1727     a sampler
1728     e event
1729    
1730     Space characters in the format string are ignored.
1731    
1732     Example: set the arguments for a kernel that expects an int, two floats, a buffer and an image.
1733    
1734     $kernel->setf ("i ff mm", 5, 0.5, 3, $buffer, $image);
1735    
1736 root 1.58 =item $kernel->set_TYPE ($index, $value)
1737 root 1.5
1738 root 1.58 =item $kernel->set_char ($index, $value)
1739 root 1.5
1740 root 1.58 =item $kernel->set_uchar ($index, $value)
1741    
1742     =item $kernel->set_short ($index, $value)
1743    
1744     =item $kernel->set_ushort ($index, $value)
1745    
1746     =item $kernel->set_int ($index, $value)
1747    
1748     =item $kernel->set_uint ($index, $value)
1749    
1750     =item $kernel->set_long ($index, $value)
1751    
1752     =item $kernel->set_ulong ($index, $value)
1753    
1754     =item $kernel->set_half ($index, $value)
1755    
1756     =item $kernel->set_float ($index, $value)
1757    
1758     =item $kernel->set_double ($index, $value)
1759    
1760     =item $kernel->set_memory ($index, $value)
1761    
1762     =item $kernel->set_buffer ($index, $value)
1763    
1764     =item $kernel->set_image ($index, $value)
1765    
1766     =item $kernel->set_sampler ($index, $value)
1767    
1768     =item $kernel->set_local ($index, $value)
1769    
1770     =item $kernel->set_event ($index, $value)
1771    
1772     This is a family of methods to set the kernel argument with the number
1773     C<$index> to the give C<$value>.
1774 root 1.5
1775     Chars and integers (including the half type) are specified as integers,
1776 root 1.58 float and double as floating point values, memory/buffer/image must be
1777     an object of that type or C<undef>, local-memory arguments are set by
1778     specifying the size, and sampler and event must be objects of that type.
1779    
1780     Note that C<set_memory> works for all memory objects (all types of buffers
1781     and images) - the main purpose of the more specific C<set_TYPE> functions
1782     is type checking.
1783 root 1.5
1784 root 1.50 Setting an argument for a kernel does NOT keep a reference to the object -
1785     for example, if you set an argument to some image object, free the image,
1786     and call the kernel, you will run into undefined behaviour.
1787    
1788 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetKernelArg.html>
1789    
1790     =back
1791    
1792     =head2 THE OpenCL::Event CLASS
1793    
1794     This is the superclass for all event objects (including OpenCL::UserEvent
1795     objects).
1796    
1797     =over 4
1798    
1799 root 1.21 =item $ev->wait
1800    
1801     Waits for the event to complete.
1802    
1803     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html>
1804    
1805 root 1.55 =item $ev->cb ($exec_callback_type, $callback->($event, $event_command_exec_status))
1806    
1807     Adds a callback to the callback stack for the given event type. There is
1808     no way to remove a callback again.
1809    
1810     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetEventCallback.html>
1811    
1812 root 1.5 =item $packed_value = $ev->info ($name)
1813    
1814     See C<< $platform->info >> for details.
1815    
1816     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetEventInfo.html>
1817    
1818 root 1.21 =for gengetinfo begin event
1819    
1820     =item $queue = $event->command_queue
1821    
1822 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_COMMAND_QUEUE> and returns the result.
1823 root 1.21
1824     =item $command_type = $event->command_type
1825    
1826 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_COMMAND_TYPE> and returns the result.
1827 root 1.21
1828     =item $uint = $event->reference_count
1829    
1830 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_REFERENCE_COUNT> and returns the result.
1831 root 1.21
1832     =item $uint = $event->command_execution_status
1833    
1834 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_COMMAND_EXECUTION_STATUS> and returns the result.
1835 root 1.21
1836     =item $ctx = $event->context
1837    
1838 root 1.24 Calls C<clGetEventInfo> with C<CL_EVENT_CONTEXT> and returns the result.
1839 root 1.21
1840     =for gengetinfo end event
1841    
1842 root 1.20 =item $packed_value = $ev->profiling_info ($name)
1843    
1844     See C<< $platform->info >> for details.
1845    
1846     The reason this method is not called C<info> is that there already is an
1847     C<< ->info >> method.
1848    
1849     L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetProfilingInfo.html>
1850    
1851 root 1.21 =for gengetinfo begin profiling
1852    
1853     =item $ulong = $event->profiling_command_queued
1854    
1855 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_QUEUED> and returns the result.
1856 root 1.21
1857     =item $ulong = $event->profiling_command_submit
1858    
1859 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_SUBMIT> and returns the result.
1860 root 1.21
1861     =item $ulong = $event->profiling_command_start
1862    
1863 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_START> and returns the result.
1864 root 1.21
1865     =item $ulong = $event->profiling_command_end
1866 root 1.5
1867 root 1.24 Calls C<clGetEventProfilingInfo> with C<CL_PROFILING_COMMAND_END> and returns the result.
1868 root 1.5
1869 root 1.21 =for gengetinfo end profiling
1870 root 1.5
1871     =back
1872    
1873     =head2 THE OpenCL::UserEvent CLASS
1874    
1875     This is a subclass of OpenCL::Event.
1876 root 1.4
1877 root 1.1 =over 4
1878    
1879 root 1.5 =item $ev->set_status ($execution_status)
1880    
1881 root 1.55 Sets the execution status of the user event. Can only be called once,
1882     either with OpenCL::COMPLETE or a negative number as status.
1883    
1884 root 1.5 L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetUserEventStatus.html>
1885    
1886     =back
1887    
1888 root 1.1 =cut
1889    
1890     1;
1891    
1892     =head1 AUTHOR
1893    
1894     Marc Lehmann <schmorp@schmorp.de>
1895     http://home.schmorp.de/
1896    
1897     =cut
1898