ViewVC Help
View File | Revision Log | Show Annotations | Download File
/cvs/OpenCL/OpenCL.pm
(Generate patch)

Comparing OpenCL/OpenCL.pm (file contents):
Revision 1.39 by root, Thu Apr 19 19:37:18 2012 UTC vs.
Revision 1.57 by root, Tue Apr 24 23:58:34 2012 UTC

43 43
44OpenCL::Event objects are used to signal when something is complete. 44OpenCL::Event objects are used to signal when something is complete.
45 45
46=head2 HELPFUL RESOURCES 46=head2 HELPFUL RESOURCES
47 47
48The OpenCL spec used to develop this module (1.2 spec was available, but 48The OpenCL specs used to develop this module:
49no implementation was available to me :).
50 49
51 http://www.khronos.org/registry/cl/specs/opencl-1.1.pdf 50 http://www.khronos.org/registry/cl/specs/opencl-1.1.pdf
51 http://www.khronos.org/registry/cl/specs/opencl-1.2.pdf
52 http://www.khronos.org/registry/cl/specs/opencl-1.2-extensions.pdf
52 53
53OpenCL manpages: 54OpenCL manpages:
54 55
55 http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/ 56 http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/
57 http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/
56 58
57If you are into UML class diagrams, the following diagram might help - if 59If you are into UML class diagrams, the following diagram might help - if
58not, it will be mildly cobfusing: 60not, it will be mildly confusing (also, the class hierarchy of this module
61is much more fine-grained):
59 62
60 http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/classDiagram.html 63 http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/classDiagram.html
61 64
62Here's a tutorial from AMD (very AMD-centric, too), not sure how useful it 65Here's a tutorial from AMD (very AMD-centric, too), not sure how useful it
63is, but at least it's free of charge: 66is, but at least it's free of charge:
64 67
65 http://developer.amd.com/zones/OpenCLZone/courses/Documents/Introduction_to_OpenCL_Programming%20Training_Guide%20%28201005%29.pdf 68 http://developer.amd.com/zones/OpenCLZone/courses/Documents/Introduction_to_OpenCL_Programming%20Training_Guide%20%28201005%29.pdf
157 $id = get_global_id (0); 160 $id = get_global_id (0);
158 output [id] = input [id] * input [id]; 161 output [id] = input [id] * input [id];
159 } 162 }
160 '; 163 ';
161 164
162 my $prog = $ctx->program_with_source ($src); 165 my $prog = $ctx->build_program ($src);
163
164 # build croaks on compile errors, so catch it and print the compile errors
165 eval { $prog->build ($dev); 1 }
166 or die $prog->build_log;
167
168 my $kernel = $prog->kernel ("squareit"); 166 my $kernel = $prog->kernel ("squareit");
169 167
170=head2 Create some input and output float buffers, then call the 168=head2 Create some input and output float buffers, then call the
171'squareit' kernel on them. 169'squareit' kernel on them.
172 170
267 265
268 float3 colour = (float3)(z.x, z.y, z.x * z.y); 266 float3 colour = (float3)(z.x, z.y, z.x * z.y);
269 write_imagef (img, (int2)(get_global_id (0), get_global_id (1)), (float4)(colour * p.x * p.x, 1.)); 267 write_imagef (img, (int2)(get_global_id (0), get_global_id (1)), (float4)(colour * p.x * p.x, 1.));
270 } 268 }
271 EOF 269 EOF
270
272 my $prog = $ctx->program_with_source ($src); 271 my $prog = $ctx->build_program ($src);
273 eval { $prog->build ($dev); 1 }
274 or die $prog->build_log ($dev);
275
276 my $kernel = $prog->kernel ("juliatunnel"); 272 my $kernel = $prog->kernel ("juliatunnel");
277 273
278 # program compiled, kernel ready, now draw and loop 274 # program compiled, kernel ready, now draw and loop
279 275
280 for (my $time; ; ++$time) { 276 for (my $time; ; ++$time) {
288 284
289 # release objects to opengl again 285 # release objects to opengl again
290 $queue->enqueue_release_gl_objects ([$tex]); 286 $queue->enqueue_release_gl_objects ([$tex]);
291 287
292 # wait 288 # wait
293 $queue->flush; 289 $queue->finish;
294 290
295 # now draw the texture, the defaults should be all right 291 # now draw the texture, the defaults should be all right
296 glTexParameterf GL_TEXTURE_2D, GL_TEXTURE_MIN_FILTER, GL_NEAREST; 292 glTexParameterf GL_TEXTURE_2D, GL_TEXTURE_MIN_FILTER, GL_NEAREST;
297 293
298 glEnable GL_TEXTURE_2D; 294 glEnable GL_TEXTURE_2D;
336=item * Structures are often specified by flattening out their components 332=item * Structures are often specified by flattening out their components
337as with short vectors, and returned as arrayrefs. 333as with short vectors, and returned as arrayrefs.
338 334
339=item * When enqueuing commands, the wait list is specified by adding 335=item * When enqueuing commands, the wait list is specified by adding
340extra arguments to the function - anywhere a C<$wait_events...> argument 336extra arguments to the function - anywhere a C<$wait_events...> argument
341is documented this can be any number of event objects. 337is documented this can be any number of event objects. As an extsnion
338implemented by this module, C<undef> values will be ignored in the event
339list.
342 340
343=item * When enqueuing commands, if the enqueue method is called in void 341=item * When enqueuing commands, if the enqueue method is called in void
344context, no event is created. In all other contexts an event is returned 342context, no event is created. In all other contexts an event is returned
345by the method. 343by the method.
346 344
381 379
382For this to work, the OpenGL library must be loaded, a GLX context must 380For this to work, the OpenGL library must be loaded, a GLX context must
383have been created and be made current, and C<dlsym> must be available and 381have been created and be made current, and C<dlsym> must be available and
384capable of finding the function via C<RTLD_DEFAULT>. 382capable of finding the function via C<RTLD_DEFAULT>.
385 383
384=head2 EVENT SYSTEM
385
386OpenCL can generate a number of (potentially) asynchronous events, for
387example, after compiling a program, to signal a context-related error or,
388perhaps most important, to signal completion of queued jobs (by setting
389callbacks on OpenCL::Event objects).
390
391To facilitate this, this module maintains an event queue - each
392time an asynchronous event happens, it is queued, and perl will be
393interrupted. This is implemented via the L<Async::Interrupt> module. In
394addition, this module has L<AnyEvent> support, so it can seamlessly
395integrate itself into many event loops.
396
397Since this module is a bit hard to understand, here are some case examples:
398
399=head3 Don't use callbacks.
400
401When your program never uses any callbacks, then there will never be any
402notifications you need to take care of, and therefore no need to worry
403about all this.
404
405You can achieve a great deal by explicitly waiting for events, or using
406barriers and flush calls. In many programs, there is no need at all to
407tinker with asynchronous events.
408
409=head3 Use AnyEvent
410
411This module automatically registers a watcher that invokes all outstanding
412event callbacks when AnyEvent is initialised (and block asynchronous
413interruptions). Using this mode of operations is the safest and most
414recommended one.
415
416To use this, simply use AnyEvent and this module normally, make sure you
417have an event loop running:
418
419 use Gtk2 -init;
420 use AnyEvent;
421
422 # initialise AnyEvent, by creating a watcher, or:
423 AnyEvent::detect;
424
425 my $e = $queue->enqueue_marker;
426 $e->cb (sub {
427 warn "opencl is finished\n";
428 })
429
430 main Gtk2;
431
432Note that this module will not initialise AnyEvent for you. Before
433AnyEvent is initialised, the module will asynchronously interrupt perl
434instead. To avoid any surprises, it's best to explicitly initialise
435AnyEvent.
436
437You can temporarily enable asynchronous interruptions (see next paragraph)
438by calling C<$OpenCL::INTERRUPT->unblock> and disable them again by
439calling C<$OpenCL::INTERRUPT->block>.
440
441=head3 Let yourself be interrupted at any time
442
443This mode is the default unless AnyEvent is loaded and initialised. In
444this mode, OpenCL asynchronously interrupts a running perl program. The
445emphasis is on both I<asynchronously> and I<running> here.
446
447Asynchronously means that perl might execute your callbacks at any
448time. For example, in the following code (I<THAT YOU SHOULD NOT COPY>),
449the C<until> loop following the marker call will be interrupted by the
450callback:
451
452 my $e = $queue->enqueue_marker;
453 my $flag;
454 $e->cb (sub { $flag = 1 });
455 1 until $flag;
456 # $flag is now 1
457
458The reason why you shouldn't blindly copy the above code is that
459busy waiting is a really really bad thing, and really really bad for
460performance.
461
462While at first this asynchronous business might look exciting, it can be
463really hard, because you need to be prepared for the callback code to be
464executed at any time, which limits the amount of things the callback code
465can do safely.
466
467This can be mitigated somewhat by using C<<
468$OpenCL::INTERRUPT->scope_block >> (see the L<Async::Interrupt>
469documentation for details).
470
471The other problem is that your program must be actively I<running> to be
472interrupted. When you calculate stuff, your program is running. When you
473hang in some C functions or other block execution (by calling C<sleep>,
474C<select>, running an event loop and so on), your program is waiting, not
475running.
476
477One way around that would be to attach a read watcher to your event loop,
478listening for events on C<< $OpenCL::INTERRUPT->pipe_fileno >>, using a
479dummy callback (C<sub { }>) to temporarily execute some perl code.
480
481That is then awfully close to using the built-in AnyEvent support above,
482though, so consider that one instead.
483
484=head3 Be creative
485
486OpenCL exports the L<Async::Interrupt> object it uses in the global
487variable C<$OpenCL::INTERRUPT>. You can configure it in any way you like.
488
489So if you want to feel like a real pro, err, wait, if you feel no risk
490menas no fun, you can experiment by implementing your own mode of
491operations.
492
493=cut
494
495package OpenCL;
496
497use common::sense;
498use Async::Interrupt ();
499
500our $POLL_FUNC; # set by XS
501
502BEGIN {
503 our $VERSION = '0.97';
504
505 require XSLoader;
506 XSLoader::load (__PACKAGE__, $VERSION);
507
508 @OpenCL::Platform::ISA =
509 @OpenCL::Device::ISA =
510 @OpenCL::Context::ISA =
511 @OpenCL::Queue::ISA =
512 @OpenCL::Memory::ISA =
513 @OpenCL::Sampler::ISA =
514 @OpenCL::Program::ISA =
515 @OpenCL::Kernel::ISA =
516 @OpenCL::Event::ISA = OpenCL::Object::;
517
518 @OpenCL::Buffer::ISA =
519 @OpenCL::Image::ISA = OpenCL::Memory::;
520
521 @OpenCL::BufferObj::ISA = OpenCL::Buffer::;
522
523 @OpenCL::Image2D::ISA =
524 @OpenCL::Image3D::ISA =
525 @OpenCL::Image2DArray::ISA =
526 @OpenCL::Image1D::ISA =
527 @OpenCL::Image1DArray::ISA =
528 @OpenCL::Image1DBuffer::ISA = OpenCL::Image::;
529
530 @OpenCL::UserEvent::ISA = OpenCL::Event::;
531}
532
386=head2 THE OpenCL PACKAGE 533=head2 THE OpenCL PACKAGE
387 534
388=over 4 535=over 4
389 536
390=item $int = OpenCL::errno 537=item $int = OpenCL::errno
408 555
409Returns all available OpenCL::Platform objects. 556Returns all available OpenCL::Platform objects.
410 557
411L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformIDs.html> 558L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformIDs.html>
412 559
413=item $ctx = OpenCL::context_from_type $properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $notify = undef 560=item $ctx = OpenCL::context_from_type $properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $callback->($err, $pvt) = $print_stderr
414 561
415Tries to create a context from a default device and platform - never worked for me. 562Tries to create a context from a default device and platform type - never worked for me.
416 563
417L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html> 564L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html>
418 565
566=item $ctx = OpenCL::context $properties, \@devices, $callback->($err, $pvt) = $print_stderr)
567
568Create a new OpenCL::Context object using the given device object(s). This
569function isn't implemented yet, use C<< $platform->context >> instead.
570
571L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContext.html>
572
419=item OpenCL::wait_for_events $wait_events... 573=item OpenCL::wait_for_events $wait_events...
420 574
421Waits for all events to complete. 575Waits for all events to complete.
422 576
423L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html> 577L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html>
424 578
579=item OpenCL::poll
580
581Checks if there are any outstanding events (see L<EVENT SYSTEM>) and
582invokes their callbacks.
583
584=item $OpenCL::INTERRUPT
585
586The L<Async::Interrupt> object used to signal asynchronous events (see
587L<EVENT SYSTEM>).
588
589=cut
590
591our $INTERRUPT = new Async::Interrupt c_cb => [$POLL_FUNC, 0];
592
593&_eq_initialise ($INTERRUPT->signal_func);
594
595=item $OpenCL::WATCHER
596
597The L<AnyEvent> watcher object used to watch for asynchronous events (see
598L<EVENT SYSTEM>). This variable is C<undef> until L<AnyEvent> has been
599loaded I<and> initialised (e.g. by calling C<AnyEvent::detect>).
600
601=cut
602
603our $WATCHER;
604
605sub _init_anyevent {
606 $INTERRUPT->block;
607 $WATCHER = AE::io ($INTERRUPT->pipe_fileno, 0, sub { $INTERRUPT->handle });
608}
609
610if (defined $AnyEvent::MODEL) {
611 _init_anyevent;
612} else {
613 push @AnyEvent::post_detect, \&_init_anyevent;
614}
615
425=back 616=back
426 617
618=head2 THE OpenCL::Object CLASS
619
620This is the base class for all objects in the OpenCL module. The only
621method it implements is the C<id> method, which is only useful if you want
622to interface to OpenCL on the C level.
623
624=over 4
625
626=item $iv = $obj->id
627
628OpenCL objects are represented by pointers or integers on the C level. If
629you want to interface to an OpenCL object directly on the C level, then
630you need this value, which is returned by this method. You should use an
631C<IV> type in your code and cast that to the correct type.
632
633=cut
634
635sub OpenCL::Object::id {
636 ref $_[0] eq "SCALAR"
637 ? ${ $_[0] }
638 : $_[0][0]
639}
640
641=back
642
427=head2 THE OpenCL::Platform CLASS 643=head2 THE OpenCL::Platform CLASS
428 644
429=over 4 645=over 4
430 646
431=item @devices = $platform->devices ($type = OpenCL::DEVICE_TYPE_ALL) 647=item @devices = $platform->devices ($type = OpenCL::DEVICE_TYPE_ALL)
432 648
433Returns a list of matching OpenCL::Device objects. 649Returns a list of matching OpenCL::Device objects.
434 650
435=item $ctx = $platform->context_from_type ($properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $notify = undef) 651=item $ctx = $platform->context_from_type ($properties, $type = OpenCL::DEVICE_TYPE_DEFAULT, $callback->($err, $pvt) = $print_stderr)
436 652
437Tries to create a context. Never worked for me, and you need devices explicitly anyway. 653Tries to create a context. Never worked for me, and you need devices explicitly anyway.
438 654
439L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html> 655L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContextFromType.html>
440 656
441=item $ctx = $platform->context ($properties = undef, @$devices, $notify = undef) 657=item $ctx = $platform->context ($properties, \@devices, $callback->($err, $pvt) = $print_stderr)
442 658
443Create a new OpenCL::Context object using the given device object(s)- a 659Create a new OpenCL::Context object using the given device object(s)- a
444CL_CONTEXT_PLATFORM property is supplied automatically. 660CL_CONTEXT_PLATFORM property is supplied automatically.
445 661
446L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContext.html> 662L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateContext.html>
454It's best to avoid this method and use one of the following convenience 670It's best to avoid this method and use one of the following convenience
455wrappers. 671wrappers.
456 672
457L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformInfo.html> 673L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetPlatformInfo.html>
458 674
675=item $platform->unload_compiler
676
677Attempts to unload the compiler for this platform, for endless
678profit. Does nothing on OpenCL 1.1.
679
680L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clUnloadPlatformCompiler.html>
681
459=for gengetinfo begin platform 682=for gengetinfo begin platform
460 683
461=item $string = $platform->profile 684=item $string = $platform->profile
462 685
463Calls C<clGetPlatformInfo> with C<CL_PLATFORM_PROFILE> and returns the result. 686Calls C<clGetPlatformInfo> with C<CL_PLATFORM_PROFILE> and returns the result.
748 971
749=item @device_partition_property_exts = $device->affinity_domains_ext 972=item @device_partition_property_exts = $device->affinity_domains_ext
750 973
751Calls C<clGetDeviceInfo> with C<CL_DEVICE_AFFINITY_DOMAINS_EXT> and returns the result. 974Calls C<clGetDeviceInfo> with C<CL_DEVICE_AFFINITY_DOMAINS_EXT> and returns the result.
752 975
753=item $uint = $device->reference_count_ext 976=item $uint = $device->reference_count_ext
754 977
755Calls C<clGetDeviceInfo> with C<CL_DEVICE_REFERENCE_COUNT_EXT > and returns the result. 978Calls C<clGetDeviceInfo> with C<CL_DEVICE_REFERENCE_COUNT_EXT> and returns the result.
756 979
757=item @device_partition_property_exts = $device->partition_style_ext 980=item @device_partition_property_exts = $device->partition_style_ext
758 981
759Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARTITION_STYLE_EXT> and returns the result. 982Calls C<clGetDeviceInfo> with C<CL_DEVICE_PARTITION_STYLE_EXT> and returns the result.
760 983
764 987
765=head2 THE OpenCL::Context CLASS 988=head2 THE OpenCL::Context CLASS
766 989
767=over 4 990=over 4
768 991
992=item $prog = $ctx->build_program ($program, $options = "")
993
994This convenience function tries to build the program on all devices in
995the context. If the build fails, then the function will C<croak> with the
996build log. Otherwise ti returns the program object.
997
998The C<$program> can either be a C<OpenCL::Program> object or a string
999containing the program. In the latter case, a program objetc will be
1000created automatically.
1001
1002=cut
1003
1004sub OpenCL::Context::build_program {
1005 my ($self, $prog, $options) = @_;
1006
1007 require Carp;
1008
1009 $prog = $self->program_with_source ($prog)
1010 unless ref $prog;
1011
1012 # we build separately per device so we instantly know which one failed
1013 for my $dev ($self->devices) {
1014 eval { $prog->build ([$dev], $options); 1 }
1015 or Carp::croak ("Building OpenCL program for device '" . $dev->name . "' failed:\n"
1016 . $prog->build_log ($dev));
1017 }
1018
1019 $prog
1020}
1021
769=item $queue = $ctx->queue ($device, $properties) 1022=item $queue = $ctx->queue ($device, $properties)
770 1023
771Create a new OpenCL::Queue object from the context and the given device. 1024Create a new OpenCL::Queue object from the context and the given device.
772 1025
773L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateCommandQueue.html> 1026L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateCommandQueue.html>
1027
1028Example: create an out-of-order queue.
1029
1030 $queue = $ctx->queue ($device, OpenCL::QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE);
774 1031
775=item $ev = $ctx->user_event 1032=item $ev = $ctx->user_event
776 1033
777Creates a new OpenCL::UserEvent object. 1034Creates a new OpenCL::UserEvent object.
778 1035
788=item $buf = $ctx->buffer_sv ($flags, $data) 1045=item $buf = $ctx->buffer_sv ($flags, $data)
789 1046
790Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object and 1047Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object and
791initialise it with the given data values. 1048initialise it with the given data values.
792 1049
1050=item $img = $ctx->image ($self, $flags, $channel_order, $channel_type, $type, $width, $height, $depth, $array_size = 0, $row_pitch = 0, $slice_pitch = 0, $num_mip_level = 0, $num_samples = 0, $*data = &PL_sv_undef)
1051
1052Creates a new OpenCL::Image object and optionally initialises it with
1053the given data values.
1054
1055L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clCreateImage.html>
1056
793=item $img = $ctx->image2d ($flags, $channel_order, $channel_type, $width, $height, $row_pitch = 0, $data = undef) 1057=item $img = $ctx->image2d ($flags, $channel_order, $channel_type, $width, $height, $row_pitch = 0, $data = undef)
794 1058
795Creates a new OpenCL::Image2D object and optionally initialises it with 1059Creates a new OpenCL::Image2D object and optionally initialises it with
796the given data values. 1060the given data values.
797 1061
809Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object that refers to the given 1073Creates a new OpenCL::Buffer (actually OpenCL::BufferObj) object that refers to the given
810OpenGL buffer object. 1074OpenGL buffer object.
811 1075
812http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLBuffer.html 1076http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLBuffer.html
813 1077
1078=item $img = $ctx->gl_texture ($flags, $target, $miplevel, $texture)
1079
1080Creates a new OpenCL::Image object that refers to the given OpenGL
1081texture object or buffer.
1082
1083http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clCreateFromGLTexture.html
1084
814=item $ctx->gl_texture2d ($flags, $target, $miplevel, $texture) 1085=item $img = $ctx->gl_texture2d ($flags, $target, $miplevel, $texture)
815 1086
816Creates a new OpenCL::Image2D object that refers to the given OpenGL 1087Creates a new OpenCL::Image2D object that refers to the given OpenGL
8172D texture object. 10882D texture object.
818 1089
819http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture2D.html 1090http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture2D.html
820 1091
821=item $ctx->gl_texture3d ($flags, $target, $miplevel, $texture) 1092=item $img = $ctx->gl_texture3d ($flags, $target, $miplevel, $texture)
822 1093
823Creates a new OpenCL::Image3D object that refers to the given OpenGL 1094Creates a new OpenCL::Image3D object that refers to the given OpenGL
8243D texture object. 10953D texture object.
825 1096
826http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture3D.html 1097http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateFromGLTexture3D.html
889for completion, unless the method is called in void context, in which case 1160for completion, unless the method is called in void context, in which case
890no event object is created. 1161no event object is created.
891 1162
892They also allow you to specify any number of other event objects that this 1163They also allow you to specify any number of other event objects that this
893request has to wait for before it starts executing, by simply passing the 1164request has to wait for before it starts executing, by simply passing the
894event objects as extra parameters to the enqueue methods. 1165event objects as extra parameters to the enqueue methods. To simplify
1166program design, this module ignores any C<undef> values in the list of
1167events. This makes it possible to code operations such as this, without
1168having to put a valid event object into C<$event> first:
1169
1170 $event = $queue->enqueue_xxx (..., $event);
895 1171
896Queues execute in-order by default, without any parallelism, so in most 1172Queues execute in-order by default, without any parallelism, so in most
897cases (i.e. you use only one queue) it's not necessary to wait for or 1173cases (i.e. you use only one queue) it's not necessary to wait for or
898create event objects. 1174create event objects, althoguh an our of order queue is often a bit
1175faster.
899 1176
900=over 4 1177=over 4
901 1178
902=item $ev = $queue->enqueue_read_buffer ($buffer, $blocking, $offset, $len, $data, $wait_events...) 1179=item $ev = $queue->enqueue_read_buffer ($buffer, $blocking, $offset, $len, $data, $wait_events...)
903 1180
947 1224
948Yeah. 1225Yeah.
949 1226
950L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBufferToImage.html>. 1227L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueCopyBufferToImage.html>.
951 1228
1229=item $ev = $queue->enqueue_fill_buffer ($mem, $pattern, $offset, $size, ...)
1230
1231Fills the given buffer object with repeated applications of C<$pattern>,
1232starting at C<$offset> for C<$size> octets.
1233
1234L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueFillBuffer.html>
1235
1236=item $ev = $queue->enqueue_fill_image ($img, $r, $g, $b, $a, $x, $y, $z, $width, $height, $depth, ...)
1237
1238Fills the given image area with the given rgba colour components. The
1239components are normally floating point values between C<0> and C<1>,
1240except when the image channel data type is a signe dor unsigned
1241unnormalised format, in which case the range is determined by the format.
1242
1243L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueFillImage.html>
1244
952=item $ev = $queue->enqueue_task ($kernel, $wait_events...) 1245=item $ev = $queue->enqueue_task ($kernel, $wait_events...)
953 1246
954L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueTask.html> 1247L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueTask.html>
955 1248
956=item $ev = $queue->enqueue_nd_range_kernel ($kernel, @$global_work_offset, @$global_work_size, @$local_work_size, $wait_events...) 1249=item $ev = $queue->enqueue_nd_range_kernel ($kernel, \@global_work_offset, \@global_work_size, \@local_work_size, $wait_events...)
957 1250
958Enqueues a kernel execution. 1251Enqueues a kernel execution.
959 1252
960@$global_work_size must be specified as a reference to an array of 1253\@global_work_size must be specified as a reference to an array of
961integers specifying the work sizes (element counts). 1254integers specifying the work sizes (element counts).
962 1255
963@$global_work_offset must be either C<undef> (in which case all offsets 1256\@global_work_offset must be either C<undef> (in which case all offsets
964are C<0>), or a reference to an array of work offsets, with the same number 1257are C<0>), or a reference to an array of work offsets, with the same number
965of elements as @$global_work_size. 1258of elements as \@global_work_size.
966 1259
967@$local_work_size must be either C<undef> (in which case the 1260\@local_work_size must be either C<undef> (in which case the
968implementation is supposed to choose good local work sizes), or a 1261implementation is supposed to choose good local work sizes), or a
969reference to an array of local work sizes, with the same number of 1262reference to an array of local work sizes, with the same number of
970elements as @$global_work_size. 1263elements as \@global_work_size.
971 1264
972L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueNDRangeKernel.html> 1265L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueNDRangeKernel.html>
973
974=item $ev = $queue->enqueue_marker ($wait_events...)
975
976L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueMarker.html>
977 1266
978=item $ev = $queue->enqueue_acquire_gl_objects ([object, ...], $wait_events...) 1267=item $ev = $queue->enqueue_acquire_gl_objects ([object, ...], $wait_events...)
979 1268
980Enqueues a list (an array-ref of OpenCL::Memory objects) to be acquired 1269Enqueues a list (an array-ref of OpenCL::Memory objects) to be acquired
981for subsequent OpenCL usage. 1270for subsequent OpenCL usage.
991 1280
992=item $ev = $queue->enqueue_wait_for_events ($wait_events...) 1281=item $ev = $queue->enqueue_wait_for_events ($wait_events...)
993 1282
994L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWaitForEvents.html> 1283L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueWaitForEvents.html>
995 1284
996=item $queue->enqueue_barrier 1285=item $ev = $queue->enqueue_marker ($wait_events...)
997 1286
1287L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueMarkerWithWaitList.html>
1288
1289=item $ev = $queue->enqueue_barrier ($wait_events...)
1290
998L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clEnqueueBarrier.html> 1291L<http://www.khronos.org/registry/cl/sdk/1.2/docs/man/xhtml/clEnqueueBarrierWithWaitList.html>
999 1292
1000=item $queue->flush 1293=item $queue->flush
1001 1294
1002L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clFlush.html> 1295L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clFlush.html>
1003 1296
1118 1411
1119=back 1412=back
1120 1413
1121=head2 THE OpenCL::Image CLASS 1414=head2 THE OpenCL::Image CLASS
1122 1415
1123This is the superclass of all image objects - OpenCL::Image2D and OpenCL::Image3D. 1416This is the superclass of all image objects - OpenCL::Image1D,
1417OpenCL::Image1DArray, OpenCL::Image1DBuffer, OpenCL::Image2D,
1418OpenCL::Image2DArray and OpenCL::Image3D.
1124 1419
1125=over 4 1420=over 4
1126 1421
1127=item $packed_value = $ev->image_info ($name) 1422=item $packed_value = $image->image_info ($name)
1128 1423
1129See C<< $platform->info >> for details. 1424See C<< $platform->info >> for details.
1130 1425
1131The reason this method is not called C<info> is that there already is an 1426The reason this method is not called C<info> is that there already is an
1132C<< ->info >> method inherited from C<OpenCL::Memory>. 1427C<< ->info >> method inherited from C<OpenCL::Memory>.
1133 1428
1134L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetImageInfo.html> 1429L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetImageInfo.html>
1135 1430
1431=item ($channel_order, $channel_data_type) = $image->format
1432
1433Returns the channel order and type used to create the image by calling
1434C<clGetImageInfo> with C<CL_IMAGE_FORMAT>.
1435
1136=for gengetinfo begin image 1436=for gengetinfo begin image
1137 1437
1138=item $int = $image->element_size 1438=item $int = $image->element_size
1139 1439
1140Calls C<clGetImageInfo> with C<CL_IMAGE_ELEMENT_SIZE> and returns the result. 1440Calls C<clGetImageInfo> with C<CL_IMAGE_ELEMENT_SIZE> and returns the result.
1213 1513
1214=head2 THE OpenCL::Program CLASS 1514=head2 THE OpenCL::Program CLASS
1215 1515
1216=over 4 1516=over 4
1217 1517
1218=item $program->build ($device, $options = "") 1518=item $program->build (\@devices = undef, $options = "", $cb->($program) = undef)
1219 1519
1220Tries to build the program with the givne options. 1520Tries to build the program with the given options. See also the
1521C<$ctx->build> convenience function.
1522
1523If a callback is specified, then it will be called when compilation is
1524finished. Note that many OpenCL implementations block your program while
1525compiling whether you use a callback or not. See C<build_async> if you
1526want to make sure the build is done in the background.
1527
1528Note that some OpenCL implementations atc up badly, and don't call the
1529callback in some error cases (but call it in others). This implementation
1530assumes the callback will always be called, and leaks memory if this is
1531not so. So best make sure you don't pass in invalid values.
1221 1532
1222L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clBuildProgram.html> 1533L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clBuildProgram.html>
1534
1535=item $program->build_async (\@devices = undef, $options = "", $cb->($program) = undef)
1536
1537Similar to C<< ->build >>, except it starts a thread, and never fails (you
1538need to check the compilation status form the callback, or by polling).
1223 1539
1224=item $packed_value = $program->build_info ($device, $name) 1540=item $packed_value = $program->build_info ($device, $name)
1225 1541
1226Similar to C<< $platform->info >>, but returns build info for a previous 1542Similar to C<< $platform->info >>, but returns build info for a previous
1227build attempt for the given device. 1543build attempt for the given device.
1232 1548
1233Creates an OpenCL::Kernel object out of the named C<__kernel> function in 1549Creates an OpenCL::Kernel object out of the named C<__kernel> function in
1234the program. 1550the program.
1235 1551
1236L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateKernel.html> 1552L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateKernel.html>
1553
1554=item @kernels = $program->kernels_in_program
1555
1556Returns all kernels successfully compiled for all devices in program.
1557
1558http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clCreateKernelsInProgram.html
1237 1559
1238=for gengetinfo begin program_build 1560=for gengetinfo begin program_build
1239 1561
1240=item $build_status = $program->build_status ($device) 1562=item $build_status = $program->build_status ($device)
1241 1563
1369 1691
1370This is a family of methods to set the kernel argument with the number C<$index> to the give C<$value>. 1692This is a family of methods to set the kernel argument with the number C<$index> to the give C<$value>.
1371 1693
1372TYPE is one of C<char>, C<uchar>, C<short>, C<ushort>, C<int>, C<uint>, 1694TYPE is one of C<char>, C<uchar>, C<short>, C<ushort>, C<int>, C<uint>,
1373C<long>, C<ulong>, C<half>, C<float>, C<double>, C<memory>, C<buffer>, 1695C<long>, C<ulong>, C<half>, C<float>, C<double>, C<memory>, C<buffer>,
1374C<image2d>, C<image3d>, C<sampler> or C<event>. 1696C<image2d>, C<image3d>, C<sampler>, C<local> or C<event>.
1375 1697
1376Chars and integers (including the half type) are specified as integers, 1698Chars and integers (including the half type) are specified as integers,
1377float and double as floating point values, memory/buffer/image2d/image3d 1699float and double as floating point values, memory/buffer/image2d/image3d
1378must be an object of that type or C<undef>, and sampler and event must be 1700must be an object of that type or C<undef>, local-memory arguments are
1379objects of that type. 1701set by specifying the size, and sampler and event must be objects of that
1702type.
1703
1704Setting an argument for a kernel does NOT keep a reference to the object -
1705for example, if you set an argument to some image object, free the image,
1706and call the kernel, you will run into undefined behaviour.
1380 1707
1381L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetKernelArg.html> 1708L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetKernelArg.html>
1382 1709
1383=back 1710=back
1384 1711
1393 1720
1394Waits for the event to complete. 1721Waits for the event to complete.
1395 1722
1396L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html> 1723L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clWaitForEvents.html>
1397 1724
1725=item $ev->cb ($exec_callback_type, $callback->($event, $event_command_exec_status))
1726
1727Adds a callback to the callback stack for the given event type. There is
1728no way to remove a callback again.
1729
1730L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetEventCallback.html>
1731
1398=item $packed_value = $ev->info ($name) 1732=item $packed_value = $ev->info ($name)
1399 1733
1400See C<< $platform->info >> for details. 1734See C<< $platform->info >> for details.
1401 1735
1402L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetEventInfo.html> 1736L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clGetEventInfo.html>
1462 1796
1463=over 4 1797=over 4
1464 1798
1465=item $ev->set_status ($execution_status) 1799=item $ev->set_status ($execution_status)
1466 1800
1801Sets the execution status of the user event. Can only be called once,
1802either with OpenCL::COMPLETE or a negative number as status.
1803
1467L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetUserEventStatus.html> 1804L<http://www.khronos.org/registry/cl/sdk/1.1/docs/man/xhtml/clSetUserEventStatus.html>
1468 1805
1469=back 1806=back
1470 1807
1471=cut 1808=cut
1472
1473package OpenCL;
1474
1475use common::sense;
1476
1477BEGIN {
1478 our $VERSION = '0.95';
1479
1480 require XSLoader;
1481 XSLoader::load (__PACKAGE__, $VERSION);
1482
1483 @OpenCL::Buffer::ISA =
1484 @OpenCL::Image::ISA = OpenCL::Memory::;
1485
1486 @OpenCL::BufferObj::ISA = OpenCL::Buffer::;
1487
1488 @OpenCL::Image2D::ISA =
1489 @OpenCL::Image3D::ISA = OpenCL::Image::;
1490
1491 @OpenCL::UserEvent::ISA = OpenCL::Event::;
1492}
1493 1809
14941; 18101;
1495 1811
1496=head1 AUTHOR 1812=head1 AUTHOR
1497 1813

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines