[ViewVC] Diff of: cvs/AnyEvent-MP/MP.pm

Comparing AnyEvent-MP/MP.pm (file contents):
Revision 1.17 by root, Mon Aug 3 08:35:40 2009 UTC vs.
Revision 1.50 by root, Fri Aug 14 14:01:05 2009 UTC

…		…
4		4
5	=head1 SYNOPSIS	5	=head1 SYNOPSIS
6		6
7	use AnyEvent::MP;	7	use AnyEvent::MP;
8		8
9	NODE # returns this node identifier
10	$NODE # contains this node identifier	9	$NODE # contains this node's noderef
		10	NODE # returns this node's noderef
		11	NODE $port # returns the noderef of the port
11		12
		13	$SELF # receiving/own port id in rcv callbacks
		14
		15	# initialise the node so it can send/receive messages
		16	initialise_node; # -OR-
		17	initialise_node "localhost:4040"; # -OR-
		18	initialise_node "slave/", "localhost:4040"
		19
		20	# ports are message endpoints
		21
		22	# sending messages
12	snd $port, type => data...;	23	snd $port, type => data...;
		24	snd $port, @msg;
		25	snd @msg_with_first_element_being_a_port;
13		26
14	rcv $port, smartmatch => $cb->($port, @msg);	27	# creating/using ports, the simple way
		28	my $somple_port = port { my @msg = @_; 0 };
15		29
16	# examples:	30	# creating/using ports, type matching
		31	my $port = port;
17	rcv $port2, ping => sub { snd $_[0], "pong"; 0 };	32	rcv $port, ping => sub { snd $_[0], "pong"; 0 };
18	rcv $port1, pong => sub { warn "pong received\n" };	33	rcv $port, pong => sub { warn "pong received\n"; 0 };
19	snd $port2, ping => $port1;
20		34
21	# more, smarter, matches (_any_ is exported by this module)	35	# create a port on another node
22	rcv $port, [child_died => $pid] => sub { ...	36	my $port = spawn $node, $initfunc, @initdata;
23	rcv $port, [_any_, _any_, 3] => sub { .. $_[2] is 3	37
		38	# monitoring
		39	mon $port, $cb->(@msg) # callback is invoked on death
		40	mon $port, $otherport # kill otherport on abnormal death
		41	mon $port, $otherport, @msg # send message on death
		42
		43	=head1 CURRENT STATUS
		44
		45	AnyEvent::MP - stable API, should work
		46	AnyEvent::MP::Intro - outdated
		47	AnyEvent::MP::Kernel - WIP
		48	AnyEvent::MP::Transport - mostly stable
		49
		50	stay tuned.
24		51
25	=head1 DESCRIPTION	52	=head1 DESCRIPTION
26		53
27	This module (-family) implements a simple message passing framework.	54	This module (-family) implements a simple message passing framework.
28		55
29	Despite its simplicity, you can securely message other processes running	56	Despite its simplicity, you can securely message other processes running
30	on the same or other hosts.	57	on the same or other hosts.
31		58
		59	For an introduction to this module family, see the L<AnyEvent::MP::Intro>
		60	manual page.
		61
32	At the moment, this module family is severly brokena nd underdocumented,	62	At the moment, this module family is severly broken and underdocumented,
33	so do not use. This was uploaded mainly to resreve the CPAN namespace -	63	so do not use. This was uploaded mainly to reserve the CPAN namespace -
34	stay tuned!	64	stay tuned!
35		65
36	=head1 CONCEPTS	66	=head1 CONCEPTS
37		67
38	=over 4	68	=over 4
39		69
40	=item port	70	=item port
41		71
42	A port is something you can send messages to with the C<snd> function, and	72	A port is something you can send messages to (with the C<snd> function).
43	you can register C<rcv> handlers with. All C<rcv> handlers will receive	73
44	messages they match, messages will not be queued.	74	Some ports allow you to register C<rcv> handlers that can match specific
		75	messages. All C<rcv> handlers will receive messages they match, messages
		76	will not be queued.
45		77
46	=item port id - C<noderef#portname>	78	=item port id - C<noderef#portname>
47		79
48	A port id is always the noderef, a hash-mark (C<#>) as separator, followed	80	A port id is normaly the concatenation of a noderef, a hash-mark (C<#>) as
49	by a port name (a printable string of unspecified format).	81	separator, and a port name (a printable string of unspecified format). An
		82	exception is the the node port, whose ID is identical to its node
		83	reference.
50		84
51	=item node	85	=item node
52		86
53	A node is a single process containing at least one port - the node	87	A node is a single process containing at least one port - the node
54	port. You can send messages to node ports to let them create new ports,	88	port. You can send messages to node ports to find existing ports or to
55	among other things.	89	create new ports, among other things.
56		90
57	Initially, nodes are either private (single-process only) or hidden	91	Nodes are either private (single-process only), slaves (connected to a
58	(connected to a master node only). Only when they epxlicitly "become	92	master node only) or public nodes (connectable from unrelated nodes).
59	public" can you send them messages from unrelated other nodes.
60		93
61	=item noderef - C<host:port,host:port...>, C<id@noderef>, C<id>	94	=item noderef - C<host:port,host:port...>, C<id@noderef>, C<id>
62		95
63	A noderef is a string that either uniquely identifies a given node (for	96	A node reference is a string that either simply identifies the node (for
64	private and hidden nodes), or contains a recipe on how to reach a given	97	private and slave nodes), or contains a recipe on how to reach a given
65	node (for public nodes).	98	node (for public nodes).
66		99
		100	This recipe is simply a comma-separated list of C<address:port> pairs (for
		101	TCP/IP, other protocols might look different).
		102
		103	Node references come in two flavours: resolved (containing only numerical
		104	addresses) or unresolved (where hostnames are used instead of addresses).
		105
		106	Before using an unresolved node reference in a message you first have to
		107	resolve it.
		108
67	=back	109	=back
68		110
69	=head1 VARIABLES/FUNCTIONS	111	=head1 VARIABLES/FUNCTIONS
70		112
71	=over 4	113	=over 4
72		114
73	=cut	115	=cut
74		116
75	package AnyEvent::MP;	117	package AnyEvent::MP;
76		118
77	use AnyEvent::MP::Base;	119	use AnyEvent::MP::Kernel;
78		120
79	use common::sense;	121	use common::sense;
80		122
81	use Carp ();	123	use Carp ();
82		124
83	use AE ();	125	use AE ();
84		126
85	use base "Exporter";	127	use base "Exporter";
86		128
87	our $VERSION = '0.02';	129	our $VERSION = $AnyEvent::MP::Kernel::VERSION;
		130
88	our @EXPORT = qw(	131	our @EXPORT = qw(
89	NODE $NODE $PORT snd rcv _any_	132	NODE $NODE *SELF node_of _any_
90	create_port create_port_on	133	resolve_node initialise_node
91	create_miniport	134	snd rcv mon kil reg psub spawn
92	become_slave become_public	135	port
93	);	136	);
94		137
		138	our $SELF;
		139
		140	sub _self_die() {
		141	my $msg = $@;
		142	$msg =~ s/\n+$// unless ref $msg;
		143	kil $SELF, die => $msg;
		144	}
		145
95	=item NODE / $NODE	146	=item $thisnode = NODE / $NODE
96		147
97	The C<NODE ()> function and the C<$NODE> variable contain the noderef of	148	The C<NODE> function returns, and the C<$NODE> variable contains
98	the local node. The value is initialised by a call to C<become_public> or	149	the noderef of the local node. The value is initialised by a call
99	C<become_slave>, after which all local port identifiers become invalid.	150	to C<become_public> or C<become_slave>, after which all local port
		151	identifiers become invalid.
100		152
		153	=item $noderef = node_of $port
		154
		155	Extracts and returns the noderef from a portid or a noderef.
		156
		157	=item initialise_node $noderef, $seednode, $seednode...
		158
		159	=item initialise_node "slave/", $master, $master...
		160
		161	Before a node can talk to other nodes on the network it has to initialise
		162	itself - the minimum a node needs to know is it's own name, and optionally
		163	it should know the noderefs of some other nodes in the network.
		164
		165	This function initialises a node - it must be called exactly once (or
		166	never) before calling other AnyEvent::MP functions.
		167
		168	All arguments (optionally except for the first) are noderefs, which can be
		169	either resolved or unresolved.
		170
		171	The first argument will be looked up in the configuration database first
		172	(if it is C<undef> then the current nodename will be used instead) to find
		173	the relevant configuration profile (see L<aemp>). If none is found then
		174	the default configuration is used. The configuration supplies additional
		175	seed/master nodes and can override the actual noderef.
		176
		177	There are two types of networked nodes, public nodes and slave nodes:
		178
		179	=over 4
		180
		181	=item public nodes
		182
		183	For public nodes, C<$noderef> (supplied either directly to
		184	C<initialise_node> or indirectly via a profile or the nodename) must be a
		185	noderef (possibly unresolved, in which case it will be resolved).
		186
		187	After resolving, the node will bind itself on all endpoints and try to
		188	connect to all additional C<$seednodes> that are specified. Seednodes are
		189	optional and can be used to quickly bootstrap the node into an existing
		190	network.
		191
		192	=item slave nodes
		193
		194	When the C<$noderef> (either as given or overriden by the config file)
		195	is the special string C<slave/>, then the node will become a slave
		196	node. Slave nodes cannot be contacted from outside and will route most of
		197	their traffic to the master node that they attach to.
		198
		199	At least one additional noderef is required (either by specifying it
		200	directly or because it is part of the configuration profile): The node
		201	will try to connect to all of them and will become a slave attached to the
		202	first node it can successfully connect to.
		203
		204	=back
		205
		206	This function will block until all nodes have been resolved and, for slave
		207	nodes, until it has successfully established a connection to a master
		208	server.
		209
		210	Example: become a public node listening on the guessed noderef, or the one
		211	specified via C<aemp> for the current node. This should be the most common
		212	form of invocation for "daemon"-type nodes.
		213
		214	initialise_node;
		215
		216	Example: become a slave node to any of the the seednodes specified via
		217	C<aemp>. This form is often used for commandline clients.
		218
		219	initialise_node "slave/";
		220
		221	Example: become a slave node to any of the specified master servers. This
		222	form is also often used for commandline clients.
		223
		224	initialise_node "slave/", "master1", "192.168.13.17", "mp.example.net";
		225
		226	Example: become a public node, and try to contact some well-known master
		227	servers to become part of the network.
		228
		229	initialise_node undef, "master1", "master2";
		230
		231	Example: become a public node listening on port C<4041>.
		232
		233	initialise_node 4041;
		234
		235	Example: become a public node, only visible on localhost port 4044.
		236
		237	initialise_node "localhost:4044";
		238
		239	=item $cv = resolve_node $noderef
		240
		241	Takes an unresolved node reference that may contain hostnames and
		242	abbreviated IDs, resolves all of them and returns a resolved node
		243	reference.
		244
		245	In addition to C<address:port> pairs allowed in resolved noderefs, the
		246	following forms are supported:
		247
		248	=over 4
		249
		250	=item the empty string
		251
		252	An empty-string component gets resolved as if the default port (4040) was
		253	specified.
		254
		255	=item naked port numbers (e.g. C<1234>)
		256
		257	These are resolved by prepending the local nodename and a colon, to be
		258	further resolved.
		259
		260	=item hostnames (e.g. C<localhost:1234>, C<localhost>)
		261
		262	These are resolved by using AnyEvent::DNS to resolve them, optionally
		263	looking up SRV records for the C<aemp=4040> port, if no port was
		264	specified.
		265
		266	=back
		267
		268	=item $SELF
		269
		270	Contains the current port id while executing C<rcv> callbacks or C<psub>
		271	blocks.
		272
		273	=item SELF, %SELF, @SELF...
		274
		275	Due to some quirks in how perl exports variables, it is impossible to
		276	just export C<$SELF>, all the symbols called C<SELF> are exported by this
		277	module, but only C<$SELF> is currently used.
		278
101	=item snd $portid, type => @data	279	=item snd $port, type => @data
102		280
103	=item snd $portid, @msg	281	=item snd $port, @msg
104		282
105	Send the given message to the given port ID, which can identify either	283	Send the given message to the given port ID, which can identify either
106	a local or a remote port, and can be either a string or soemthignt hat	284	a local or a remote port, and can be either a string or soemthignt hat
107	stringifies a sa port ID (such as a port object :).	285	stringifies a sa port ID (such as a port object :).
108		286
…		…
118	JSON is used, then only strings, numbers and arrays and hashes consisting	296	JSON is used, then only strings, numbers and arrays and hashes consisting
119	of those are allowed (no objects). When Storable is used, then anything	297	of those are allowed (no objects). When Storable is used, then anything
120	that Storable can serialise and deserialise is allowed, and for the local	298	that Storable can serialise and deserialise is allowed, and for the local
121	node, anything can be passed.	299	node, anything can be passed.
122		300
123	=item $local_port = create_port	301	=item $local_port = port
124		302
125	Create a new local port object. See the next section for allowed methods.	303	Create a new local port object and returns its port ID. Initially it has
		304	no callbacks set and will throw an error when it receives messages.
126		305
127	=cut	306	=item $local_port = port { my @msg = @_ }
128		307
129	sub create_port {	308	Creates a new local port, and returns its ID. Semantically the same as
130	my $id = "$AnyEvent::MP::Base::UNIQ." . $AnyEvent::MP::Base::ID++;	309	creating a port and calling C<rcv $port, $callback> on it.
131		310
132	my $self = bless {	311	The block will be called for every message received on the port, with the
133	id => "$NODE#$id",	312	global variable C<$SELF> set to the port ID. Runtime errors will cause the
134	names => [$id],	313	port to be C<kil>ed. The message will be passed as-is, no extra argument
135	}, "AnyEvent::MP::Port";	314	(i.e. no port ID) will be passed to the callback.
136		315
137	$AnyEvent::MP::Base::PORT{$id} = sub {	316	If you want to stop/destroy the port, simply C<kil> it:
138	unshift @_, $self;
139		317
140	for (@{ $self->{rc0}{$_[1]} }) {	318	my $port = port {
141	$_ && &{$_->[0]}	319	my @msg = @_;
142	&& undef $_;	320	...
		321	kil $SELF;
		322	};
		323
		324	=cut
		325
		326	sub rcv($@);
		327
		328	sub _kilme {
		329	die "received message on port without callback";
		330	}
		331
		332	sub port(;&) {
		333	my $id = "$UNIQ." . $ID++;
		334	my $port = "$NODE#$id";
		335
		336	rcv $port, shift \|\| \&_kilme;
		337
		338	$port
		339	}
		340
		341	=item rcv $local_port, $callback->(@msg)
		342
		343	Replaces the default callback on the specified port. There is no way to
		344	remove the default callback: use C<sub { }> to disable it, or better
		345	C<kil> the port when it is no longer needed.
		346
		347	The global C<$SELF> (exported by this module) contains C<$port> while
		348	executing the callback. Runtime errors during callback execution will
		349	result in the port being C<kil>ed.
		350
		351	The default callback received all messages not matched by a more specific
		352	C<tag> match.
		353
		354	=item rcv $local_port, tag => $callback->(@msg_without_tag), ...
		355
		356	Register callbacks to be called on messages starting with the given tag on
		357	the given port (and return the port), or unregister it (when C<$callback>
		358	is C<$undef>).
		359
		360	The original message will be passed to the callback, after the first
		361	element (the tag) has been removed. The callback will use the same
		362	environment as the default callback (see above).
		363
		364	Example: create a port and bind receivers on it in one go.
		365
		366	my $port = rcv port,
		367	msg1 => sub { ... },
		368	msg2 => sub { ... },
		369	;
		370
		371	Example: create a port, bind receivers and send it in a message elsewhere
		372	in one go:
		373
		374	snd $otherport, reply =>
		375	rcv port,
		376	msg1 => sub { ... },
		377	...
		378	;
		379
		380	=cut
		381
		382	sub rcv($@) {
		383	my $port = shift;
		384	my ($noderef, $portid) = split /#/, $port, 2;
		385
		386	($NODE{$noderef} \|\| add_node $noderef) == $NODE{""}
		387	or Carp::croak "$port: rcv can only be called on local ports, caught";
		388
		389	while (@_) {
		390	if (ref $_[0]) {
		391	if (my $self = $PORT_DATA{$portid}) {
		392	"AnyEvent::MP::Port" eq ref $self
		393	or Carp::croak "$port: rcv can only be called on message matching ports, caught";
		394
		395	$self->[2] = shift;
		396	} else {
		397	my $cb = shift;
		398	$PORT{$portid} = sub {
		399	local $SELF = $port;
		400	eval { &$cb }; _self_die if $@;
		401	};
		402	}
		403	} elsif (defined $_[0]) {
		404	my $self = $PORT_DATA{$portid} \|\|= do {
		405	my $self = bless [$PORT{$port} \|\| sub { }, { }, $port], "AnyEvent::MP::Port";
		406
		407	$PORT{$portid} = sub {
		408	local $SELF = $port;
		409
		410	if (my $cb = $self->[1]{$_[0]}) {
		411	shift;
		412	eval { &$cb }; _self_die if $@;
		413	} else {
		414	&{ $self->[0] };
		415	}
		416	};
		417
		418	$self
		419	};
		420
		421	"AnyEvent::MP::Port" eq ref $self
		422	or Carp::croak "$port: rcv can only be called on message matching ports, caught";
		423
		424	my ($tag, $cb) = splice @_, 0, 2;
		425
		426	if (defined $cb) {
		427	$self->[1]{$tag} = $cb;
		428	} else {
		429	delete $self->[1]{$tag};
		430	}
143	}	431	}
		432	}
144		433
145	for (@{ $self->{rcv}{$_[1]} }) {	434	$port
146	$_ && [@_[1 .. @{$_->[1]}]] ~~ $_->[1]	435	}
147	&& &{$_->[0]}	436
148	&& undef $_;	437	=item $closure = psub { BLOCK }
		438
		439	Remembers C<$SELF> and creates a closure out of the BLOCK. When the
		440	closure is executed, sets up the environment in the same way as in C<rcv>
		441	callbacks, i.e. runtime errors will cause the port to get C<kil>ed.
		442
		443	This is useful when you register callbacks from C<rcv> callbacks:
		444
		445	rcv delayed_reply => sub {
		446	my ($delay, @reply) = @_;
		447	my $timer = AE::timer $delay, 0, psub {
		448	snd @reply, $SELF;
		449	};
		450	};
		451
		452	=cut
		453
		454	sub psub(&) {
		455	my $cb = shift;
		456
		457	my $port = $SELF
		458	or Carp::croak "psub can only be called from within rcv or psub callbacks, not";
		459
		460	sub {
		461	local $SELF = $port;
		462
		463	if (wantarray) {
		464	my @res = eval { &$cb };
		465	_self_die if $@;
		466	@res
		467	} else {
		468	my $res = eval { &$cb };
		469	_self_die if $@;
		470	$res
149	}	471	}
150
151	for (@{ $self->{any} }) {
152	$_ && [@_[0 .. $#{$_->[1]}]] ~~ $_->[1]
153	&& &{$_->[0]}
154	&& undef $_;
155	}
156	};
157
158	$self
159	}
160
161	=item $portid = miniport { my @msg = @_; $finished }
162
163	Creates a "mini port", that is, a very lightweight port without any
164	pattern matching behind it, and returns its ID.
165
166	The block will be called for every message received on the port. When the
167	callback returns a true value its job is considered "done" and the port
168	will be destroyed. Otherwise it will stay alive.
169
170	The message will be passed as-is, no extra argument (i.e. no port id) will
171	be passed to the callback.
172
173	If you need the local port id in the callback, this works nicely:
174
175	my $port; $port = miniport {
176	snd $otherport, reply => $port;
177	};
178
179	=cut
180
181	sub miniport(&) {
182	my $cb = shift;
183	my $id = "$AnyEvent::MP::Base::UNIQ." . $AnyEvent::MP::Base::ID++;
184
185	$AnyEvent::MP::Base::PORT{$id} = sub {
186	&$cb
187	and delete $AnyEvent::MP::Base::PORT{$id};
188	};
189
190	"$NODE#$id"
191	}
192
193	package AnyEvent::MP::Port;
194
195	=back
196
197	=head1 METHODS FOR PORT OBJECTS
198
199	=over 4
200
201	=item "$port"
202
203	A port object stringifies to its port ID, so can be used directly for
204	C<snd> operations.
205
206	=cut
207
208	use overload
209	'""' => sub { $_[0]{id} },
210	fallback => 1;
211
212	=item $port->rcv (type => $callback->($port, @msg))
213
214	=item $port->rcv ($smartmatch => $callback->($port, @msg))
215
216	=item $port->rcv ([$smartmatch...] => $callback->($port, @msg))
217
218	Register a callback on the given port.
219
220	The callback has to return a true value when its work is done, after
221	which is will be removed, or a false value in which case it will stay
222	registered.
223
224	If the match is an array reference, then it will be matched against the
225	first elements of the message, otherwise only the first element is being
226	matched.
227
228	Any element in the match that is specified as C<_any_> (a function
229	exported by this module) matches any single element of the message.
230
231	While not required, it is highly recommended that the first matching
232	element is a string identifying the message. The one-string-only match is
233	also the most efficient match (by far).
234
235	=cut
236
237	sub rcv($@) {
238	my ($self, $match, $cb) = @_;
239
240	if (!ref $match) {
241	push @{ $self->{rc0}{$match} }, [$cb];
242	} elsif (("ARRAY" eq ref $match && !ref $match->[0])) {
243	my ($type, @match) = @$match;
244	@match
245	? push @{ $self->{rcv}{$match->[0]} }, [$cb, \@match]
246	: push @{ $self->{rc0}{$match->[0]} }, [$cb];
247	} else {
248	push @{ $self->{any} }, [$cb, $match];
249	}	472	}
250	}	473	}
251		474
252	=item $port->register ($name)	475	=item $guard = mon $port, $cb->(@reason)
253		476
254	Registers the given port under the well known name C<$name>. If the name	477	=item $guard = mon $port, $rcvport
255	already exists it is replaced.
256		478
257	A port can only be registered under one well known name.	479	=item $guard = mon $port
258		480
259	=cut	481	=item $guard = mon $port, $rcvport, @msg
260		482
261	sub register {	483	Monitor the given port and do something when the port is killed or
262	my ($self, $name) = @_;	484	messages to it were lost, and optionally return a guard that can be used
		485	to stop monitoring again.
263		486
264	$self->{wkname} = $name;	487	C<mon> effectively guarantees that, in the absence of hardware failures,
265	$AnyEvent::MP::Base::WKP{$name} = "$self";	488	that after starting the monitor, either all messages sent to the port
		489	will arrive, or the monitoring action will be invoked after possible
		490	message loss has been detected. No messages will be lost "in between"
		491	(after the first lost message no further messages will be received by the
		492	port). After the monitoring action was invoked, further messages might get
		493	delivered again.
		494
		495	In the first form (callback), the callback is simply called with any
		496	number of C<@reason> elements (no @reason means that the port was deleted
		497	"normally"). Note also that I<< the callback B<must> never die >>, so use
		498	C<eval> if unsure.
		499
		500	In the second form (another port given), the other port (C<$rcvport>)
		501	will be C<kil>'ed with C<@reason>, iff a @reason was specified, i.e. on
		502	"normal" kils nothing happens, while under all other conditions, the other
		503	port is killed with the same reason.
		504
		505	The third form (kill self) is the same as the second form, except that
		506	C<$rvport> defaults to C<$SELF>.
		507
		508	In the last form (message), a message of the form C<@msg, @reason> will be
		509	C<snd>.
		510
		511	As a rule of thumb, monitoring requests should always monitor a port from
		512	a local port (or callback). The reason is that kill messages might get
		513	lost, just like any other message. Another less obvious reason is that
		514	even monitoring requests can get lost (for exmaple, when the connection
		515	to the other node goes down permanently). When monitoring a port locally
		516	these problems do not exist.
		517
		518	Example: call a given callback when C<$port> is killed.
		519
		520	mon $port, sub { warn "port died because of <@_>\n" };
		521
		522	Example: kill ourselves when C<$port> is killed abnormally.
		523
		524	mon $port;
		525
		526	Example: send us a restart message when another C<$port> is killed.
		527
		528	mon $port, $self => "restart";
		529
		530	=cut
		531
		532	sub mon {
		533	my ($noderef, $port) = split /#/, shift, 2;
		534
		535	my $node = $NODE{$noderef} \|\| add_node $noderef;
		536
		537	my $cb = @_ ? shift : $SELF \|\| Carp::croak 'mon: called with one argument only, but $SELF not set,';
		538
		539	unless (ref $cb) {
		540	if (@_) {
		541	# send a kill info message
		542	my (@msg) = ($cb, @_);
		543	$cb = sub { snd @msg, @_ };
		544	} else {
		545	# simply kill other port
		546	my $port = $cb;
		547	$cb = sub { kil $port, @_ if @_ };
		548	}
		549	}
		550
		551	$node->monitor ($port, $cb);
		552
		553	defined wantarray
		554	and AnyEvent::Util::guard { $node->unmonitor ($port, $cb) }
266	}	555	}
267		556
268	=item $port->destroy	557	=item $guard = mon_guard $port, $ref, $ref...
269		558
270	Explicitly destroy/remove/nuke/vaporise the port.	559	Monitors the given C<$port> and keeps the passed references. When the port
		560	is killed, the references will be freed.
271		561
272	Ports are normally kept alive by there mere existance alone, and need to	562	Optionally returns a guard that will stop the monitoring.
273	be destroyed explicitly.
274		563
275	=cut	564	This function is useful when you create e.g. timers or other watchers and
		565	want to free them when the port gets killed:
276		566
277	sub destroy {	567	$port->rcv (start => sub {
278	my ($self) = @_;	568	my $timer; $timer = mon_guard $port, AE::timer 1, 1, sub {
		569	undef $timer if 0.9 < rand;
		570	});
		571	});
279		572
280	delete $AnyEvent::MP::Base::WKP{ $self->{wkname} };	573	=cut
281		574
282	delete $AnyEvent::MP::Base::PORT{$_}	575	sub mon_guard {
283	for @{ $self->{names} };	576	my ($port, @refs) = @_;
		577
		578	#TODO: mon-less form?
		579
		580	mon $port, sub { 0 && @refs }
284	}	581	}
285		582
286	=back	583	=item kil $port[, @reason]
287		584
288	=head1 FUNCTIONS FOR NODES	585	Kill the specified port with the given C<@reason>.
289		586
290	=over 4	587	If no C<@reason> is specified, then the port is killed "normally" (linked
		588	ports will not be kileld, or even notified).
291		589
292	=item mon $noderef, $callback->($noderef, $status, $)	590	Otherwise, linked ports get killed with the same reason (second form of
		591	C<mon>, see below).
293		592
294	Monitors the given noderef.	593	Runtime errors while evaluating C<rcv> callbacks or inside C<psub> blocks
		594	will be reported as reason C<< die => $@ >>.
295		595
296	=item become_public endpoint...	596	Transport/communication errors are reported as C<< transport_error =>
		597	$message >>.
297		598
298	Tells the node to become a public node, i.e. reachable from other nodes.
299
300	If no arguments are given, or the first argument is C<undef>, then
301	AnyEvent::MP tries to bind on port C<4040> on all IP addresses that the
302	local nodename resolves to.
303
304	Otherwise the first argument must be an array-reference with transport
305	endpoints ("ip:port", "hostname:port") or port numbers (in which case the
306	local nodename is used as hostname). The endpoints are all resolved and
307	will become the node reference.
308
309	=cut	599	=cut
		600
		601	=item $port = spawn $node, $initfunc[, @initdata]
		602
		603	Creates a port on the node C<$node> (which can also be a port ID, in which
		604	case it's the node where that port resides).
		605
		606	The port ID of the newly created port is return immediately, and it is
		607	permissible to immediately start sending messages or monitor the port.
		608
		609	After the port has been created, the init function is
		610	called. This function must be a fully-qualified function name
		611	(e.g. C<MyApp::Chat::Server::init>). To specify a function in the main
		612	program, use C<::name>.
		613
		614	If the function doesn't exist, then the node tries to C<require>
		615	the package, then the package above the package and so on (e.g.
		616	C<MyApp::Chat::Server>, C<MyApp::Chat>, C<MyApp>) until the function
		617	exists or it runs out of package names.
		618
		619	The init function is then called with the newly-created port as context
		620	object (C<$SELF>) and the C<@initdata> values as arguments.
		621
		622	A common idiom is to pass your own port, monitor the spawned port, and
		623	in the init function, monitor the original port. This two-way monitoring
		624	ensures that both ports get cleaned up when there is a problem.
		625
		626	Example: spawn a chat server port on C<$othernode>.
		627
		628	# this node, executed from within a port context:
		629	my $server = spawn $othernode, "MyApp::Chat::Server::connect", $SELF;
		630	mon $server;
		631
		632	# init function on C<$othernode>
		633	sub connect {
		634	my ($srcport) = @_;
		635
		636	mon $srcport;
		637
		638	rcv $SELF, sub {
		639	...
		640	};
		641	}
		642
		643	=cut
		644
		645	sub _spawn {
		646	my $port = shift;
		647	my $init = shift;
		648
		649	local $SELF = "$NODE#$port";
		650	eval {
		651	&{ load_func $init }
		652	};
		653	_self_die if $@;
		654	}
		655
		656	sub spawn(@) {
		657	my ($noderef, undef) = split /#/, shift, 2;
		658
		659	my $id = "$RUNIQ." . $ID++;
		660
		661	$_[0] =~ /::/
		662	or Carp::croak "spawn init function must be a fully-qualified name, caught";
		663
		664	($NODE{$noderef} \|\| add_node $noderef)
		665	->send (["", "AnyEvent::MP::_spawn" => $id, @_]);
		666
		667	"$noderef#$id"
		668	}
310		669
311	=back	670	=back
312		671
313	=head1 NODE MESSAGES	672	=head1 NODE MESSAGES
314		673
315	Nodes understand the following messages sent to them. Many of them take	674	Nodes understand the following messages sent to them. Many of them take
316	arguments called C<@reply>, which will simply be used to compose a reply	675	arguments called C<@reply>, which will simply be used to compose a reply
317	message - C<$reply[0]> is the port to reply to, C<$reply[1]> the type and	676	message - C<$reply[0]> is the port to reply to, C<$reply[1]> the type and
318	the remaining arguments are simply the message data.	677	the remaining arguments are simply the message data.
319		678
		679	While other messages exist, they are not public and subject to change.
		680
320	=over 4	681	=over 4
321		682
322	=cut	683	=cut
323		684
324	=item wkp => $name, @reply	685	=item lookup => $name, @reply
325		686
326	Replies with the port ID of the specified well-known port, or C<undef>.	687	Replies with the port ID of the specified well-known port, or C<undef>.
327		688
328	=item devnull => ...	689	=item devnull => ...
329		690
…		…
352	snd $NODE, time => $myport, timereply => 1, 2;	713	snd $NODE, time => $myport, timereply => 1, 2;
353	# => snd $myport, timereply => 1, 2, <time>	714	# => snd $myport, timereply => 1, 2, <time>
354		715
355	=back	716	=back
356		717
		718	=head1 AnyEvent::MP vs. Distributed Erlang
		719
		720	AnyEvent::MP got lots of its ideas from distributed Erlang (Erlang node
		721	== aemp node, Erlang process == aemp port), so many of the documents and
		722	programming techniques employed by Erlang apply to AnyEvent::MP. Here is a
		723	sample:
		724
		725	http://www.Erlang.se/doc/programming_rules.shtml
		726	http://Erlang.org/doc/getting_started/part_frame.html # chapters 3 and 4
		727	http://Erlang.org/download/Erlang-book-part1.pdf # chapters 5 and 6
		728	http://Erlang.org/download/armstrong_thesis_2003.pdf # chapters 4 and 5
		729
		730	Despite the similarities, there are also some important differences:
		731
		732	=over 4
		733
		734	=item * Node references contain the recipe on how to contact them.
		735
		736	Erlang relies on special naming and DNS to work everywhere in the
		737	same way. AEMP relies on each node knowing it's own address(es), with
		738	convenience functionality.
		739
		740	This means that AEMP requires a less tightly controlled environment at the
		741	cost of longer node references and a slightly higher management overhead.
		742
		743	=item * Erlang uses processes and a mailbox, AEMP does not queue.
		744
		745	Erlang uses processes that selctively receive messages, and therefore
		746	needs a queue. AEMP is event based, queuing messages would serve no useful
		747	purpose.
		748
		749	(But see L<Coro::MP> for a more Erlang-like process model on top of AEMP).
		750
		751	=item * Erlang sends are synchronous, AEMP sends are asynchronous.
		752
		753	Sending messages in Erlang is synchronous and blocks the process. AEMP
		754	sends are immediate, connection establishment is handled in the
		755	background.
		756
		757	=item * Erlang can silently lose messages, AEMP cannot.
		758
		759	Erlang makes few guarantees on messages delivery - messages can get lost
		760	without any of the processes realising it (i.e. you send messages a, b,
		761	and c, and the other side only receives messages a and c).
		762
		763	AEMP guarantees correct ordering, and the guarantee that there are no
		764	holes in the message sequence.
		765
		766	=item * In Erlang, processes can be declared dead and later be found to be
		767	alive.
		768
		769	In Erlang it can happen that a monitored process is declared dead and
		770	linked processes get killed, but later it turns out that the process is
		771	still alive - and can receive messages.
		772
		773	In AEMP, when port monitoring detects a port as dead, then that port will
		774	eventually be killed - it cannot happen that a node detects a port as dead
		775	and then later sends messages to it, finding it is still alive.
		776
		777	=item * Erlang can send messages to the wrong port, AEMP does not.
		778
		779	In Erlang it is quite possible that a node that restarts reuses a process
		780	ID known to other nodes for a completely different process, causing
		781	messages destined for that process to end up in an unrelated process.
		782
		783	AEMP never reuses port IDs, so old messages or old port IDs floating
		784	around in the network will not be sent to an unrelated port.
		785
		786	=item * Erlang uses unprotected connections, AEMP uses secure
		787	authentication and can use TLS.
		788
		789	AEMP can use a proven protocol - SSL/TLS - to protect connections and
		790	securely authenticate nodes.
		791
		792	=item * The AEMP protocol is optimised for both text-based and binary
		793	communications.
		794
		795	The AEMP protocol, unlike the Erlang protocol, supports both
		796	language-independent text-only protocols (good for debugging) and binary,
		797	language-specific serialisers (e.g. Storable).
		798
		799	It has also been carefully designed to be implementable in other languages
		800	with a minimum of work while gracefully degrading fucntionality to make the
		801	protocol simple.
		802
		803	=item * AEMP has more flexible monitoring options than Erlang.
		804
		805	In Erlang, you can chose to receive I<all> exit signals as messages
		806	or I<none>, there is no in-between, so monitoring single processes is
		807	difficult to implement. Monitoring in AEMP is more flexible than in
		808	Erlang, as one can choose between automatic kill, exit message or callback
		809	on a per-process basis.
		810
		811	=item * Erlang tries to hide remote/local connections, AEMP does not.
		812
		813	Monitoring in Erlang is not an indicator of process death/crashes,
		814	as linking is (except linking is unreliable in Erlang).
		815
		816	In AEMP, you don't "look up" registered port names or send to named ports
		817	that might or might not be persistent. Instead, you normally spawn a port
		818	on the remote node. The init function monitors the you, and you monitor
		819	the remote port. Since both monitors are local to the node, they are much
		820	more reliable.
		821
		822	This also saves round-trips and avoids sending messages to the wrong port
		823	(hard to do in Erlang).
		824
		825	=back
		826
		827	=head1 RATIONALE
		828
		829	=over 4
		830
		831	=item Why strings for ports and noderefs, why not objects?
		832
		833	We considered "objects", but found that the actual number of methods
		834	thatc an be called are very low. Since port IDs and noderefs travel over
		835	the network frequently, the serialising/deserialising would add lots of
		836	overhead, as well as having to keep a proxy object.
		837
		838	Strings can easily be printed, easily serialised etc. and need no special
		839	procedures to be "valid".
		840
		841	And a a miniport consists of a single closure stored in a global hash - it
		842	can't become much cheaper.
		843
		844	=item Why favour JSON, why not real serialising format such as Storable?
		845
		846	In fact, any AnyEvent::MP node will happily accept Storable as framing
		847	format, but currently there is no way to make a node use Storable by
		848	default.
		849
		850	The default framing protocol is JSON because a) JSON::XS is many times
		851	faster for small messages and b) most importantly, after years of
		852	experience we found that object serialisation is causing more problems
		853	than it gains: Just like function calls, objects simply do not travel
		854	easily over the network, mostly because they will always be a copy, so you
		855	always have to re-think your design.
		856
		857	Keeping your messages simple, concentrating on data structures rather than
		858	objects, will keep your messages clean, tidy and efficient.
		859
		860	=back
		861
357	=head1 SEE ALSO	862	=head1 SEE ALSO
358		863
359	L<AnyEvent>.	864	L<AnyEvent>.
360		865
361	=head1 AUTHOR	866	=head1 AUTHOR

Diff Legend

-–
+Removed lines
-+
+Added lines
-<
+Changed lines
->
+Changed lines

Comparing AnyEvent-MP/MP.pm (file contents): Revision 1.17 by root, Mon Aug 3 08:35:40 2009 UTC vs. Revision 1.50 by root, Fri Aug 14 14:01:05 2009 UTC

Diff Legend

Comparing AnyEvent-MP/MP.pm (file contents):
Revision 1.17 by root, Mon Aug 3 08:35:40 2009 UTC vs.
Revision 1.50 by root, Fri Aug 14 14:01:05 2009 UTC