--- AnyEvent-MP/MP.pm	2009/08/04 22:05:43	1.26
+++ AnyEvent-MP/MP.pm	2009/08/05 19:58:46	1.32
@@ -45,31 +45,43 @@
 
 =item port
 
-A port is something you can send messages to with the C<snd> function, and
-you can register C<rcv> handlers with. All C<rcv> handlers will receive
-messages they match, messages will not be queued.
+A port is something you can send messages to (with the C<snd> function).
+
+Some ports allow you to register C<rcv> handlers that can match specific
+messages. All C<rcv> handlers will receive messages they match; messages
+will not be queued.
 
 =item port id - C<noderef#portname>
 
-A port id is always the noderef, a hash-mark (C<#>) as separator, followed
-by a port name (a printable string of unspecified format).
+A port id is normally the concatenation of a noderef, a hash-mark (C<#>) as
+separator, and a port name (a printable string of unspecified format). An
+exception is the node port, whose ID is identical to its node
+reference.
 
 =item node
 
 A node is a single process containing at least one port - the node
-port. You can send messages to node ports to let them create new ports,
-among other things.
+port. You can send messages to node ports to find existing ports or to
+create new ports, among other things.
 
-Initially, nodes are either private (single-process only) or hidden
-(connected to a master node only). Only when they epxlicitly "become
-public" can you send them messages from unrelated other nodes.
+Nodes are either private (single-process only), slaves (connected to a
+master node only) or public nodes (connectable from unrelated nodes).
 
 =item noderef - C<host:port,host:port...>, C<id@noderef>, C<id>
 
-A noderef is a string that either uniquely identifies a given node (for
-private and hidden nodes), or contains a recipe on how to reach a given
+A node reference is a string that either simply identifies the node (for
+private and slave nodes), or contains a recipe on how to reach a given
 node (for public nodes).
 
+This recipe is simply a comma-separated list of C<address:port> pairs (for
+TCP/IP, other protocols might look different).
+
+Node references come in two flavours: resolved (containing only numerical
+addresses) or unresolved (where hostnames are used instead of addresses).
+
+Before using an unresolved node reference in a message you first have to
+resolve it.
+
 =back
 
 =head1 VARIABLES/FUNCTIONS
@@ -93,7 +105,7 @@
 our $VERSION = '0.1';
 our @EXPORT = qw(
    NODE $NODE *SELF node_of _any_
-   become_slave become_public
+   resolve_node initialise_node
    snd rcv mon kil reg psub
    port
 );
@@ -117,6 +129,35 @@
 
 Extracts and returns the noderef from a portid or a noderef.
 
+=item $cv = resolve_node $noderef
+
+Takes an unresolved node reference that may contain hostnames and
+abbreviated IDs, resolves all of them and returns a resolved node
+reference.
+
+In addition to C<address:port> pairs allowed in resolved noderefs, the
+following forms are supported:
+
+=over 4
+
+=item the empty string
+
+An empty-string component gets resolved as if the default port (4040) was
+specified.
+
+=item naked port numbers (e.g. C<1234>)
+
+These are resolved by prepending the local nodename and a colon, to be
+further resolved.
+
+=item hostnames (e.g. C<localhost:1234>, C<localhost>)
+
+These are resolved by using AnyEvent::DNS to resolve them, optionally
+looking up SRV records for the C<aemp=4040> port, if no port was
+specified.
+
+=back
+
 =item $SELF
 
 Contains the current port id while executing C<rcv> callbacks or C<psub>
@@ -150,116 +191,11 @@
 that Storable can serialise and deserialise is allowed, and for the local
 node, anything can be passed.
 
-=item kil $portid[, @reason]
-
-Kill the specified port with the given C<@reason>.
-
-If no C<@reason> is specified, then the port is killed "normally" (linked
-ports will not be kileld, or even notified).
-
-Otherwise, linked ports get killed with the same reason (second form of
-C<mon>, see below).
-
-Runtime errors while evaluating C<rcv> callbacks or inside C<psub> blocks
-will be reported as reason C<< die => $@ >>.
-
-Transport/communication errors are reported as C<< transport_error =>
-$message >>.
-
-=item $guard = mon $portid, $cb->(@reason)
-
-=item $guard = mon $portid, $otherport
-
-=item $guard = mon $portid, $otherport, @msg
-
-Monitor the given port and do something when the port is killed.
-
-In the first form, the callback is simply called with any number
-of C<@reason> elements (no @reason means that the port was deleted
-"normally"). Note also that I<< the callback B<must> never die >>, so use
-C<eval> if unsure.
-
-In the second form, the other port will be C<kil>'ed with C<@reason>, iff
-a @reason was specified, i.e. on "normal" kils nothing happens, while
-under all other conditions, the other port is killed with the same reason.
-
-In the last form, a message of the form C<@msg, @reason> will be C<snd>.
-
-Example: call a given callback when C<$port> is killed.
-
-   mon $port, sub { warn "port died because of <@_>\n" };
-
-Example: kill ourselves when C<$port> is killed abnormally.
-
-   mon $port, $self;
-
-Example: send us a restart message another C<$port> is killed.
-
-   mon $port, $self => "restart";
-
-=cut
-
-sub mon {
-   my ($noderef, $port, $cb) = ((split /#/, shift, 2), shift);
-
-   my $node = $NODE{$noderef} || add_node $noderef;
-
-   #TODO: ports must not be references
-   if (!ref $cb or "AnyEvent::MP::Port" eq ref $cb) {
-      if (@_) {
-         # send a kill info message
-         my (@msg) = ($cb, @_);
-         $cb = sub { snd @msg, @_ };
-      } else {
-         # simply kill other port
-         my $port = $cb;
-         $cb = sub { kil $port, @_ if @_ };
-      }
-   }
-
-   $node->monitor ($port, $cb);
-
-   defined wantarray
-      and AnyEvent::Util::guard { $node->unmonitor ($port, $cb) }
-}
-
-=item $guard = mon_guard $port, $ref, $ref...
-
-Monitors the given C<$port> and keeps the passed references. When the port
-is killed, the references will be freed.
-
-Optionally returns a guard that will stop the monitoring.
-
-This function is useful when you create e.g. timers or other watchers and
-want to free them when the port gets killed:
-
-  $port->rcv (start => sub {
-     my $timer; $timer = mon_guard $port, AE::timer 1, 1, sub {
-        undef $timer if 0.9 < rand;
-     });
-  });
-
-=cut
-
-sub mon_guard {
-   my ($port, @refs) = @_;
-
-   mon $port, sub { 0 && @refs }
-}
-
-=item lnk $port1, $port2
-
-Link two ports. This is simply a shorthand for:
-
-   mon $port1, $port2;
-   mon $port2, $port1;
-
-It means that if either one is killed abnormally, the other one gets
-killed as well.
-
 =item $local_port = port
 
-Create a new local port object that supports message matching.
+Create a new local port object that can be used either as a pattern
+matching port ("full port") or a single-callback port ("miniport"),
+depending on how C<rcv> callbacks are bound to the object.
 
 =item $portid = port { my @msg = @_; $finished }
 
@@ -275,7 +211,7 @@
 
 If you need the local port id in the callback, this works nicely:
 
-   my $port; $port = miniport {
+   my $port; $port = port {
       snd $otherport, reply => $port;
    };
 
@@ -346,13 +282,19 @@
    $REG{$name} = $portid;
 }
 
+=item rcv $portid, $callback->(@msg)
+
+Replaces the callback on the specified miniport (or newly created port
+object, see C<port>). Full ports are configured with the following calls:
+
 =item rcv $portid, tagstring        => $callback->(@msg), ...
 
 =item rcv $portid, $smartmatch      => $callback->(@msg), ...
 
 =item rcv $portid, [$smartmatch...] => $callback->(@msg), ...
 
-Register callbacks to be called on matching messages on the given port.
+Register callbacks to be called on matching messages on the given full
+port (or newly created port).
 
 The callback has to return a true value when its work is done, after
 which is will be removed, or a false value in which case it will stay
@@ -378,7 +320,8 @@
 =cut
 
 sub rcv($@) {
-   my ($noderef, $port) = split /#/, shift, 2;
+   my $portid = shift;
+   my ($noderef, $port) = split /#/, $portid, 2;
 
    ($NODE{$noderef} || add_node $noderef) == $NODE{""}
       or Carp::croak "$noderef#$port: rcv can only be called on local ports, caught";
@@ -403,6 +346,8 @@
          push @{ $self->{any}              }, [$cb, $match];
       }
    }
+
+   $portid
 }
 
 =item $closure = psub { BLOCK }
@@ -443,24 +388,130 @@
    }
 }
 
+=item $guard = mon $portid, $cb->(@reason)
+
+=item $guard = mon $portid, $otherport
+
+=item $guard = mon $portid, $otherport, @msg
+
+Monitor the given port and do something when the port is killed.
+
+In the first form, the callback is simply called with any number
+of C<@reason> elements (no @reason means that the port was deleted
+"normally"). Note also that I<< the callback B<must> never die >>, so use
+C<eval> if unsure.
+
+In the second form, the other port will be C<kil>'ed with C<@reason>, iff
+a @reason was specified, i.e. on "normal" kils nothing happens, while
+under all other conditions, the other port is killed with the same reason.
+
+In the last form, a message of the form C<@msg, @reason> will be C<snd>.
+
+Example: call a given callback when C<$port> is killed.
+
+   mon $port, sub { warn "port died because of <@_>\n" };
+
+Example: kill ourselves when C<$port> is killed abnormally.
+
+   mon $port, $self;
+
+Example: send us a restart message when another C<$port> is killed.
+
+   mon $port, $self => "restart";
+
+=cut
+
+sub mon {
+   my ($noderef, $port) = split /#/, shift, 2;
+
+   my $node = $NODE{$noderef} || add_node $noderef;
+
+   my $cb = shift;
+
+   unless (ref $cb) {
+      if (@_) {
+         # send a kill info message
+         my (@msg) = ($cb, @_);
+         $cb = sub { snd @msg, @_ };
+      } else {
+         # simply kill other port
+         my $port = $cb;
+         $cb = sub { kil $port, @_ if @_ };
+      }
+   }
+
+   $node->monitor ($port, $cb);
+
+   defined wantarray
+      and AnyEvent::Util::guard { $node->unmonitor ($port, $cb) }
+}
+
+=item $guard = mon_guard $port, $ref, $ref...
+
+Monitors the given C<$port> and keeps the passed references. When the port
+is killed, the references will be freed.
+
+Optionally returns a guard that will stop the monitoring.
+
+This function is useful when you create e.g. timers or other watchers and
+want to free them when the port gets killed:
+
+  $port->rcv (start => sub {
+     my $timer; $timer = mon_guard $port, AE::timer 1, 1, sub {
+        undef $timer if 0.9 < rand;
+     });
+  });
+
+=cut
+
+sub mon_guard {
+   my ($port, @refs) = @_;
+
+   mon $port, sub { 0 && @refs }
+}
+
+=item lnk $port1, $port2
+
+Link two ports. This is simply a shorthand for:
+
+   mon $port1, $port2;
+   mon $port2, $port1;
+
+It means that if either one is killed abnormally, the other one gets
+killed as well.
+
+=item kil $portid[, @reason]
+
+Kill the specified port with the given C<@reason>.
+
+If no C<@reason> is specified, then the port is killed "normally" (linked
+ports will not be killed, or even notified).
+
+Otherwise, linked ports get killed with the same reason (second form of
+C<mon>, see above).
+
+Runtime errors while evaluating C<rcv> callbacks or inside C<psub> blocks
+will be reported as reason C<< die => $@ >>.
+
+Transport/communication errors are reported as C<< transport_error =>
+$message >>.
+
 =back
 
 =head1 FUNCTIONS FOR NODES
 
 =over 4
 
-=item become_public endpoint...
+=item become_public $noderef
 
 Tells the node to become a public node, i.e. reachable from other nodes.
 
-If no arguments are given, or the first argument is C<undef>, then
-AnyEvent::MP tries to bind on port C<4040> on all IP addresses that the
-local nodename resolves to.
-
-Otherwise the first argument must be an array-reference with transport
-endpoints ("ip:port", "hostname:port") or port numbers (in which case the
-local nodename is used as hostname). The endpoints are all resolved and
-will become the node reference.
+The first argument is the (unresolved) node reference of the local node
+(if missing then the empty string is used).
+
+It is quite common to not specify anything, in which case the local node
+tries to listen on the default port, or to only specify a port number, in
+which case AnyEvent::MP tries to guess the local addresses.
 
 =cut
 
@@ -473,6 +524,8 @@
 message - C<$reply[0]> is the port to reply to, C<$reply[1]> the type and
 the remaining arguments are simply the message data.
 
+While other messages exist, they are not public and subject to change.
+
 =over 4
 
 =cut
@@ -512,9 +565,17 @@
 
 =head1 AnyEvent::MP vs. Distributed Erlang
 
-AnyEvent::MP got lots of its ideas from distributed erlang. Despite the
-similarities (erlang node == aemp node, erlang process == aemp port and so
-on), there are also some important differences:
+AnyEvent::MP got lots of its ideas from distributed Erlang (Erlang node
+== aemp node, Erlang process == aemp port), so many of the documents and
+programming techniques employed by Erlang apply to AnyEvent::MP. Here is a
+sample:
+
+   http://www.erlang.se/doc/programming_rules.shtml
+   http://erlang.org/doc/getting_started/part_frame.html # chapters 3 and 4
+   http://erlang.org/download/erlang-book-part1.pdf      # chapters 5 and 6
+   http://erlang.org/download/armstrong_thesis_2003.pdf  # chapters 4 and 5
+
+Despite the similarities, there are also some important differences:
 
 =over 4
 
@@ -524,6 +585,9 @@
 same way. AEMP relies on each node knowing it's own address(es), with
 convenience functionality.
 
+This means that AEMP requires a less tightly controlled environment at the
+cost of longer node references and a slightly higher management overhead.
+
 =item * Erlang uses processes and a mailbox, AEMP does not queue.
 
 Erlang uses processes that selctively receive messages, and therefore
@@ -573,6 +637,17 @@
 AEMP can use a proven protocol - SSL/TLS - to protect connections and
 securely authenticate nodes.
 
+=item * The AEMP protocol is optimised for both text-based and binary
+communications.
+
+The AEMP protocol, unlike the Erlang protocol, supports both
+language-independent text-only protocols (good for debugging) and binary,
+language-specific serialisers (e.g. Storable).
+
+It has also been carefully designed to be implementable in other languages
+with a minimum of work while gracefully degrading functionality to make the
+protocol simple.
+
 =back
 
 =head1 SEE ALSO